Innovation Information Initiative Technical Working Group Meeting

Matt Marx, Organizer

December 6-7, 2024

Royal Sonesta Hotel
Longfellow Room, 40 Edwin H. Land Blvd.
Cambridge, MA, Zoom

Conference Code of Conduct

Friday, December 6
4:00 pm
Welcome and introductions, Matt Marx
4:15 pm
New Ventures, Products, and Tools
Amir Sariri, Purdue University
Avi Goldfarb, University of Toronto and NBER

Database, Methodological Tools, and Research Opportunities: Creative Destruction Lab and Early-Stage Technology Ventures
Abhiroop Mukherjee, Hong Kong University of Science and Technology
Bruno Pellegrino, Columbia University
Alminas Zaldokas, National University of Singapore
Yiman Ren, University of Michigan
Tomas Thornquist, Shell Street Labs

New Products
Pierre Pelletier, University of Strasbourg
Kevin Wirtz, University of Strasbourg

Novelpy: A Python Package to Measure Novelty and Disruptiveness of Bibliometric and Patent Data
5:45 pm
Introducing the I3 BigQuery Data Repository
Dror Shvadron, University of Toronto
6:00 pm
Adjourn
6:15 pm
Reception and Group Dinner - Somerset Room
Saturday, December 7
8:30 am
Continental Breakfast
9:00 am
Building Datasets with LLMs
Victor Lyonnet, University of Michigan
Amin Shams, The Ohio State University
Shaojun Zhang, The Ohio State University

LLM-based Topic Modeling
Yuan Sun, Shanghai University of Finance and Economics
Xuan Tian, Tsinghua University
Yuanchen Yang, Tsinghua University

A Robust Green Patent Database: A New Dataset of Green Patents Through Large Language Models
Maya Durvasula, Stanford University
Sabri Eyuboglu, Stanford University
David Micha. Ritzwoller, Stanford University

Distilling Data from Large Language Models: An Application to Research Productivity Measurement
10:30 am
Break
10:45 am
Comparative NLP Methods
Michael E. Rose, Max Planck Institute for Innovation and Competition
Erik Buunk, Max Planck Institute for Innovation and Competition
Sebastian Erhardt, Max Planck Institute for Innovation and Competition
Cheng Li, Max Planck Institute for Innovation and Competition
Mainak Ghosh, Max Planck Institute for Innovation and Competition
Dietmar Harhoff, Max Planck Institute for Innovation and Competition

Tracing the Flow of Knowledge From Science to Technology Using Deep Learning (slides)
Ina Ganguli, University of Massachusetts Amherst and NBER
Jeffrey Lin, Federal Reserve Bank of Philadelphia
Vitaly Meursault, Federal Reserve Bank of Philadelphia
Nicholas F. Reynolds, University of Essex

Patent Text and Long-Run Innovation Dynamics: The Critical Role of Model Selection
11:45 am
Global Patent Data
Nishant Chadha, Indian School of Business
Satyaki Chakravarty, Universita Cattolica del Sacro Cuore
Piyasha Majumdar, India Development Foundation

A New Database of Indian Patents (slides)
Josh Lerner, Harvard University and NBER
Namrata Narain, Harvard University
Dimitris Papanikolaou, Northwestern University and NBER
Amit Seru, Stanford University and NBER

Creating the China Patent Dataset
Zenne Hellinga, Utrecht University
Jay Praka. Nagar, Duke University
Stefano Breschi, Bocconi University
Andrea Morrison, University of Pavia
Gianluca Tarasconi, IPQuants

A Novel Dataset for Historical Innovation Studies: Linking USPTO Patents and US Census Data from 1850 to 1940
1:15 pm
Lunch - Room Skyline ABC
2:00 pm
Adjourn