Data Privacy Protection and the Conduct of Applied Research: Methods, Approaches and their Consequences
Ruobin Gong, V. Joseph Hotz, and Ian M. Schmutte, Organizers
May 16-17, 2024
Supported by Alfred P. Sloan Foundation
InterContinental Washington D.C. - The Wharf, 801 Wharf Street SW in Washington D.C.
Thursday, May 16 | ||
Continental Breakfast | ||
9:00 am | Survey Methodologies | |
Jeremy Seeman, University of Michigan Yajuan Si, University of Michigan Jerome P. Reiter, Duke University Differentially Private Population Quantity Estimates via Survey Weight Regularization |
||
Jörg Drechsler, Institute for Employment Research James Bailie, Harvard University Whose Data Is It Anyway? Flavors of Differential Privacy for Survey Sampling |
||
Evan S. Totty, U.S. Census Bureau Thor Watson, U.S. Census Bureau Statistical Disclosure Limitation and Total Survey Error |
||
10:15 am | Break | |
10:45 am | Synthetic Data and Alternatives | |
Minsun Riddles, Westat Wan-Ying Chang, National Science Foundation Angela Chen, Westat Robyn Ferg, Westat Thomas Krenzke, Westat Lin Li, Westat Natalie Shlomo, University of Manchester Medha Uppala, Westat Select Synthetic Data Generation Methods Toward a Public Use File for a Longitudinal Survey |
||
Marcel Neunhoeffer, Institute for Employment Research Jörg Drechsler, Institute for Employment Research Jonathan P. Latner, Institute for Employment Research On the Formal Privacy Guarantees of Synthetic Data (Generated without formal Privacy Guarantees) |
||
Lars Vilhuber, Cornell University Using Containers to Validate Research on Confidential Data |
||
Amy O'Hara, Georgetown University Barry Johnson, Internal Revenue Service Paul Arnsberger, Internal Revenue Service Stephanie Straus, Georgetown University Ron Borzekowski, Yale University Exploring Improved Access to Individual Income Tax Data for Evidence-Building |
||
12:25 pm | Lunch | |
1:40 pm | Privacy Compliance | |
Ryan Steed, Carnegie Mellon University Alessandro Acquisti, Carnegie Mellon University Adoption of 'Privacy-Preserving' Analytics: Drivers, Designs, & Decoupling |
||
Bernhard Ganglmair, ZEW Mannheim Julia K. Krämer, Erasmus University Rotterdam Jacopo Gambato, ZEW Mannheim Regulatory Compliance with Limited Enforceability: Evidence from Privacy Policies |
||
2:30 pm | Break | |
3:00 pm | Discussion Session I - Data Governance | |
Pierre Desrochers, Statistics Canada Eric Rancourt, Statistics Canada Navigating the Privacy Landscape: Harmonizing Legislative and Public Sector Approaches in the Canadian Context |
||
Michael B. Hawes, U.S. Census Bureau Rolando A. Rodriguez, U.S. Census Bureau Matthew Spence, U.S. Census Bureau Towards a Principled Discussion about Disclosure Avoidance: Identifying the Characteristics of an Ideal, Applied Disclosure Avoidance System |
||
Ramon Abraham A. Sarmiento, Bangko Sentral ng Pilipinas The Realities of Disclosure Risks in the Age of Dark Patterns and Big Data |
||
Fernando Stipanicic, University of Oslo Abhishek Nagaraj, University of California, Berkeley and NBER Matteo Tranchero, University of Pennsylvania U.S. Census Data in Economic Research: Adoption, Diffusion and Impact |
||
5:00 pm | Adjourn/Reception | |
6:00 pm | Dinner | |
Friday, May 17 | ||
8:30 am | Continental Breakfast | |
9:00 am | Formal Privacy Techniques | |
Shurong Lin, Boston University Elliot Paquette, McGill University Eric Kolaczyk, McGill University Differentially Private Linear Regression with Linked Data |
||
Zeki R. Kazan, Duke University Jerome P. Reiter, Duke University Bayesian Inference Under Differential Privacy: Prior Selection Considerations with Application to Univariate Gaussian Data and Regression |
||
Aaron R. Williams, Urban Institute Andrés F. Barrientos, Florida State University Joshua Snoke, RAND Corporation Claire Bowen, Urban Institute Benchmarking DP Linear Regression Methods for Statistical Inference |
||
10:15 am | Break | |
10:45 am | Utility and Privacy in Design | |
Nathan Yoder, University of Georgia Ian M. Schmutte, University of Georgia Information Design for Differential Privacy |
||
Elan A. Segarra, U.S. Bureau of Labor Statistics Estimating Preferences Over Data to Inform Statistical Disclosure Control Decisions |
||
Kevin He, University of Pennsylvania Federico Echenique, University of California, Berkeley Screening P-Hackers: Dissemination Noise as Bait |
||
12:00 n | Lunch | |
1:00 pm | Discussion Session II | |
Trivellore Raghunathan, University of Michigan V. Joseph Hotz, Duke University and NBER A Roadmap for Disclosure Avoidance in the Survey of Income and Program Participation SIPP |
||
Jerome P. Reiter, Duke University Jennifer Park, National Academies of Science, Engineering, and Medicine, Committee on National Statistics Toward a 21st Century National Data Infrastructure: Managing Privacy and Confidentiality Risks with Blended Data |
||
Report |
||
Open Discussion | ||
2:30 pm | Adjourn |