Browse or search publications from faculty affiliated with the lab.
Presidential Address: The Economist as Designer in the Innovation Process for Socially Impactful Digital Products
This paper provides an economic perspective on data-driven innovation in digital products, focusing on the role of complex experiments in measuring and improving social impact. The discussion highlights how tools and insights from economics…
Targeted Treatment Assignment Using Data from Randomized Experiments with Noncompliance
This paper studies the estimation and evaluation of targeted treatment assignment policies, where a data-driven prioritization rule is used to determine to whom the treatment is either offered or delivered. We consider a setting where the data…
Machine Learning Who to Nudge: Causal vs Predictive Targeting in a Field Experiment on Student Financial Aid Renewal
In many settings, interventions may be more effective for some individuals than others, so that targeting interventions may be beneficial. We analyze the value of targeting in the context of a large-scale field experiment with over 53,000 college…
The Surrogate Index: Combining Short-Term Proxies to Estimate Long-Term Treatment Effects More Rapidly and Precisely
A common challenge in estimating the long-term impacts of treatments (e.g., job training programs) is that the outcomes of interest (e.g., lifetime earnings) are observed with a long delay. We address this problem by combining several short-term…
On Synthetic Difference-in-Differences and Related Estimation Methods in Stata
In this article, we describe a computational implementation of the synthetic difference-in-differences (SDID) estimator of Arkhangelsky et al. (2021, American Economic Review 111: 4088-4118) for Stata. SDID can be used in many…
Choosing the “Right” Default Donation Amounts for Each Donor to Balance Multiple Fundraising Objectives
This report describes insights gleaned from the Data Fellows collaboration between PayPal and the Golub Capital Social Impact Lab at Stanford University’s Graduate School of Business. By embedding researchers in PayPal’s charitable giving team,…
Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects
There are a number of available methods for selecting whom to prioritize for treatment, including ones based on treatment effect estimation, risk scoring, and hand-crafted rules. We propose rank-weighted average treatment effect (RATE) metrics as…
Federated Offline Policy Learning
We consider the problem of learning personalized decision policies from observational bandit feedback data across multiple heterogeneous data sources. In our approach, we introduce a novel regret analysis that establishes finite-sample upper…
Qini Curves for Multi-Armed Treatment Rules
Qini curves have emerged as an attractive and popular approach for evaluating the benefit of data-driven targeting rules for treatment allocation. We propose a generalization of the Qini curve to multiple costly treatment arms that quantifies the…
Qini Curves for Multi-Armed Treatment Rules
Qini curves have emerged as an attractive and popular approach for evaluating the benefit of data-driven targeting rules for treatment allocation. We propose a generalization of the Qini curve to multiple costly treatment arms that quantifies the…
Service Quality on Online Platforms: Empirical Evidence about Driving Quality at Uber
Forthcoming in Management Science
Online marketplaces have adopted new quality control mechanisms that can accommodate a flexible pool of providers. In the context of ride-hailing, we measure the effectiveness of these mechanisms…
Heterogeneous Effects of Medicaid Coverage on Cardiovascular Risk Factors: Secondary Analysis of Randomized Controlled Trial
Objectives: To investigate whether health insurance generated improvements in cardiovascular risk factors (blood pressure and hemoglobin A(1c) (HbA(1c)) levels) for identifiable subpopulations, and using machine learning to…
Estimating Wage Disparities Using Foundation Models
One thread of empirical work in social science focuses on decomposing group differences in outcomes into unexplained components and components explained by observable factors. In this paper, we study gender wage decompositions, which require…
Service Quality in the Gig Economy: Empirical Evidence about Driving Quality at Uber
The rise of marketplaces for goods and services has led to changes in the mechanisms used to ensure high quality. We analyze this phenomenon in the Uber market, where the system of pre-screening that prevailed in the taxi industry has been…
Policy Learning with Adaptively Collected Data
In a wide variety of applications, including healthcare, bidding in first price auctions, digital recommendations, and online education, it can be beneficial to learn a policy that assigns treatments to individuals based on their characteristics…
LABOR-LLM: Language-Based Occupational Representations with Large Language Models
Many empirical studies of labor market questions rely on estimating relatively simple predictive models using small, carefully constructed longitudinal survey datasets based on hand-engineered features. Large Language Models (LLMs), trained on…
Data-driven Error Estimation: Upper Bounding Multiple Errors with No Technical Debt
We formulate the problem of constructing multiple simultaneously valid confidence intervals (CIs) as estimating a high probability upper bound on the maximum error for a class/set of estimate-estimand-error tuples, and refer to this as the error…
Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective
Model selection in supervised learning provides costless guarantees as if the model that best balances bias and variance was known a priori. We study the feasibility of similar guarantees for cumulative regret minimization in the stochastic…
The Heterogeneous Impact of Changes in Default Gift Amounts on Fundraising
When choosing whether and how much to donate, potential donors often observe a set of default donation amounts known as an “ask string.” In an experiment with more than 400,000 PayPal users, we replace a relatively unused donation amount ($75) on…
The Value of Non-traditional Credentials in the Labor Market
This study investigates the labor market value of credentials obtained from Massive Open Online Courses (MOOCs) and shared on business networking platforms. We conducted a randomized experiment involving more than 800,000 learners, primarily from…
Battling the Coronavirus ‘Infodemic’ among Social Media Users in Kenya and Nigeria
How can we induce social media users to be discerning when sharing information during a pandemic? An experiment on Facebook Messenger with users from Kenya (n = 7,498) and Nigeria (n = 7,794) tested interventions designed to…
Using Wasserstein Generative Adversarial Networks for the Design of Monte Carlo Simulations
When researchers develop new econometric methods it is common practice to compare the performance of the new methods to those of existing methods in Monte Carlo studies. The credibility of such Monte Carlo studies is often limited because of the…
CAREER: A Foundation Model for Labor Sequence Data
Labor economists regularly analyze employment data by fitting predictive models to small, carefully constructed longitudinal survey datasets. Although machine learning methods offer promise for such problems, these survey datasets are too small…
Digital Interventions and Habit Formation in Educational Technology
We evaluate a contest-based intervention intended to increase the usage of an educational app that helps children in India learn to read English. The evaluation included approximately 10,000 children, of whom about half were randomly selected to…