ArXiv TLDR

Design-Based Variance Estimation for Modern Heterogeneity-Robust Difference-in-Differences Estimators

🐦 Tweet
2605.04124

Isaac Gerber

stat.MEecon.EM

TLDR

This paper shows how to correctly estimate variance for modern DiD estimators used with complex survey data, preventing misleading results.

Key contributions

  • Standard stratified-cluster variance formula yields design-consistent standard errors for modern DiD estimators.
  • HC1 standard errors (iid assumption) lead to very low coverage (34-11%) with complex survey designs.
  • Combining survey-weighted point estimates with PSU-level clustering recovers near-nominal coverage.
  • Open-source `diff-diff` Python package implements design-based variance for 15 modern DiD estimators.

Why it matters

Many DiD applications to complex survey data yield incorrect standard errors, leading to misleading conclusions. This paper provides a design-consistent variance estimation method, crucial for reliable policy evaluations, and offers an open-source Python package.

Original Abstract

Modern heterogeneity-robust difference-in-differences estimators derive their asymptotic properties under iid, cluster, or fixed-design frameworks that abstract from complex survey sampling, yet practitioners routinely apply them to nationally representative surveys with stratified cluster designs. We show that, under standard regularity conditions, the influence functions of each smooth IF-based or regression-based modern DiD estimator satisfy Binder's (1983) smoothness conditions, so the standard stratified-cluster variance formula applied to their values produces design-consistent standard errors. A Monte Carlo study with 66,000 replications shows where the design effect comes from. HC1 standard errors that treat observations as iid produce coverage as low as 34% under a baseline survey design and below 11% under informative sampling. Combining the survey-weighted point estimate with PSU-level clustering - the practitioner's cluster=psu heuristic - recovers near-nominal coverage across all scenarios. Adding strata and finite-population corrections yields incremental precision but is not required for valid coverage. Survey-weighted doubly robust estimation produces well-calibrated inference when parallel trends hold only conditionally. An NHANES illustration of the ACA dependent coverage provision shows that point estimates and standard errors change substantively - enough to reverse significance conclusions - when the survey design is accounted for. We provide diff-diff (https://github.com/igerber/diff-diff), an open-source Python package implementing design-based variance for fifteen modern DiD estimators.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.