Causal Diffusion Models for Counterfactual Outcome Distributions in Longitudinal Data
Farbod Alinezhad, Jianfei Cao, Gary J. Young, Brady Post
TLDR
Causal Diffusion Model (CDM) predicts full counterfactual outcome distributions in longitudinal data, outperforming state-of-the-art methods.
Key contributions
- Introduces Causal Diffusion Model (CDM) for generating full probabilistic counterfactual outcome distributions.
- Employs a novel residual denoising architecture with relational self-attention for complex temporal dependencies.
- Achieves robust counterfactual prediction without explicit deconfounding methods like IPW or adversarial balancing.
- Outperforms state-of-the-art methods by 15-30% in distributional accuracy on a tumor-growth simulator.
Why it matters
CDM offers a flexible, high-impact tool for decision support by unifying uncertainty quantification and robust counterfactual prediction. It addresses challenges in complex, sequentially confounded longitudinal settings, making it valuable for medicine and policy evaluation.
Original Abstract
Predicting counterfactual outcomes in longitudinal data, where sequential treatment decisions heavily depend on evolving patient states, is critical yet notoriously challenging due to complex time-dependent confounding and inadequate uncertainty quantification in existing methods. We introduce the Causal Diffusion Model (CDM), the first denoising diffusion probabilistic approach explicitly designed to generate full probabilistic distributions of counterfactual outcomes under sequential interventions. CDM employs a novel residual denoising architecture with relational self-attention, capturing intricate temporal dependencies and multimodal outcome trajectories without requiring explicit adjustments (e.g., inverse-probability weighting or adversarial balancing) for confounding. In rigorous evaluation on a pharmacokinetic-pharmacodynamic tumor-growth simulator widely adopted in prior work, CDM consistently outperforms state-of-the-art longitudinal causal inference methods, achieving a 15-30% relative improvement in distributional accuracy (1-Wasserstein distance) while maintaining competitive or superior point-estimate accuracy (RMSE) under high-confounding regimes. By unifying uncertainty quantification and robust counterfactual prediction in complex, sequentially confounded settings, without tailored deconfounding, CDM offers a flexible, high-impact tool for decision support in medicine, policy evaluation, and other longitudinal domains.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.