Causal Diffusion Models for Counterfactual Outcome Distributions in Longitudinal Data

April 14, 20262604.12992

Farbod Alinezhad, Jianfei Cao, Gary J. Young, Brady Post

stat.MLcs.LGecon.EM

TLDR

Causal Diffusion Model (CDM) predicts full counterfactual outcome distributions in longitudinal data, outperforming state-of-the-art methods.

Key contributions

Introduces Causal Diffusion Model (CDM) for generating full probabilistic counterfactual outcome distributions.
Employs a novel residual denoising architecture with relational self-attention for complex temporal dependencies.
Achieves robust counterfactual prediction without explicit deconfounding methods like IPW or adversarial balancing.
Outperforms state-of-the-art methods by 15-30% in distributional accuracy on a tumor-growth simulator.

Why it matters

CDM offers a flexible, high-impact tool for decision support by unifying uncertainty quantification and robust counterfactual prediction. It addresses challenges in complex, sequentially confounded longitudinal settings, making it valuable for medicine and policy evaluation.

Original Abstract

Predicting counterfactual outcomes in longitudinal data, where sequential treatment decisions heavily depend on evolving patient states, is critical yet notoriously challenging due to complex time-dependent confounding and inadequate uncertainty quantification in existing methods. We introduce the Causal Diffusion Model (CDM), the first denoising diffusion probabilistic approach explicitly designed to generate full probabilistic distributions of counterfactual outcomes under sequential interventions. CDM employs a novel residual denoising architecture with relational self-attention, capturing intricate temporal dependencies and multimodal outcome trajectories without requiring explicit adjustments (e.g., inverse-probability weighting or adversarial balancing) for confounding. In rigorous evaluation on a pharmacokinetic-pharmacodynamic tumor-growth simulator widely adopted in prior work, CDM consistently outperforms state-of-the-art longitudinal causal inference methods, achieving a 15-30% relative improvement in distributional accuracy (1-Wasserstein distance) while maintaining competitive or superior point-estimate accuracy (RMSE) under high-confounding regimes. By unifying uncertainty quantification and robust counterfactual prediction in complex, sequentially confounded settings, without tailored deconfounding, CDM offers a flexible, high-impact tool for decision support in medicine, policy evaluation, and other longitudinal domains.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers