M-CaStLe: Uncovering Local Causal Structures in Multivariate Space-Time Gridded Data
J. Jake Nichol, Michael Weylandt, G. Matthew Fricke, Jhayron Perez-Carrasquilla, Melanie E. Moses
TLDR
M-CaStLe extends causal discovery for high-dimensional space-time gridded data to multivariate systems, identifying local causal structures.
Key contributions
- Generalizes CaStLe to jointly model local within-variable and cross-variable space-time causal structures.
- Increases effective sample size by pooling spatial replicates, making discovery tractable in high-dimensional data.
- Decomposes multivariate stencil graphs into reaction and spatial graphs for enhanced interpretability.
- Validated across diverse settings, including VAR, PDE, atmospheric chemistry, and ENSO studies.
Why it matters
This paper introduces M-CaStLe, a significant advancement for causal discovery in complex multivariate space-time systems. It addresses the challenge of high-dimensional gridded data by enabling tractable and interpretable identification of local causal structures. This is crucial for understanding dynamics in fields like climate science and atmospheric chemistry.
Original Abstract
Causal graph discovery for space-time systems is challenging in high-dimensional gridded data, which often has many more grid cells than temporal observations per cell. The Causal Space-Time Stencil Learning (CaStLe) meta-algorithm was developed to address that niche under space-time locality and stationarity assumptions, but it is currently limited to univariate analyses. In this work, we present M-CaStLe. M-CaStLe generalizes the local embedding and parent-identification phases of CaStLe to jointly model local within-variable and cross-variable space-time causal structures in gridded data. Like CaStLe, by constraining candidate parents to a constant-size space-time neighborhood and pooling spatial replicates, M-CaStLe increases effective sample size to make discovery tractable in high-dimensional settings. We further decompose the resulting multivariate stencil graph into reaction and spatial graphs to aid interpretation in complex settings. We study M-CaStLe in four settings: a multivariate space-time vector autoregression benchmark with known ground truth, an advective-diffusive-reaction partial differential equation verification problem with derived physical reference structure, an atmospheric chemistry case study in a low-temporal-sample regime, and an El Niño Southern Oscillation study on reanalysis data, identifying phase-dependent ocean--atmosphere coupling. Across these settings, M-CaStLe more accurately recovers multivariate causal structure in controlled settings and identifies important physical dynamics in real-world case studies. Overall, M-CaStLe advances causal discovery for multivariate space-time systems while retaining interpretability at the grid level.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.