Explanation of Dynamic Physical Field Predictions using WassersteinGrad: Application to Autoregressive Weather Forecasting

April 24, 20262604.22580

Younes Essafouri, Laure Raynaud, Luciano Drozda, Laurent Risser

stat.MLcs.LG

TLDR

WassersteinGrad explains dynamic physical field predictions, like weather forecasts, by geometrically averaging perturbed attribution maps to avoid blurring.

Key contributions

Identifies that pointwise averaging of attribution maps blurs features due to geometric displacement in dynamic fields.
Introduces WassersteinGrad, using entropic Wasserstein barycenter for geometric consensus of perturbed attribution maps.
Validated on regional weather data with a meteorologist-validated neural model.
Shows improved explainability for single-step and autoregressive weather forecasting over baselines.

Why it matters

As AI integrates into high-stakes fields like weather forecasting, explaining its predictions is crucial. Traditional gradient-based methods struggle with dynamic physical fields due to geometric misalignments. WassersteinGrad offers a robust solution, enhancing the trustworthiness and operational utility of autoregressive weather models.

Original Abstract

As the demand to integrate Artificial Intelligence into high-stakes environments continues to grow, explaining the reasoning behind neural-network predictions has shifted from a theoretical curiosity to a strict operational requirement. Our work is motivated by the explanations of autoregressive neural predictions on dynamic physical fields, as in weather forecasting. Gradient-based feature attribution methods are widely used to explain the predictions on such data, in particular due to their scalability to high-dimensional inputs. It is also interesting to remark that gradient-based techniques such as SmoothGrad are now standard on images to robustify the explanations using pointwise averages of the attribution maps obtained from several noised inputs. Our goal is to efficiently adapt this aggregation strategy to dynamic physical fields. To do so, our first contribution is to identify a fundamental failure mode when averaging perturbed attribution maps on dynamic physical fields: stochastic input perturbations do not induce stationary amplitude noise in attribution maps, but instead cause a geometric displacement of the attributions. Consequently, pointwise averaging blurs these spatially misaligned features. To tackle this issue, we introduce WassersteinGrad, which extracts a geometric consensus of perturbed attribution maps by computing their entropic Wasserstein barycenter. The results, obtained on regional weather data and a meteorologist-validated neural model, demonstrate promising explainability properties of WassersteinGrad over gradient-based baselines across both single-step and autoregressive forecasting settings.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers