ArXiv TLDR

Causal Software Engineering: A Vision and Roadmap

🐦 Tweet
2605.02454

Roberto Pietrantuono, Luca Giamattei, Stefano Russo, Julien Siebert, Neil Walkinshaw

cs.SEcs.AI

TLDR

Introduces Causal Software Engineering (CSE) to enhance software engineering decisions by moving beyond correlation to explicit causal reasoning.

Key contributions

  • Proposes Causal Software Engineering (CSE) as a paradigm shift from correlational to causal models.
  • Outlines a causal-first workflow view spanning development and operations.
  • Presents a staged roadmap for tools and organizational adoption of CSE.
  • Defines an evaluation and benchmark agenda for measuring progress in CSE.

Why it matters

Current AI/ML models detect patterns but struggle with critical 'what if' questions in software engineering. CSE integrates causal reasoning to predict intervention impacts and enable counterfactual diagnosis. This paradigm shift promises more robust, informed decision-making.

Original Abstract

Software engineering increasingly involves making high-stakes decisions under uncertainty, using signals from code, field data, and socio-technical processes. Recent AI-driven support (e.g., anomaly detection, predictive analytics, AIOps, as well as LLM-based agents) has amplified engineers' ability to detect patterns and synthesize content and recommendations, but many critical questions are interventional or counterfactual: What is the expected impact of changing a load-balancing strategy? Would an outage have been avoided under a different release plan? Correlational models answer "what tends to co-occur"; they struggle to answer "what would happen if we act." We propose Causal Software Engineering (CSE) as a future paradigm in which causal models and causal reasoning systematically inform activities across the software lifecycle, augmenting existing practices with explicit assumptions, uncertainty-aware effect estimates, and counterfactual diagnosis. We outline (i) a causal-first workflow view spanning development and operations, (ii) a staged roadmap for tools and organizational adoption, and (iii) an evaluation and benchmark agenda for measuring progress.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.