MAS-SZZ: Multi-Agentic SZZ Algorithm for Vulnerability-Inducing Commit Identification

April 27, 20262604.24398

Sicong Cao, Jinxuan Xu, Le Yu, Jing Yang, Xingwei Lin + 2 more

cs.CRcs.SE

TLDR

MAS-SZZ is a multi-agentic algorithm that accurately identifies vulnerability-inducing commits, significantly outperforming existing SZZ methods.

Key contributions

Introduces MAS-SZZ, a multi-agentic algorithm for accurate vulnerability-inducing commit identification.
Summarizes vulnerability root causes from CVEs and fixing commits to guide analysis.
Localizes vulnerable statements using structured prompting based on patch hunk change intent.
Outperforms state-of-the-art SZZ algorithms with F1-score gains up to 65.22%.

Why it matters

Accurately identifying vulnerability-inducing commits is crucial for software security tasks like vulnerability detection and affected version analysis. This paper introduces MAS-SZZ, which significantly improves the reliability and performance of this critical process, making it practical for real-world applications.

Original Abstract

Accurate vulnerability-inducing commit identification serves as a foundation for a series of software security tasks, such as vulnerability detection and affected version analysis. A straightforward solution is the SZZ algorithm, which traces back through the code history to identify the earliest commit that modify the vulnerable code. Unfortunately, neither the customized V-SZZ nor state-of-the-art LLM4SZZ perform satisfactorily due to the incorrect anchor selection and inadequate backtracking capability, making them far beyond a reliable usage in practice. To overcome these challenges, we propose a multi-agentic SZZ algorithm, named MAS-SZZ, that facilitates the identification of vulnerability-inducing commits through collaboration among agents. Specifically, given a CVE description and its corresponding fixing commit, MAS-SZZ summarizes the root cause of the vulnerability and employs a structured step-forward prompting strategy to localize vulnerability-related statements based on the change intent of each patch hunk. These vulnerable statements serve as anchors from which MAS-SZZ autonomously traces backward through the repository's history to find the commit that first introduced the vulnerability. Extensive experiments show that MAS-SZZ outperforms the state-of-the-art baselines across datasets and programming languages, achieving F1-score gains of up to 65.22% over the best-performing SZZ algorithm.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers