MAS-SZZ: Multi-Agentic SZZ Algorithm for Vulnerability-Inducing Commit Identification
Sicong Cao, Jinxuan Xu, Le Yu, Jing Yang, Xingwei Lin + 2 more
TLDR
MAS-SZZ is a multi-agentic algorithm that accurately identifies vulnerability-inducing commits, significantly outperforming existing SZZ methods.
Key contributions
- Introduces MAS-SZZ, a multi-agentic algorithm for accurate vulnerability-inducing commit identification.
- Summarizes vulnerability root causes from CVEs and fixing commits to guide analysis.
- Localizes vulnerable statements using structured prompting based on patch hunk change intent.
- Outperforms state-of-the-art SZZ algorithms with F1-score gains up to 65.22%.
Why it matters
Accurately identifying vulnerability-inducing commits is crucial for software security tasks like vulnerability detection and affected version analysis. This paper introduces MAS-SZZ, which significantly improves the reliability and performance of this critical process, making it practical for real-world applications.
Original Abstract
Accurate vulnerability-inducing commit identification serves as a foundation for a series of software security tasks, such as vulnerability detection and affected version analysis. A straightforward solution is the SZZ algorithm, which traces back through the code history to identify the earliest commit that modify the vulnerable code. Unfortunately, neither the customized V-SZZ nor state-of-the-art LLM4SZZ perform satisfactorily due to the incorrect anchor selection and inadequate backtracking capability, making them far beyond a reliable usage in practice. To overcome these challenges, we propose a multi-agentic SZZ algorithm, named MAS-SZZ, that facilitates the identification of vulnerability-inducing commits through collaboration among agents. Specifically, given a CVE description and its corresponding fixing commit, MAS-SZZ summarizes the root cause of the vulnerability and employs a structured step-forward prompting strategy to localize vulnerability-related statements based on the change intent of each patch hunk. These vulnerable statements serve as anchors from which MAS-SZZ autonomously traces backward through the repository's history to find the commit that first introduced the vulnerability. Extensive experiments show that MAS-SZZ outperforms the state-of-the-art baselines across datasets and programming languages, achieving F1-score gains of up to 65.22% over the best-performing SZZ algorithm.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.