RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography
Mélanie Roschewitz, Kenneth Styppa, Yitian Tao, Jiwoong Sohn, Jean-Benoit Delbrouck + 8 more
TLDR
RadAgent is a tool-using AI agent that generates interpretable chest CT reports with a stepwise reasoning trace, improving accuracy and robustness.
Key contributions
- Generates chest CT reports with an inspectable, stepwise reasoning trace for clinician validation.
- Improves clinical accuracy by 36.4% (macro-F1) and 19.6% (micro-F1) over its VLM counterpart.
- Enhances robustness under adversarial conditions by 41.9%.
- Achieves 37.0% faithfulness, a new capability for VLM-based CT reporting.
Why it matters
This paper introduces RadAgent, addressing the critical need for interpretable AI in medical imaging. By providing a transparent reasoning trace, it allows clinicians to validate findings, fostering trust and enabling more reliable AI for radiology.
Original Abstract
Vision-language models (VLM) have markedly advanced AI-driven interpretation and reporting of complex medical imaging, such as computed tomography (CT). Yet, existing methods largely relegate clinicians to passive observers of final outputs, offering no interpretable reasoning trace for them to inspect, validate, or refine. To address this, we introduce RadAgent, a tool-using AI agent that generates CT reports through a stepwise and interpretable process. Each resulting report is accompanied by a fully inspectable trace of intermediate decisions and tool interactions, allowing clinicians to examine how the reported findings are derived. In our experiments, we observe that RadAgent improves Chest CT report generation over its 3D VLM counterpart, CT-Chat, across three dimensions. Clinical accuracy improves by 6.0 points (36.4% relative) in macro-F1 and 5.4 points (19.6% relative) in micro-F1. Robustness under adversarial conditions improves by 24.7 points (41.9% relative). Furthermore, RadAgent achieves 37.0% in faithfulness, a new capability entirely absent in its 3D VLM counterpart. By structuring the interpretation of chest CT as an explicit, tool-augmented and iterative reasoning trace, RadAgent brings us closer toward transparent and reliable AI for radiology.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.