ArXiv TLDR

ADAPTS: Agentic Decomposition for Automated Protocol-agnostic Tracking of Symptoms

🐦 Tweet
2605.03212

Alexandria K. Vail, Marcelo Cicconet, Katie Aafjes-van Doorn, Ryan Maroney, Marc Aafjes

cs.AIcs.CLcs.HCstat.APstat.CO

TLDR

ADAPTS is an LLM-based framework using agentic decomposition to automate depression and anxiety severity ratings from clinical interviews, outperforming human ratings.

Key contributions

  • ADAPTS uses a mixture-of-agents LLM to automate depression/anxiety severity ratings from clinical interviews.
  • Decomposes interviews into symptom-specific tasks, providing auditable justifications and preserving alignment.
  • Automated ratings outperformed human ratings on high-discrepancy interviews, closer to expert benchmarks.
  • An extended protocol, incorporating clinical conventions, significantly stabilized ratings (ICC(2,1) = 0.877).

Why it matters

This framework offers a scalable, objective foundation for psychiatric assessment, especially in resource-limited settings. By automating severity ratings with expert-level precision, it can improve mental health diagnostics. Its extensibility to multimodal inputs also paves the way for future advancements.

Original Abstract

Modeling latent clinical constructs from unconstrained clinical interactions is a unique challenge in affective computing. We present ADAPTS (Agentic Decomposition for Automated Protocol-agnostic Tracking of Symptoms), a framework for automated rating of depression and anxiety severity using a mixture-of-agents LLM architecture. This approach decomposes long-form clinical interviews into symptom-specific reasoning tasks, producing auditable justifications while preserving temporal and speaker alignment. Generalization was evaluated across two independent datasets ($N=204$) with distinct interview structures. On high-discrepancy interviews, automated ratings approximated expert benchmarks ($\text{absolute error}=22$) more closely than original human ratings ($\text{absolute error}=26$). Implementing an ``extended'' protocol that incorporates qualitative clinical conventions significantly stabilized ratings, with absolute agreement reaching $\text{ICC(2,1)} = 0.877$. These findings suggest that the ADAPTS framework enables promising evaluations of psychiatric severity. While the current implementation is purely text-based, the underlying architecture is readily extensible to multimodal inputs, including acoustic and visual features. By approximating expert-level precision in a protocol-agnostic manner, this framework provides a foundation for objective and scalable psychiatric assessment, especially in resource-limited settings.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.