Diagnosable ColBERT: Debugging Late-Interaction Retrieval Models Using a Learned Latent Space as Reference

April 21, 20262604.19566

cs.IRcs.CL

TLDR

Diagnosable ColBERT improves debugging of late-interaction retrieval models by aligning token embeddings to a clinical knowledge-grounded latent space.

Key contributions

Introduces Diagnosable ColBERT for debugging late-interaction retrieval models.
Aligns ColBERT token embeddings to a clinical knowledge-grounded latent space.
Provides inspectable evidence of model understanding for direct error diagnosis.
Enables principled data curation, reducing reliance on many diagnostic queries.

Why it matters

Current late-interaction models offer shallow interpretability, hindering systematic error diagnosis in critical domains like biomedicine. Diagnosable ColBERT provides a framework to make model understanding explicit and debuggable, crucial for reliable and trustworthy AI systems.

Original Abstract

Reliable biomedical and clinical retrieval requires more than strong ranking performance: it requires a practical way to find systematic model failures and curate the training evidence needed to correct them. Late-interaction models such as ColBERT provide a first solution thanks to the interpretable token-level interaction scores they expose between document and query tokens. Yet this interpretability is shallow: it explains a particular document--query pairwise score, but does not reveal whether the model has learned a clinical concept in a stable, reusable, and context-sensitive way across diverse expressions. As a result, these scores provide limited support for diagnosing misunderstandings, identifying irreasonably distant biomedical concepts, or deciding what additional data or feedback is needed to address this. In this short position paper, we propose Diagnosable ColBERT, a framework that aligns ColBERT token embeddings to a reference latent space grounded in clinical knowledge and expert-provided conceptual similarity constraints. This alignment turns document encodings into inspectable evidence of what the model appears to understand, enabling more direct error diagnosis and more principled data curation without relying on large batteries of diagnostic queries.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers