Text Corpora as Concept Fields: Black-Box Hallucination and Novelty Measurement

May 6, 20262605.05103

Nicholas S. Kersting, Vittorio Castelli, Chieh Ting Yeh, Xinzhu Wang, Saad Taame

cs.CLcs.AIcs.CY

TLDR

Concept Fields, a new method using sentence embeddings, detect text hallucination and novelty with high accuracy and interpretability.

Key contributions

Introduces "Concept Field," a local drift field with uncertainty from sentence-embedding deltas.
Develops a Vector Sequence Database (VSDB) to efficiently store embeddings and sequence metadata.
Evaluated for hallucination detection (U.S. Code) and novelty detection (Project Gutenberg).
Achieves strong selective classification performance, offering a fast, interpretable signal.

Why it matters

This paper introduces a novel, black-box, and corpus-attributable method for detecting ungrounded text and novelty. It offers a fast, lightweight, and interpretable signal, complementing existing LLM-as-judge and white-box detectors. Its domain-transferable probabilistic interpretation is a key advantage.

Original Abstract

We introduce the **Concept Field** of a text corpus: a local drift field with pointwise uncertainty, estimated in sentence-embedding space from the deltas between consecutive sentences. Given a candidate sentence transition, we score its agreement with the field by $ζ$, the mean absolute z-distance between the observed delta and the field's local Gaussian estimate. The score is black-box (no model internals), corpus-attributable (every score traces to nearby corpus sentences), and admits a direct probabilistic reading. We support the computation with the introduction of a **Vector Sequence Database (VSDB)** that stores embeddings together with sequence-position and next-delta metadata. We evaluate this approach on two large-scale settings: hallucination-style groundedness detection over the U.S. Code of Federal Regulations, and novelty detection over Project Gutenberg. Using controlled LLM-generated rewrites, Concept Fields achieve strong selective classification performance under a grounded / ungrounded / unsure triage policy, which unlike retrieval-centric baselines have similar coverage-risk behavior across both domains, supporting a probability-based interpretation that transfers across domains. We also sketch how divergence and curl of the Concept Field, computed on dense clusters, surface qualitatively meaningful semantic patterns (logic sources, sinks, and implicit topics), which we offer as hypothesis-generating rather than as a quantitative result. Concept Fields provide a fast, lightweight, and interpretable signal for groundedness and novelty, complementary to LLM-as-judge and white-box detectors.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers