DocQAC: Adaptive Trie-Guided Decoding for Effective In-Document Query Auto-Completion

April 20, 20262604.18257

Rahul Mehta, Kavin R, Indrajit Pal, Tushar Abhishek, Pawan Goyal + 1 more

cs.IRcs.AIcs.CL

TLDR

DocQAC introduces an adaptive trie-guided decoding framework for in-document query auto-completion, outperforming large LMs.

Key contributions

Proposes DocQAC, a novel adaptive trie-guided decoding framework for in-document query auto-completion.
Uses an adaptive penalty mechanism to balance language model confidence with trie-based guidance.
Incorporates document context using RAG and lightweight signals for improved performance.
Outperforms strong baselines and larger LMs (LLaMA-3, Phi-3) on a new DocQAC benchmark.

Why it matters

This paper addresses the underexplored area of in-document query auto-completion, crucial for productivity in long documents. Its adaptive trie-guided approach offers a practical and efficient solution, significantly improving search precision.

Original Abstract

Query auto-completion (QAC) has been widely studied in the context of web search, yet remains underexplored for in-document search, which we term DocQAC. DocQAC aims to enhance search productivity within long documents by helping users craft faster, more precise queries, even for complex or hard-to-spell terms. While global historical queries are available to both WebQAC and DocQAC, DocQAC uniquely accesses document-specific context, including the current document's content and its specific history of user query interactions. To address this setting, we propose a novel adaptive trie-guided decoding framework that uses user query prefixes to softly steer language models toward high-quality completions. Our approach introduces an adaptive penalty mechanism with tunable hyperparameters, enabling a principled trade-off between model confidence and trie-based guidance. To efficiently incorporate document context, we explore retrieval-augmented generation (RAG) and lightweight contextual document signals such as titles, keyphrases, and summaries. When applied to encoder-decoder models like T5 and BART, our trie-guided framework outperforms strong baselines and even surpasses much larger instruction-tuned models such as LLaMA-3 and Phi-3 on seen queries across both seen and unseen documents. This demonstrates its practicality for real-world DocQAC deployments, where efficiency and scalability are critical. We evaluate our method on a newly introduced DocQAC benchmark derived from ORCAS, enriched with query-document pairs. We make both the DocQAC dataset (https://bit.ly/3IGEkbH) and code (https://github.com/rahcode7/DocQAC) publicly available.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers