LongSeeker: Elastic Context Orchestration for Long-Horizon Search Agents

May 6, 20262605.05191

Yijun Lu, Rui Ye, Yuwen Du, Jiajun Wang, Songhua Liu + 1 more

cs.AI

TLDR

LongSeeker introduces Context-ReAct, an adaptive context management paradigm for long-horizon search agents, significantly improving performance and efficiency.

Key contributions

Introduces Context-ReAct, an agentic paradigm for elastic context orchestration in long-horizon search.
Defines five atomic operations (Skip, Compress, Rollback, Snippet, Delete) for dynamic context reshaping.
Develops LongSeeker, an agent fine-tuned on 10k trajectories, outperforming baselines on search benchmarks.
Demonstrates that adaptive context management reduces generation cost and hallucination risk.

Why it matters

This paper addresses a critical challenge in long-horizon agents: managing ever-growing context. By introducing adaptive context orchestration, it enables more reliable and efficient reasoning. The significant performance gains of LongSeeker highlight a promising direction for developing advanced AI agents.

Original Abstract

Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermediate content can overwhelm the agent, increasing costs and the risk of errors. We propose that effective context management should be adaptive: parts of the agent's trajectory are maintained at different levels of detail depending on their current relevance to the task. To operationalize this principle, we introduce Context-ReAct, a general agentic paradigm for elastic context orchestration that integrates reasoning, context management, and tool use in a unified loop. Context-ReAct provides five atomic operations: Skip, Compress, Rollback, Snippet and Delete, which allow the agent to dynamically reshape its working context, preserving important evidence, summarizing resolved information, discarding unhelpful branches, and controlling context size. We prove that the Compress operator is expressively complete, while the other specialized operators provide efficiency and fidelity guarantees that reduce generation cost and hallucination risk. Building on this paradigm, we develop LongSeeker, a long-horizon search agent fine-tuned from Qwen3-30B-A3B on 10k synthesized trajectories. Across four representative search benchmarks, LongSeeker achieves 61.5% on BrowseComp and 62.5% on BrowseComp-ZH, substantially outperforming Tongyi DeepResearch (43.2% and 46.7%) and AgentFold (36.2% and 47.3%). These results highlight the potential of adaptive context management, showing that agents can achieve more reliable and efficient long-horizon reasoning by actively shaping their working memory.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers