ArXiv TLDR

FitText: Evolving Agent Tool Ecologies via Memetic Retrieval

🐦 Tweet
2605.02411

Kyle Zheng, Han Zhang, Renliang Sun, Chenchen Ye, Wei Wang

cs.AIcs.IRcs.LGcs.MA

TLDR

FitText introduces a training-free framework that dynamically evolves agent tool retrieval by generating and refining pseudo-tool descriptions using memetic retrieval.

Key contributions

  • Introduces FitText, a training-free framework for dynamic tool retrieval in large API ecosystems.
  • Generates and iteratively refines natural-language pseudo-tool descriptions as retrieval probes.
  • Employs Memetic Retrieval with evolutionary selection and tool memory to enhance search efficiency.
  • Significantly improves retrieval rank (8.81 to 2.78) and pass rate (0.73) on large tool benchmarks.

Why it matters

FitText offers a crucial, dynamic solution to the semantic gap in large API ecosystems, significantly boosting agent performance where static retrieval fails. This work also highlights that effective evolutionary tool exploration requires a competent base model.

Original Abstract

A semantic gap separates how users describe tasks from how tools are documented. As API ecosystems scale to tens of thousands of endpoints, static retrieval from the initial query alone cannot bridge this gap: the agent's understanding of what it needs evolves during execution, but its tool set does not. We introduce FitText, a training-free framework that makes retrieval dynamic by embedding it directly in the agent's reasoning loop. FitText generates natural-language pseudo-tool descriptions as retrieval probes, refines them iteratively using retrieval feedback, and explores diverse alternatives through stochastic generation. Memetic Retrieval adds evolutionary selection pressure over candidate descriptions, guided by a tool memory that avoids redundant search. On ToolRet (43k tools, 4 domains), FitText improves average retrieval rank from 8.81 to 2.78; on StableToolBench (16,464 APIs), it achieves a 0.73 average pass rate--a 24-point absolute gain over static query retrieval. The gains transfer across base models capable of acting as competent semantic operators; under weaker base models, Memetic's evolutionary search inverts--amplifying noise rather than refining signal--surfacing model capacity as a prerequisite for evolutionary tool exploration.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.