Remember the Decision, Not the Description: A Rate-Distortion Framework for Agent Memory
Mingxi Zou, Zhihan Guo, Langzhang Liang, Zhuo Wang, Qifan Wang + 4 more
TLDR
This paper introduces a decision-centric rate-distortion framework for agent memory, proposing DeMem to optimize memory by preserving distinctions crucial for decisions.
Key contributions
- Introduces a decision-centric rate-distortion framework for agent memory.
- Defines memory quality by loss in achievable decision quality due to compression.
- Proposes DeMem, an online memory learner that refines partitions only when data indicates decision conflict.
- Achieves consistent performance gains on synthetic diagnostics and long-horizon conversational benchmarks.
Why it matters
Existing agent memory systems often prioritize descriptive accuracy over decision utility. This paper shifts focus to a decision-centric view, offering a principled way to manage memory by preserving only the distinctions vital for future actions. This approach leads to more efficient and effective long-horizon language agents.
Original Abstract
Long-horizon language agents must operate under limited runtime memory, yet existing memory mechanisms often organize experience around descriptive criteria such as relevance, salience, or summary quality. For an agent, however, memory is valuable not because it faithfully describes the past, but because it preserves the distinctions between histories that must remain separated under a fixed budget to support good decisions. We cast this as a decision-centric rate-distortion problem, measuring memory quality by the loss in achievable decision quality induced by compression. This yields an exact forgetting boundary for what can be safely forgotten, and a memory-distortion frontier characterizing the optimal tradeoff between memory budget and decision quality. Motivated by this decision-centric view of memory, we propose DeMem, an online memory learner that refines its partition only when data certify that a shared state would induce decision conflict, and prove near-minimax regret guarantees. On both controlled synthetic diagnostics and long-horizon conversational benchmarks, DeMem yields consistent gains under the same runtime budget, supporting the principle that memory should preserve the distinctions that matter for decisions, not descriptions.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.