Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization

April 15, 20262604.13533

Jianzong Wang, Botao Zhao, Yayun He, Junqing Peng, Xulong Zhang

cs.ROcs.CV

TLDR

EEAgent uses VLMs and Long Short-Term Reflective Optimization (LSTRO) for self-evolving robotic manipulation, achieving SOTA on VIMA-Bench.

Key contributions

Proposes EEAgent, an evolvable embodied agent framework for robotic manipulation.
Leverages large vision-language models (VLMs) for better environmental interpretation and policy planning.
Introduces Long Short-Term Reflective Optimization (LSTRO) for dynamic prompt refinement.
Achieves state-of-the-art performance on six complex VIMA-Bench robotic manipulation tasks.

Why it matters

General-purpose robotics requires agents that adapt and evolve. This paper tackles limitations of traditional methods by enabling continuous self-evolution through dynamic reflection. It offers a promising path towards more adaptable and interpretable robotic systems.

Original Abstract

Achieving general-purpose robotics requires empowering robots to adapt and evolve based on their environment and feedback. Traditional methods face limitations such as extensive training requirements, difficulties in cross-task generalization, and lack of interpretability. Prompt learning offers new opportunities for self-evolving robots without extensive training, but simply reflecting on past experiences.However, extracting meaningful insights from task successes and failures remains a challenge. To this end, we propose the evolvable embodied agent (EEAgent) framework, which leverages large vision-language models (VLMs) for better environmental interpretation and policy planning. To enhance reflection on past experiences, we propose a long short-term reflective optimization (LSTRO) mechanism that dynamically refines prompts based on both past experiences and newly learned lessons, facilitating continuous self-evolution, thereby enhancing overall task success rates. Evaluations on six VIMA-Bench tasks reveal that our approach sets a new state-of-the-art, notably outperforming baselines in complex scenarios.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers