Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization
Jianzong Wang, Botao Zhao, Yayun He, Junqing Peng, Xulong Zhang
TLDR
EEAgent uses VLMs and Long Short-Term Reflective Optimization (LSTRO) for self-evolving robotic manipulation, achieving SOTA on VIMA-Bench.
Key contributions
- Proposes EEAgent, an evolvable embodied agent framework for robotic manipulation.
- Leverages large vision-language models (VLMs) for better environmental interpretation and policy planning.
- Introduces Long Short-Term Reflective Optimization (LSTRO) for dynamic prompt refinement.
- Achieves state-of-the-art performance on six complex VIMA-Bench robotic manipulation tasks.
Why it matters
General-purpose robotics requires agents that adapt and evolve. This paper tackles limitations of traditional methods by enabling continuous self-evolution through dynamic reflection. It offers a promising path towards more adaptable and interpretable robotic systems.
Original Abstract
Achieving general-purpose robotics requires empowering robots to adapt and evolve based on their environment and feedback. Traditional methods face limitations such as extensive training requirements, difficulties in cross-task generalization, and lack of interpretability. Prompt learning offers new opportunities for self-evolving robots without extensive training, but simply reflecting on past experiences.However, extracting meaningful insights from task successes and failures remains a challenge. To this end, we propose the evolvable embodied agent (EEAgent) framework, which leverages large vision-language models (VLMs) for better environmental interpretation and policy planning. To enhance reflection on past experiences, we propose a long short-term reflective optimization (LSTRO) mechanism that dynamically refines prompts based on both past experiences and newly learned lessons, facilitating continuous self-evolution, thereby enhancing overall task success rates. Evaluations on six VIMA-Bench tasks reveal that our approach sets a new state-of-the-art, notably outperforming baselines in complex scenarios.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.