EmbodiedClaw: Conversational Workflow Execution for Embodied AI Development
Xueyang Zhou, Yihan Sun, Xijie Gong, Guiyao Tie, Pan Zhou + 2 more
TLDR
EmbodiedClaw is a conversational agent that automates complex embodied AI development workflows, reducing engineering overhead and improving reproducibility.
Key contributions
- Proposes a new paradigm for conversational workflow execution in embodied AI development.
- EmbodiedClaw automates high-cost activities like environment creation, trajectory synthesis, and model evaluation.
- Reduces manual engineering effort while improving executability, consistency, and reproducibility.
- Supports complex multi-task, multi-scene, and multi-model embodied AI settings.
Why it matters
The paper addresses the growing engineering complexity in embodied AI development. By introducing a conversational agent, it streamlines workflows, significantly cutting down development time and manual effort. This could shift how researchers approach and manage complex embodied AI projects.
Original Abstract
Embodied AI research is increasingly moving beyond single-task, single-environment policy learning toward multi-task, multi-scene, and multi-model settings. This shift substantially increases the engineering overhead and development time required for stages such as evaluation environment construction, trajectory collection, model training, and evaluation. To address this challenge, we propose a new paradigm for embodied AI development in which users express goals and constraints through conversation, and the system automatically plans and executes the development workflow. We instantiate this paradigm with EmbodiedClaw, a conversational agent that turns high-frequency, high-cost embodied research activities, including environment creation and revision, benchmark transformation, trajectory synthesis, model evaluation, and asset expansion, into executable skills. Experiments on end-to-end workflow tasks, capability-specific evaluations, human researcher studies, and ablations show that EmbodiedClaw reduces manual engineering effort while improving executability, consistency, and reproducibility. These results suggest a shift from manual toolchains to conversationally executable workflows for embodied AI development.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.