Learning Reactive Dexterous Grasping via Hierarchical Task-Space RL Planning and Joint-Space QP Control
Ho Jae Lee, Yonghyeon Lee, Alexander Alexiev, Tzu-Yuan Lin, Se Hwan Jeon + 1 more
TLDR
A hierarchical RL framework combines task-space planning and joint-space QP control for reactive, dexterous grasping with zero-shot steerability.
Key contributions
- Proposes a hierarchical control framework for reactive dexterous grasping.
- Uses multi-agent RL for high-level task-space planning commands.
- GPU-parallelized QP controller ensures safe joint execution and collision avoidance.
- Achieves zero-shot steerability and robust sim-to-real transfer on unseen objects.
Why it matters
This framework improves robotic grasping by separating planning and control, leading to faster training and safer hardware operation. Its zero-shot steerability allows dynamic adjustments without retraining, making it highly adaptable for real-world, unstructured environments.
Original Abstract
In this work, we propose a hybrid hierarchical control framework for reactive dexterous grasping that explicitly decouples high-level spatial intent from low-level joint execution. We introduce a multi-agent reinforcement learning architecture, specialized into distinct arm and hand agents, that acts as a high-level planner by generating desired task-space velocity commands. These commands are then processed by a GPU-parallelized quadratic programming controller, which translates them into feasible joint velocities while strictly enforcing kinematic limits and collision avoidance. This structural isolation not only accelerates training convergence but also strictly enforces hardware safety. Furthermore, the architecture unlocks zero-shot steerability, allowing system operators to dynamically adjust safety margins and avoid dynamic obstacles without retraining the policy. We extensively validate the proposed framework through a rigorous simulation-to-reality pipeline. Real-world hardware experiments on a 7-DoF arm equipped with a 20-DoF anthropomorphic hand demonstrate highly robust zero-shot transferability for dexterous grasping to a diverse set of unseen objects, highlighting the system's ability to reactively recover from unexpected physical disturbances in unstructured environments.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.