Continual Hand-Eye Calibration for Open-world Robotic Manipulation
Fazeng Li, Gan Sun, Chenxi Liu, Yao He, Wei Cong + 1 more
TLDR
This framework allows robots to continually adapt hand-eye calibration to new open-world scenes, preventing catastrophic forgetting of past knowledge.
Key contributions
- Introduces a continual hand-eye calibration framework for open-world robotic manipulation.
- Proposes Spatial-Aware Replay Strategy (SARS) for geometrically uniform and informative replay buffer sampling.
- Develops Structure-Preserving Dual Distillation (SPDD) to separately distill coarse scene layout and fine pose precision.
- Achieves significant anti-scene forgetting, maintaining accuracy on past scenes while adapting to new ones.
Why it matters
Robotic hand-eye calibration suffers from catastrophic forgetting in dynamic environments. This paper proposes a continual learning framework, enabling robots to adapt to new scenes without losing past knowledge, crucial for real-world use.
Original Abstract
Hand-eye calibration through visual localization is a critical capability for robotic manipulation in open-world environments. However, most deep learning-based calibration models suffer from catastrophic forgetting when adapting into unseen data amongst open-world scene changes, while simple rehearsal-based continual learning strategy cannot well mitigate this issue. To overcome this challenge, we propose a continual hand-eye calibration framework, enabling robots to adapt to sequentially encountered open-world manipulation scenes through spatially replay strategy and structure-preserving distillation. Specifically, a Spatial-Aware Replay Strategy (SARS) constructs a geometrically uniform replay buffer that ensures comprehensive coverage of each scene pose space, replacing redundant adjacent frames with maximally informative viewpoints. Meanwhile, a Structure-Preserving Dual Distillation (SPDD) is proposed to decompose localization knowledge into coarse scene layout and fine pose precision, and distills them separately to alleviate both types of forgetting during continual adaptation. As a new manipulation scene arrives, SARS provides geometrically representative replay samples from all prior scenes, and SPDD applies structured distillation on these samples to retain previously learned knowledge. After training on the new scene, SARS incorporates selected samples from the new scene into the replay buffer for future rehearsal, allowing the model to continuously accumulate multi-scene calibration capability. Experiments on multiple public datasets show significant anti scene forgetting performance, maintaining accuracy on past scenes while preserving adaptation to new scenes, confirming the effectiveness of the framework.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.