Continual Hand-Eye Calibration for Open-world Robotic Manipulation

April 17, 20262604.15814

Fazeng Li, Gan Sun, Chenxi Liu, Yao He, Wei Cong + 1 more

cs.CVcs.RO

TLDR

This framework allows robots to continually adapt hand-eye calibration to new open-world scenes, preventing catastrophic forgetting of past knowledge.

Key contributions

Introduces a continual hand-eye calibration framework for open-world robotic manipulation.
Proposes Spatial-Aware Replay Strategy (SARS) for geometrically uniform and informative replay buffer sampling.
Develops Structure-Preserving Dual Distillation (SPDD) to separately distill coarse scene layout and fine pose precision.
Achieves significant anti-scene forgetting, maintaining accuracy on past scenes while adapting to new ones.

Why it matters

Robotic hand-eye calibration suffers from catastrophic forgetting in dynamic environments. This paper proposes a continual learning framework, enabling robots to adapt to new scenes without losing past knowledge, crucial for real-world use.

Original Abstract

Hand-eye calibration through visual localization is a critical capability for robotic manipulation in open-world environments. However, most deep learning-based calibration models suffer from catastrophic forgetting when adapting into unseen data amongst open-world scene changes, while simple rehearsal-based continual learning strategy cannot well mitigate this issue. To overcome this challenge, we propose a continual hand-eye calibration framework, enabling robots to adapt to sequentially encountered open-world manipulation scenes through spatially replay strategy and structure-preserving distillation. Specifically, a Spatial-Aware Replay Strategy (SARS) constructs a geometrically uniform replay buffer that ensures comprehensive coverage of each scene pose space, replacing redundant adjacent frames with maximally informative viewpoints. Meanwhile, a Structure-Preserving Dual Distillation (SPDD) is proposed to decompose localization knowledge into coarse scene layout and fine pose precision, and distills them separately to alleviate both types of forgetting during continual adaptation. As a new manipulation scene arrives, SARS provides geometrically representative replay samples from all prior scenes, and SPDD applies structured distillation on these samples to retain previously learned knowledge. After training on the new scene, SARS incorporates selected samples from the new scene into the replay buffer for future rehearsal, allowing the model to continuously accumulate multi-scene calibration capability. Experiments on multiple public datasets show significant anti scene forgetting performance, maintaining accuracy on past scenes while preserving adaptation to new scenes, confirming the effectiveness of the framework.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers