Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo
Ninghui Xu, Fabio Tosi, Lihui Wang, Jiawei Han, Luca Bartolomei + 3 more
TLDR
A new framework, Bi-CMPStereo, uses bidirectional cross-modal prompting to enhance event-frame asymmetric stereo for robust 3D perception.
Key contributions
- Proposes Bi-CMPStereo, a novel bidirectional cross-modal prompting framework.
- Learns finely aligned stereo representations within a shared canonical space.
- Integrates complementary features by projecting each modality into both event and frame domains.
Why it matters
This paper tackles the challenge of combining event and frame camera data for robust 3D perception in dynamic scenes. Bi-CMPStereo bridges the modality gap, enabling more reliable and accurate 3D reconstruction. This is vital for applications needing robust perception under fast motion and difficult lighting.
Original Abstract
Conventional frame-based cameras capture rich contextual information but suffer from limited temporal resolution and motion blur in dynamic scenes. Event cameras offer an alternative visual representation with higher dynamic range free from such limitations. The complementary characteristics of the two modalities make event-frame asymmetric stereo promising for reliable 3D perception under fast motion and challenging illumination. However, the modality gap often leads to marginalization of domain-specific cues essential for cross-modal stereo matching. In this paper, we introduce Bi-CMPStereo, a novel bidirectional cross-modal prompting framework that fully exploits semantic and structural features from both domains for robust matching. Our approach learns finely aligned stereo representations within a target canonical space and integrates complementary representations by projecting each modality into both event and frame domains. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art methods in accuracy and generalization.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.