ArXiv TLDR

CAVERS: Multimodal SLAM Data from a Natural Karstic Cave with Ground Truth Motion Capture

🐦 Tweet
2604.15052

Giacomo Franchini, David Rodríguez-Martínez, Alfonso Martínez-Petersen, C. J. Pérez-del-Pulgar, Marcello Chiaberge

cs.RO

TLDR

CAVERS is a new multimodal dataset for SLAM in challenging natural karstic caves, featuring diverse sensors and mm-accurate ground truth.

Key contributions

  • Multimodal data from RGB-D-I, thermal, and LiDAR sensors in natural karstic caves.
  • Features mm-accurate 6-DoF ground truth pose and velocity from Optirack MoCap.
  • Includes 24 sequences (335 GB) from two distinct cave rooms, handheld and rover.
  • Benchmarks seven state-of-the-art SLAM and odometry algorithms for usability.

Why it matters

Autonomous robots in natural caves face unique challenges like irregular geometry and near-zero light. This dataset addresses the scarcity of relevant data, providing crucial resources for developing robust SLAM and navigation systems. It will advance robotic exploration in extreme, unstructured environments.

Original Abstract

Autonomous robots operating in natural karstic caves face perception and navigation challenges that are qualitatively distinct from those encountered in mines or tunnels: irregular geometry, reflective wet surfaces, near-zero ambient light, and complex branching passages. Yet publicly available datasets targeting this environment remain scarce and offer limited sensing modalities and environmental diversity. We present CAVERS, a multimodal dataset acquired in two structurally distinct rooms of Cueva de la Victoria, Málaga, Spain, comprising 24 sequences totaling approximately 335 GB of recorded data. The sensor suite combines an Intel RealSense D435i RGB-D-I camera, an Optris PI640i near-IR thermal camera, and a Velodyne VLP-16 LiDAR, operated both handheld and mounted on a wheeled rover under full darkness and artificial illumination. For most of the sequences, mm-accurate 6-DoF ground truth pose and velocity at 120 Hz are provided by an Optirack motion capture system installed directly inside the cave. We benchmark seven state-of-the-art SLAM and odometry algorithms spanning visual, visual-inertial, thermal-inertial, and LiDAR-based pipelines, as well as a 3D reconstruction pipeline, demonstrating the dataset's usability. %The dataset and all supplementary material are publicly available at: https://github.com/spaceuma/cavers.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.