ArXiv TLDR

Sim-to-Real Transfer and Robustness Evaluation of Reinforcement Learning Control with Integrated Perception on an ASV for Floating Waste Capture

🐦 Tweet
2605.02529

Luis F. W. Batista, Stéphanie Aravecchia, Cédric Pradalier

cs.RO

TLDR

This paper presents a field-validated ASV system for floating waste capture, using sim-to-real DRL control and camera-based perception.

Key contributions

  • Developed a field-validated ASV system for floating waste capture using DRL control and polarimetric perception.
  • Introduced a sim-to-real testing methodology with a two-stage simulation and perception abstraction.
  • Evaluated robustness across 14 disturbance regimes, achieving centimeter-level accuracy in field trials.

Why it matters

This paper provides a robust, field-validated solution for autonomous floating waste capture, addressing key challenges in ASV operation. Its sim-to-real methodology and practical lessons are crucial for reliable deployment of DRL in real-world robotic systems.

Original Abstract

Autonomous surface vessels for floating-waste removal operate under varying hydrodynamics, external disturbances, and challenging water-surface perception. We present a field-validated system that combines camera-based polarimetric perception with a lightweight DRL-based controller for floating-waste detection and capture. Camera detections are converted into water-surface target points and tracked by a controller trained entirely in simulation and deployed directly on a retrofitted ASV platform. Our main contribution is a sim-to-real testing methodology that combines a two-stage simulation protocol with a perception abstraction module designed to mimic real camera behavior, enabling reproducible field trials and explicit evaluation of the sim-to-real gap. We apply this framework in matched simulation and field experiments across 14 disturbance regimes to expose failure modes and evaluate robustness. The results show centimeter-level terminal accuracy and indicate robust control performance under the evaluated perturbation regimes. The main source of degradation is insufficient actuation-model fidelity. We also demonstrate the system in a search-and-capture application using real camera detections in real-world conditions over areas of up to $450~m^2$. The study distills practical lessons for reliable transfer, including improved actuation-model fidelity, targeted domain randomization, and careful management of latency and timestamps across modules, while highlighting remaining challenges.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.