Sim-to-Real Transfer and Robustness Evaluation of Reinforcement Learning Control with Integrated Perception on an ASV for Floating Waste Capture

May 4, 20262605.02529

Luis F. W. Batista, Stéphanie Aravecchia, Cédric Pradalier

cs.RO

TLDR

This paper presents a field-validated ASV system for floating waste capture, using sim-to-real DRL control and camera-based perception.

Key contributions

Developed a field-validated ASV system for floating waste capture using DRL control and polarimetric perception.
Introduced a sim-to-real testing methodology with a two-stage simulation and perception abstraction.
Evaluated robustness across 14 disturbance regimes, achieving centimeter-level accuracy in field trials.

Why it matters

This paper provides a robust, field-validated solution for autonomous floating waste capture, addressing key challenges in ASV operation. Its sim-to-real methodology and practical lessons are crucial for reliable deployment of DRL in real-world robotic systems.

Original Abstract

Autonomous surface vessels for floating-waste removal operate under varying hydrodynamics, external disturbances, and challenging water-surface perception. We present a field-validated system that combines camera-based polarimetric perception with a lightweight DRL-based controller for floating-waste detection and capture. Camera detections are converted into water-surface target points and tracked by a controller trained entirely in simulation and deployed directly on a retrofitted ASV platform. Our main contribution is a sim-to-real testing methodology that combines a two-stage simulation protocol with a perception abstraction module designed to mimic real camera behavior, enabling reproducible field trials and explicit evaluation of the sim-to-real gap. We apply this framework in matched simulation and field experiments across 14 disturbance regimes to expose failure modes and evaluate robustness. The results show centimeter-level terminal accuracy and indicate robust control performance under the evaluated perturbation regimes. The main source of degradation is insufficient actuation-model fidelity. We also demonstrate the system in a search-and-capture application using real camera detections in real-world conditions over areas of up to $450~m^2$. The study distills practical lessons for reliable transfer, including improved actuation-model fidelity, targeted domain randomization, and careful management of latency and timestamps across modules, while highlighting remaining challenges.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers