Robots that learn to evaluate models of collective behavior
Mathis Hocke, Andreas Gerken, David Bierbach, Jens Krause, Tim Landgraf
TLDR
This paper introduces a robotic fish framework using RL to evaluate animal behavior models through closed-loop interaction, showing CNN models are more accurate.
Key contributions
- Introduces an RL-based framework using a robotic fish (RoboFish) for closed-loop evaluation of animal behavior models.
- Trains policies in simulation with various fish models (rule-based, CNN) and transfers them to real RoboFish.
- Quantifies "sim-to-real gaps" using Wasserstein distance on behavioral metrics like goal-reaching and distances.
- Demonstrates that a CNN-based model achieved higher fidelity, outperforming traditional rule-based models.
Why it matters
This work provides a novel, quantitative method for evaluating animal behavior models via embodied robotic interaction. It moves beyond static comparisons, enabling researchers to identify model deficiencies and improve accuracy, advancing our understanding of collective behavior.
Original Abstract
Understanding and modeling animal behavior is essential for studying collective motion, decision-making, and bio-inspired robotics. Yet, evaluating the accuracy of behavioral models still often relies on offline comparisons to static trajectory statistics. Here we introduce a reinforcement-learning-based framework that uses a biomimetic robotic fish (RoboFish) to evaluate computational models of live fish behavior through closed-loop interaction. We trained policies in simulation using four distinct fish models-a simple constant-follow baseline, two rule-based models, and a biologically grounded convolutional neural network model-and transferred these policies to the real RoboFish setup, where they interacted with live fish. Policies were trained to guide a simulated fish to goal locations, enabling us to quantify how the response of real fish differs from the simulated fish's response. We evaluate the fish models by quantifying the sim-to-real gaps, defined as the Wasserstein distance between simulated and real distributions of behavioral metrics such as goal-reaching performance, inter-individual distances, wall interactions, and alignment. The neural network-based fish model exhibited the smallest gap across goal-reaching performance and most other metrics, indicating higher behavioral fidelity than conventional rule-based models under this benchmark. More importantly, this separation shows that the proposed evaluation can quantitatively distinguish candidate models under matched closed-loop conditions. Our work demonstrates how learning-based robotic experiments can uncover deficiencies in behavioral models and provides a general framework for evaluating animal behavior models through embodied interaction.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.