ArXiv TLDR

Differentiable Environment-Trajectory Co-Optimization for Safe Multi-Agent Navigation

🐦 Tweet
2604.06972

Zhan Gao, Gabriele Fadini, Stelian Coros, Amanda Prorok

cs.ROcs.MA

TLDR

This paper introduces a differentiable bi-level optimization framework for co-optimizing environment configurations and multi-agent trajectories to enhance navigation safety.

Key contributions

  • Co-optimizes environment configurations and multi-agent trajectories for safety.
  • Formulates a bi-level problem: environment (safety) and trajectories (cost).
  • Uses differentiable optimization with KKT and Implicit Function Theorem.
  • Proposes a novel, measure-theory-validated metric for navigation safety.

Why it matters

This paper addresses a key limitation in multi-agent navigation by optimizing environments alongside agent trajectories. This novel bi-level approach significantly enhances safety and efficiency, crucial for real-world applications like autonomous logistics and urban transport.

Original Abstract

The environment plays a critical role in multi-agent navigation by imposing spatial constraints, rules, and limitations that agents must navigate around. Traditional approaches treat the environment as fixed, without exploring its impact on agents' performance. This work considers environment configurations as decision variables, alongside agent actions, to jointly achieve safe navigation. We formulate a bi-level problem, where the lower-level sub-problem optimizes agent trajectories that minimize navigation cost and the upper-level sub-problem optimizes environment configurations that maximize navigation safety. We develop a differentiable optimization method that iteratively solves the lower-level sub-problem with interior point methods and the upper-level sub-problem with gradient ascent. A key challenge lies in analytically coupling these two levels. We address this by leveraging KKT conditions and the Implicit Function Theorem to compute gradients of agent trajectories w.r.t. environment parameters, enabling differentiation throughout the bi-level structure. Moreover, we propose a novel metric that quantifies navigation safety as a criterion for the upper-level environment optimization, and prove its validity through measure theory. Our experiments validate the effectiveness of the proposed framework in a variety of safety-critical navigation scenarios, inspired from warehouse logistics to urban transportation. The results demonstrate that optimized environments provide navigation guidance, improving both agents' safety and efficiency.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.