Differentiable Environment-Trajectory Co-Optimization for Safe Multi-Agent Navigation
Zhan Gao, Gabriele Fadini, Stelian Coros, Amanda Prorok
TLDR
This paper introduces a differentiable bi-level optimization framework for co-optimizing environment configurations and multi-agent trajectories to enhance navigation safety.
Key contributions
- Co-optimizes environment configurations and multi-agent trajectories for safety.
- Formulates a bi-level problem: environment (safety) and trajectories (cost).
- Uses differentiable optimization with KKT and Implicit Function Theorem.
- Proposes a novel, measure-theory-validated metric for navigation safety.
Why it matters
This paper addresses a key limitation in multi-agent navigation by optimizing environments alongside agent trajectories. This novel bi-level approach significantly enhances safety and efficiency, crucial for real-world applications like autonomous logistics and urban transport.
Original Abstract
The environment plays a critical role in multi-agent navigation by imposing spatial constraints, rules, and limitations that agents must navigate around. Traditional approaches treat the environment as fixed, without exploring its impact on agents' performance. This work considers environment configurations as decision variables, alongside agent actions, to jointly achieve safe navigation. We formulate a bi-level problem, where the lower-level sub-problem optimizes agent trajectories that minimize navigation cost and the upper-level sub-problem optimizes environment configurations that maximize navigation safety. We develop a differentiable optimization method that iteratively solves the lower-level sub-problem with interior point methods and the upper-level sub-problem with gradient ascent. A key challenge lies in analytically coupling these two levels. We address this by leveraging KKT conditions and the Implicit Function Theorem to compute gradients of agent trajectories w.r.t. environment parameters, enabling differentiation throughout the bi-level structure. Moreover, we propose a novel metric that quantifies navigation safety as a criterion for the upper-level environment optimization, and prove its validity through measure theory. Our experiments validate the effectiveness of the proposed framework in a variety of safety-critical navigation scenarios, inspired from warehouse logistics to urban transportation. The results demonstrate that optimized environments provide navigation guidance, improving both agents' safety and efficiency.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.