Flying by Inference: Active Inference World Models for Adaptive UAV Swarms

April 30, 20262604.27935

Kaleem Arshid, Ali Krayani, Lucio Marcenaro, David Martin Gomez, Carlo Regazzoni

cs.ROeess.SPeess.SY

TLDR

This paper introduces an active-inference framework for adaptive UAV swarm trajectory planning, learning from expert demonstrations for robust online operation.

Key contributions

Proposes an active-inference framework for adaptive UAV swarm trajectory planning, converting optimization to probabilistic inference.
Learns a hierarchical probabilistic world model from expert demonstrations (Mission, Route, Motion dictionaries).
Enables online mission allocation, route insertion, motion adaptation, and collision-aware replanning without re-optimization.
Integrates Bayesian state estimators (EKF, PF) for improved trajectory correction under real-world uncertainty.

Why it matters

This framework offers a novel way to manage complex UAV swarm behaviors by learning from experts and adapting online. It significantly improves robustness and efficiency by avoiding costly re-optimization in dynamic environments.

Original Abstract

This paper presents an expert-guided active-inference-inspired framework for adaptive UAV swarm trajectory planning. The proposed method converts multi-UAV trajectory design from a repeated combinatorial optimization problem into a hierarchical probabilistic inference problem. In the offline phase, a genetic-algorithm planner with repulsive-force collision avoidance (GA--RF) generates expert demonstrations, which are abstracted into Mission, Route, and Motion dictionaries. These dictionaries are used to learn a probabilistic world model that captures how expert mission allocations induce route orders and how route orders induce motion-level behaviors. During online operation, the UAV swarm evaluates candidate actions by forming posterior beliefs over symbolic states and minimizing KL-divergence-based abnormality indicators with respect to expert-derived reference distributions. This enables mission allocation, route insertion, motion adaptation, and collision-aware replanning without rerunning the offline optimizer. Bayesian state estimators, including EKF and PF modules, are integrated at the motion level to improve trajectory correction under uncertainty. Simulation results show that the proposed framework preserves expert-like planning structure while producing smoother and more stable behavior than modified Q-learning. Additional validation using real-flight UAV trajectory data demonstrates that the learned world model can correct symbolic predictions under noisy and non-smooth observations, supporting its applicability to adaptive UAV swarm autonomy.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers