Safety-Critical Contextual Control via Online Riemannian Optimization with World Models

April 21, 20262604.19639

eess.SYcs.AI

TLDR

This paper introduces a safety-critical contextual control framework using online Riemannian optimization and world models, achieving robust performance.

Key contributions

Introduces Penalized Predictive Control (PPC) for safety-critical contextual control with black-box simulators.
Uses a score-based density from the simulator to define a Riemannian geometry guiding action space optimization.
Barrier curvature $κ(ξ_t)$ governs convergence rate and safety margin, replacing unknown dynamics' Lipschitz constant.
Demonstrates superior performance in dynamic navigation tasks, especially after environment shifts.

Why it matters

This paper addresses safety-critical control with complex world models by introducing a novel sample-based Riemannian optimization framework. It ensures safety and performance using black-box simulators, offering a robust and adaptable solution for dynamic environments.

Original Abstract

Modern world models are becoming too complex to admit explicit dynamical descriptions. We study safety-critical contextual control, where a Planner must optimize a task objective using only feasibility samples from a black-box Simulator, conditioned on a context signal $ξ_t$. We develop a sample-based Penalized Predictive Control (PPC) framework grounded in online Riemannian optimization, in which the Simulator compresses the feasibility manifold into a score-based density $\hat{p}(u \mid ξ_t)$ that endows the action space with a Riemannian geometry guiding the Planner's gradient descent. The barrier curvature $κ(ξ_t)$, the minimum curvature of the conditional log-density $-\ln\hat{p}(\cdot\midξ_t)$, governs both convergence rate and safety margin, replacing the Lipschitz constant of the unknown dynamics. Our main result is a contextual safety bound showing that the distance from the true feasibility manifold is controlled by the score estimation error and a ratio that depends on $κ(ξ_t)$, both of which improve with richer context. Simulations on a dynamic navigation task confirm that contextual PPC substantially outperforms marginal and frozen density models, with the advantage growing after environment shifts.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers