Rare Event Analysis via Stochastic Optimal Control

April 14, 20262604.13213

Yuanqi Du, Jiajun He, Dinghuai Zhang, Eric Vanden-Eijnden, Carles Domingo-Enrich

stat.MLcs.LGmath.OCphysics.chem-ph

TLDR

A stochastic optimal control framework is proposed for rare event analysis, accurately estimating committor functions and improving kinetic predictions.

Key contributions

Casts committor estimation as a stochastic optimal control (SOC) problem for rare event analysis.
Develops direct backpropagation and off-policy Value Matching losses with optimality guarantees.
Introduces an alternative sampling process to overcome metastability in controlled trajectories.
Achieves significantly more accurate committor estimates, reaction rates, and equilibrium constants.

Why it matters

Rare events are critical to many physical systems but are computationally challenging. This framework offers a novel and more accurate approach to analyze them, enabling better understanding of complex processes like biomolecular changes and phase transitions.

Original Abstract

Rare events such as conformational changes in biomolecules, phase transitions, and chemical reactions are central to the behavior of many physical systems, yet they are extremely difficult to study computationally because unbiased simulations seldom produce them. Transition Path Theory (TPT) provides a rigorous statistical framework for analyzing such events: it characterizes the ensemble of reactive trajectories between two designated metastable states (reactant and product), and its central object--the committor function, which gives the probability that the system will next reach the product rather than the reactant--encodes all essential kinetic and thermodynamic information. We introduce a framework that casts committor estimation as a stochastic optimal control (SOC) problem. In this formulation the committor defines a feedback control--proportional to the gradient of its logarithm--that actively steers trajectories toward the reactive region, thereby enabling efficient sampling of reactive paths. To solve the resulting hitting-time control problem we develop two complementary objectives: a direct backpropagation loss and a principled off-policy Value Matching loss, for which we establish first-order optimality guarantees. We further address metastability, which can trap controlled trajectories in intermediate basins, by introducing an alternative sampling process that preserves the reactive current while lowering effective energy barriers. On benchmark systems, the framework yields markedly more accurate committor estimates, reaction rates, and equilibrium constants than existing methods.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers