Conditional Score-Based Modeling of Effective Langevin Dynamics
TLDR
A new data-driven method calibrates stochastic reduced-order models by linking conditional scores to finite-lag statistics, enhancing accuracy and scalability.
Key contributions
- Introduces a novel data-driven calibration method for stochastic reduced-order models.
- Leverages a new relationship between model coefficients and the conditional score of finite-time transition density.
- Constrains drift and diffusion directly from finite-lag statistics, avoiding common computational pitfalls.
- Formulates calibration as a least-squares problem, improving scalability and accuracy for complex systems.
Why it matters
This paper significantly advances calibrating stochastic reduced-order models, crucial for complex systems. It offers a scalable, accurate method, overcoming limitations of existing approaches for high-dimensional or unevenly sampled data. This enables more reliable and efficient modeling in various scientific and engineering fields.
Original Abstract
Stochastic reduced-order models are widely used to represent the effective dynamics of complex systems, but estimating their drift and diffusion coefficients from data remains challenging. Standard approaches often rely on short-time trajectory increments, state-space partitioning, or repeated simulation of candidate models, which become unreliable or computationally expensive for high-dimensional systems, coarse temporal sampling, or unevenly sampled data. We introduce a data-driven calibration method based on a novel relationship between the coefficients of a stochastic reduced model and the conditional score of the finite-time transition density, defined as the gradient of the logarithm of the transition density with respect to the initial state. The resulting identity expresses derivatives of lagged correlation functions as stationary expectations over observed lagged pairs involving this conditional score and the unknown model coefficients. This formulation allows the drift and diffusion structure to be constrained directly from finite-lag statistics, without differentiating trajectories, partitioning state space, or repeatedly integrating candidate reduced models during calibration, yielding a least-squares fitting problem over stationary lagged pairs. We validate the approach on analytically tractable and data-driven nonequilibrium diffusions, demonstrating that the inferred models preserve the invariant statistics while accurately reproducing finite-lag dynamical correlations. The framework provides a scalable route for learning stochastic reduced-order models from data that reproduce prescribed statistical and dynamical properties.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.