Non-Equilibrium Stochastic Dynamics as a Unified Framework for Insight and Repetitive Learning: A Kramers Escape Approach to Continual Learning

April 5, 20262604.04154

cond-mat.stat-mechcond-mat.dis-nncs.LGq-bio.NC

TLDR

This paper uses non-equilibrium stochastic dynamics and Kramers escape to model continual learning, explaining plasticity collapse and unifying insight with repetitive learning.

Key contributions

Models learning as Langevin dynamics on a double-well energy landscape.
Identifies EWC penalty as an energy barrier, explaining exponential plasticity collapse.
Unifies insight (transient T spikes) and repetitive learning (fixed elevated T) via distinct temperature protocols.
Offers a physically grounded framework for continual learning and adaptive noise schedule design.

Why it matters

This paper offers a novel, physically grounded framework for understanding the stability-plasticity dilemma in continual learning. It explains the collapse of plasticity in methods like EWC and unifies insight with repetitive learning, suggesting principled design criteria for adaptive noise schedules in AI.

Original Abstract

Continual learning in artificial neural networks is fundamentally limited by the stability--plasticity dilemma: systems that retain prior knowledge tend to resist acquiring new knowledge, and vice versa. Existing approaches, most notably elastic weight consolidation~(EWC), address this empirically without a physical account of why plasticity eventually collapses as tasks accumulate. Separately, the distinction between sudden insight and gradual skill acquisition through repetitive practice has lacked a unified theoretical description. Here, we show that both problems admit a common resolution within non-equilibrium statistical physics. We model the state of a learning system as a particle evolving under Langevin dynamics on a double-well energy landscape, with the noise amplitude governed by a time-dependent effective temperature $T(t)$. The probability density obeys a Fokker--Planck equation, and transitions between metastable states are governed by the Kramers escape rate $k = (ω_0ω_b/2π)\,e^{-ΔE/T}$. We make two contributions. First, we identify the EWC penalty term as an energy barrier whose height grows linearly with the number of accumulated tasks, yielding an exponential collapse of the transition rate predicted analytically and confirmed numerically. Second, we show that insight and repetitive learning correspond to two qualitatively distinct temperature protocols within the same Fokker--Planck equation: insight events produce transient spikes in $T(t)$ that drive rapid barrier crossing, whereas repetitive practice operates at a modestly elevated but fixed temperature, achieving transitions through sustained stochastic diffusion. These results establish a physically grounded framework for understanding plasticity and its failure in continual learning systems, and suggest principled design criteria for adaptive noise schedules in artificial intelligence.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers