Information bottleneck for learning the phase space of dynamics from high-dimensional experimental data
K. Michael Martini, Eslam Abdelaleem, Paarth Gulati, Ilya Nemenman
TLDR
DySIB learns low-dimensional, interpretable dynamical state variables from high-dimensional experimental data by maximizing predictive information.
Key contributions
- Introduces DySIB, a Dynamical Symmetric Information Bottleneck for learning state variables.
- Maximizes predictive mutual information in latent space, avoiding high-dimensional data reconstruction.
- Successfully recovers the known 2D phase space of an experimental pendulum video.
- Learned coordinates align smoothly with canonical angle and angular velocity.
Why it matters
This paper addresses the critical challenge of extracting meaningful dynamical states from complex, high-dimensional observations without supervision. DySIB offers a novel, robust approach that directly learns interpretable coordinates. This could significantly advance understanding and modeling across physical sciences.
Original Abstract
Identifying the dynamical state variables of a system from high-dimensional observations is a central problem across physical sciences. The challenge is that the state variables are not directly observable and must be inferred from raw high-dimensional data without supervision. Here we introduce DySIB (Dynamical Symmetric Information Bottleneck) as a method to learn low-dimensional representations of time-series data by maximizing predictive mutual information between past and future observation windows while penalizing representation complexity. This objective operates entirely in latent space and avoids reconstruction of the observations. We apply DySIB to an experimental video dataset of a physical pendulum, where the underlying state space is known. The method, with hyperparameters of the learning architecture set self-consistently by the data, recovers a two-dimensional representation that matches the dimensionality, topology, and geometry of the pendulum phase space, with the learned coordinates aligning smoothly with the canonical angle and angular velocity. These results demonstrate, on a well-characterized experimental system, that predictive information in latent space can be used to recover interpretable dynamical coordinates directly from high-dimensional data.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.