A Predictive View on Streaming Hidden Markov Models
TLDR
This paper introduces a predictive-first framework for streaming Hidden Markov Models, deriving a beam search algorithm for efficient online regime identification.
Key contributions
- Introduces a predictive-first optimization framework for streaming HMMs, focusing on accurate step-ahead predictions.
- Learns regime-specific predictive models online, using a fixed transition prior instead of full posterior recovery.
- Derives a principled beam search algorithm for HMMs by approximating the posterior predictive with top-S paths.
- Offers a recursive, deterministic algorithm with closed-form updates, avoiding EM or sampling.
Why it matters
Classical HMMs struggle with real-time streaming data due to computational demands for full posterior recovery. This paper offers a novel, efficient approach by prioritizing prediction. Its beam search derivation provides a practical, deterministic algorithm for online regime identification, making HMMs more applicable in dynamic environments.
Original Abstract
We develop a predictive-first optimisation framework for streaming hidden Markov models. Unlike classical approaches that prioritise full posterior recovery under a fully specified generative model, we assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes. Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions. Because the number of possible regime paths grows exponentially, exact filtering is infeasible. We therefore formulate streaming inference as a constrained projection problem in predictive-distribution space: under a fixed hypothesis budget, we approximate the full posterior predictive by the forward-KL optimal mixture supported on $S$ paths. The solution is the renormalised top-$S$ posterior-weighted mixture, providing a principled derivation of beam search for HMMs. The resulting algorithm is fully recursive and deterministic, performing beam-style truncation with closed-form predictive updates and requiring neither EM nor sampling. Empirical comparisons against Online EM and Sequential Monte Carlo under matched computational budgets demonstrate competitive prequential performance.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.