ArXiv TLDR

A Predictive View on Streaming Hidden Markov Models

🐦 Tweet
2604.09208

Gerardo Duran-Martin

stat.MLcs.LG

TLDR

This paper introduces a predictive-first framework for streaming Hidden Markov Models, deriving a beam search algorithm for efficient online regime identification.

Key contributions

  • Introduces a predictive-first optimization framework for streaming HMMs, focusing on accurate step-ahead predictions.
  • Learns regime-specific predictive models online, using a fixed transition prior instead of full posterior recovery.
  • Derives a principled beam search algorithm for HMMs by approximating the posterior predictive with top-S paths.
  • Offers a recursive, deterministic algorithm with closed-form updates, avoiding EM or sampling.

Why it matters

Classical HMMs struggle with real-time streaming data due to computational demands for full posterior recovery. This paper offers a novel, efficient approach by prioritizing prediction. Its beam search derivation provides a practical, deterministic algorithm for online regime identification, making HMMs more applicable in dynamic environments.

Original Abstract

We develop a predictive-first optimisation framework for streaming hidden Markov models. Unlike classical approaches that prioritise full posterior recovery under a fully specified generative model, we assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes. Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions. Because the number of possible regime paths grows exponentially, exact filtering is infeasible. We therefore formulate streaming inference as a constrained projection problem in predictive-distribution space: under a fixed hypothesis budget, we approximate the full posterior predictive by the forward-KL optimal mixture supported on $S$ paths. The solution is the renormalised top-$S$ posterior-weighted mixture, providing a principled derivation of beam search for HMMs. The resulting algorithm is fully recursive and deterministic, performing beam-style truncation with closed-form predictive updates and requiring neither EM nor sampling. Empirical comparisons against Online EM and Sequential Monte Carlo under matched computational budgets demonstrate competitive prequential performance.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.