A Predictive View on Streaming Hidden Markov Models

April 10, 20262604.09208

stat.MLcs.LG

TLDR

This paper introduces a predictive-first framework for streaming Hidden Markov Models, deriving a beam search algorithm for efficient online regime identification.

Key contributions

Introduces a predictive-first optimization framework for streaming HMMs, focusing on accurate step-ahead predictions.
Learns regime-specific predictive models online, using a fixed transition prior instead of full posterior recovery.
Derives a principled beam search algorithm for HMMs by approximating the posterior predictive with top-S paths.
Offers a recursive, deterministic algorithm with closed-form updates, avoiding EM or sampling.

Why it matters

Classical HMMs struggle with real-time streaming data due to computational demands for full posterior recovery. This paper offers a novel, efficient approach by prioritizing prediction. Its beam search derivation provides a practical, deterministic algorithm for online regime identification, making HMMs more applicable in dynamic environments.

Original Abstract

We develop a predictive-first optimisation framework for streaming hidden Markov models. Unlike classical approaches that prioritise full posterior recovery under a fully specified generative model, we assume access to regime-specific predictive models whose parameters are learned online while maintaining a fixed transition prior over regimes. Our objective is to sequentially identify latent regimes while maintaining accurate step-ahead predictive distributions. Because the number of possible regime paths grows exponentially, exact filtering is infeasible. We therefore formulate streaming inference as a constrained projection problem in predictive-distribution space: under a fixed hypothesis budget, we approximate the full posterior predictive by the forward-KL optimal mixture supported on $S$ paths. The solution is the renormalised top-$S$ posterior-weighted mixture, providing a principled derivation of beam search for HMMs. The resulting algorithm is fully recursive and deterministic, performing beam-style truncation with closed-form predictive updates and requiring neither EM nor sampling. Empirical comparisons against Online EM and Sequential Monte Carlo under matched computational budgets demonstrate competitive prequential performance.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers