ArXiv TLDR

SPLICE: Latent Diffusion over JEPA Embeddings for Conformal Time-Series Inpainting

🐦 Tweet
2605.00126

Arnaud Zinflou

cs.LGeess.SPstat.ML

TLDR

SPLICE is a framework for reliable time-series inpainting using latent diffusion over JEPA embeddings, providing guaranteed prediction intervals.

Key contributions

  • Introduces SPLICE, a modular framework for time-series inpainting with reliable, online-adaptive prediction intervals.
  • Utilizes a JEPA encoder for latent space mapping and Adaptive Conformal Inference (ACI) for coverage guarantees.
  • Achieves state-of-the-art accuracy (lowest MSE, best CRPS) and 93-95% empirical coverage.
  • Offers 5-10x speedup with flow-matching and strong transferability across unseen domains.

Why it matters

Existing time-series imputation models lack reliability guarantees, crucial for critical applications like power systems. SPLICE addresses this by providing distribution-free, online-adaptive prediction intervals, ensuring trustworthy imputed values. This significantly enhances the safety and planning capabilities for real-world time-series data.

Original Abstract

Generative models for time-series imputation achieve strong reconstruction accuracy, yet provide no finite-sample reliability guarantees, a critical limitation in power systems where imputed values inform dispatch and planning. We introduce SPLICE (Self-supervised Predictive Latent Inpainting with Conformal Envelopes), a modular framework coupling latent generative imputation with distribution-free, online-adaptive prediction intervals. A JEPA encoder maps daily load segments into a 64-dimensional latent space; a conditional latent bridge with four sampling modes generates candidate gap trajectories; an hourly-conditioned decoder maps back to signal space; and Adaptive Conformal Inference (ACI) wraps the output with coverage-guaranteed prediction bands. The flow-matching variant achieves comparable quality to DDIM in 5--10 ODE steps (5-10x speedup). On thirteen load datasets (nine proprietary, three UCI Electricity, ETTh1), SPLICE achieves the lowest mean Load-only MSE (0.056), winning 9/12 non-degenerate datasets at 91-day gaps and 18/32 across all gap lengths vs. five established baselines, and produces the best CRPS (0.161, -18.3% vs. the strongest competitor). ACI delivers 93--95% empirical coverage, correcting under-coverage failures of up to 7.5 pp observed with static conformal prediction. A pooled JEPA encoder trained on nine feeds transfers to four unseen domains, matching or exceeding per-dataset oracles with only a quick bridge fine-tuning.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.