Conformal Prediction with Time-Series Data via Sequential Conformalized Density Regions
TLDR
SCDR is a new conformal prediction method for time-series data, offering guaranteed coverage and producing flexible prediction sets including bifurcations.
Key contributions
- Proposes Sequential Conformalized Density Regions (SCDR) for time-series prediction.
- Guarantees asymptotic conditional coverage rate for prediction sets.
- Generates flexible prediction sets, including disconnected regions for bifurcations.
- Doubly robust, working even if predictive density or score models are partially misspecified.
Why it matters
This paper introduces SCDR, a robust method for time-series conformal prediction. It provides more accurate and informative prediction sets than existing methods, crucial for understanding complex dynamics like bifurcations. This improves reliability in forecasting and anomaly detection.
Original Abstract
We propose a new conformal prediction method for time-series data with a guaranteed asymptotic conditional coverage rate, Sequential Conformalized Density Regions (SCDR), which is flexible enough to produce both prediction intervals and disconnected prediction sets, signifying the emergence of bifurcations. Our approach uses existing estimated conditional highest density predictive regions to form initial predictive regions. We then use a quantile random forest conformal adjustment to provide guaranteed coverage while adaptively changing to take the non-exchangeable nature of time-series data into account. We show that the proposed method achieves the guaranteed coverage rate asymptotically under certain regularity conditions. In particular, the method is doubly robust -- it works if the predictive density model is correctly specified and/or if the scores follow a nonlinear autoregressive model with the correct order specified. Simulations reveal that the proposed method outperforms existing methods in terms of empirical coverage rates and set sizes. We illustrate the method using two real datasets, the Old Faithful geyser dataset and the Australian electricity usage dataset. Prediction sets formed using SCDR for the geyser eruption durations include both single intervals and unions of two intervals, whereas existing methods produce wider, less informative, single-interval prediction sets.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.