Rolling-Origin Conformal Prediction under Local Stationarity and Weak Dependence
TLDR
Rolling-origin conformal prediction adapts to time-series dependence, achieving optimal coverage with adaptive calibration windows.
Key contributions
- Introduces rolling-origin conformal prediction for time-series with local stationarity and weak dependence.
- Derives optimal calibration window size and minimax-optimal coverage error rates under Hölder-β and α-mixing conditions.
- Proves Bahadur representation and oracle inequality for adaptive window selection via Winkler cross-validation.
- Validates method on real and M4 competition data, outperforming full-history calibration in 86% of cases.
Why it matters
This paper advances time-series forecasting by providing a theoretically optimal, adaptive conformal prediction method that handles dependence and distribution shifts, improving coverage accuracy and practical performance.
Original Abstract
We propose and analyse rolling-origin conformal prediction for time-series forecasting. The method calibrates the conformal quantile against the $m$ most recent pseudo-out-of-sample forecast errors, adapting to serial dependence, volatility clustering, and distributional drift that invalidate classical conformal guarantees. Under Hölder-$β$ local stationarity and $α$-mixing, we establish a four-term coverage-error decomposition and derive the optimal calibration window $m^{\star} \asymp T^{2β/(2β+1)}$ with coverage-error rate $O(T^{-β/(2β+1)})$. A Le Cam two-point construction shows this rate is minimax-optimal over the Hölder-$β$ model class. The Bahadur representation is proved under both $α$-mixing and the physical-dependence framework of Wu (2005). An oracle inequality formalises Winkler cross-validation as an adaptive window selector; the required uniform concentration condition is established in an appendix. Validation on six real series and 93 M4 competition series confirms the theory: rolling-origin calibration outperforms full-history calibration in 86\% of comparisons (median Winkler improvement 12.3\%), maintains coverage within $\pm2\%$ of the 90\% target at short and medium horizons, and the cross-frequency log-log regression slope $0.614$ ($95\%$ CI $[0.424, 0.805]$) is consistent with the theoretical $2/3$ after controlling for frequency fixed effects.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.