Probabilistic Forecasting for Day-ahead Electricity Prices, Battery Trading Strategies and the Economic Evaluation of Predictive Accuracy
TLDR
This paper reveals flaws in using battery trading strategies to evaluate probabilistic electricity price forecasts and proposes a new stochastic programming method.
Key contributions
- Identifies two critical flaws in quantile-based trading strategies (QBTS) for forecast evaluation.
- Proposes framing battery optimization as a stochastic program using fully probabilistic forecasts.
- Examines decision quality measurement for risk-neutral and risk-averse settings under uncertainty.
- Highlights the pitfalls of ranking forecasting models through simplified battery trading strategies.
Why it matters
Accurate evaluation of electricity price forecasting models is crucial for energy market decision-making and asset operation. This paper exposes significant limitations in current application-based benchmarks, particularly those involving battery trading. It offers a more robust framework for assessing forecast quality, leading to better economic outcomes and more reliable energy market strategies.
Original Abstract
Electricity price forecasting supports decision-making in energy markets and asset operation. Probabilistic forecasts are increasingly adopted to explicitly quantify uncertainty, typically issued as quantile predictions or ensembles of the full predictive distribution. However, how improvements in statistical forecast quality translate into economic value remains unclear. Battery storage arbitrage in day-ahead markets is a popular application-based benchmark for this purpose. We analyze quantile-based trading strategies (QBTS) and identify two critical flaws: they do not incentivize honest probabilistic forecasting and they ignore the intertemporal dependence structure of electricity prices. We therefore frame battery optimization as a stochastic program based on fully probabilistic forecasts and examine decision quality measurement for risk-neutral and risk-averse settings under different uncertainty models. Our discussion touches both sides of the coin: How reliable is the economic evaluation of forecasting models though (simplified) application studies - and how do improvements in statistical forecast quality for stochastic programs relate to the decision-quality and economic performance? We provide theoretical justification and empirical evidence from a case study on the German electricity market. Our results highlight the pitfalls of ranking forecasting models through battery trading strategies. We conclude with implications for evaluation practice and directions for future research in application-based forecast assessment.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.