Online Multivariate Regularized Distributional Regression for High-dimensional Probabilistic Electricity Price Forecasting

ArXiv ID: 2504.02518 “View on arXiv”

Authors: Unknown

Abstract

Probabilistic electricity price forecasting (PEPF) is vital for short-term electricity markets, yet the multivariate nature of day-ahead prices - spanning 24 consecutive hours - remains underexplored. At the same time, real-time decision-making requires methods that are both accurate and fast. We introduce an online algorithm for multivariate distributional regression models, allowing an efficient modelling of the conditional means, variances, and dependence structures of electricity prices. The approach combines multivariate distributional regression with online coordinate descent and LASSO-type regularization, enabling scalable estimation in high-dimensional covariate spaces. Additionally, we propose a regularized estimation path over increasingly complex dependence structures, allowing for early stopping and avoiding overfitting. In a case study of the German day-ahead market, our method outperforms a wide range of benchmarks, showing that modeling dependence improves both calibration and predictive accuracy. Furthermore, we analyse the trade-off between predictive accuracy and computational costs for batch and online estimation and provide an high-performing open-source Python implementation in the ondil package.

Keywords: probabilistic electricity price forecasting (PEPF), multivariate distributional regression, online coordinate descent, LASSO-type regularization, dependence structures, Commodities

Complexity vs Empirical Score

  • Math Complexity: 7.5/10
  • Empirical Rigor: 7.0/10
  • Quadrant: Holy Grail
  • Why: The paper employs advanced multivariate distributional regression with LASSO regularization and online coordinate descent, indicating high mathematical sophistication; it includes a comprehensive case study with benchmark comparisons, open-source implementation, and detailed computational cost analysis, demonstrating strong empirical rigor.
  flowchart TD
    A["Research Goal:<br>High-dimensional Probabilistic<br>Electricity Price Forecasting"] --> B["Methodology:<br>Online Multivariate<br>Distributional Regression"]
    
    B --> C{"Data & Inputs:<br>German Day-Ahead Market<br>24-hour price vectors + Covariates"}
    
    C --> D["Computational Process:<br>Online Coordinate Descent<br>+ LASSO Regularization"]
    
    D --> E["Regularization Path:<br>Iterative Complexity<br>Selection & Early Stopping"]
    
    E --> F["Key Outcomes:<br>• Superior Accuracy<br>• Improved Calibration<br>• Efficient Computation<br>• Open-source Python package"]
    
    F --> G["Conclusion:<br>Modeling dependence<br>enhances PEPF performance"]