Investment Portfolio Optimization Based on Modern Portfolio Theory and Deep Learning Models
ArXiv ID: 2508.14999 “View on arXiv”
Authors: Maciej Wysocki, Paweł Sakowski
Abstract
This paper investigates an important problem of an appropriate variance-covariance matrix estimation in the Modern Portfolio Theory. We propose a novel framework for variancecovariance matrix estimation for purposes of the portfolio optimization, which is based on deep learning models. We employ the long short-term memory (LSTM) recurrent neural networks (RNN) along with two probabilistic deep learning models: DeepVAR and GPVAR to the task of one-day ahead multivariate forecasting. We then use these forecasts to optimize portfolios of stocks and cryptocurrencies. Our analysis presents results across different combinations of observation windows and rebalancing periods to compare performances of classical and deep learning variance-covariance estimation methods. The conclusions of the study are that although the strategies (portfolios) performance differed significantly between different combinations of parameters, generally the best results in terms of the information ratio and annualized returns are obtained using the LSTM-RNN models. Moreover, longer observation windows translate into better performance of the deep learning models indicating that these methods require longer windows to be able to efficiently capture the long-term dependencies of the variance-covariance matrix structure. Strategies with less frequent rebalancing typically perform better than these with the shortest rebalancing windows across all considered methods.
Keywords: covariance matrix estimation, LSTM, DeepVAR, portfolio optimization, multivariate forecasting, Multi-Asset (Equities & Crypto)
Complexity vs Empirical Score
- Math Complexity: 7.5/10
- Empirical Rigor: 8.0/10
- Quadrant: Holy Grail
- Why: The paper employs advanced math including stochastic processes (LSTM, VAR, GPVAR) and rigorous portfolio optimization frameworks, and demonstrates strong empirical rigor through extensive backtesting on stocks and cryptocurrencies with various parameters, code sharing for reproducibility, and detailed performance metrics like information ratio.
flowchart TD
A["Research Goal<br/>Optimize Portfolio via<br/>Improved Covariance Estimation"] --> B["Data Inputs<br/>Multi-Asset (Stocks & Crypto)"]
B --> C{"Model Comparison<br/>Observation Windows vs Rebalancing"}
C --> D["Classical Methods<br/>e.g., Sample Covariance"]
C --> E["Deep Learning Methods<br/>LSTM, DeepVAR, GPVAR"]
D & E --> F["Portfolio Optimization<br/>Mean-Variance Framework"]
F --> G["Performance Metrics<br/>Information Ratio &<br/>Annualized Returns"]
G --> H["Key Findings<br/>LSTM Best Performance<br/>Longer Windows Preferred<br/>Less Frequent Rebalancing"]