Accelerated Portfolio Optimization and Option Pricing with Reinforcement Learning
ArXiv ID: 2507.01972 “View on arXiv”
Authors: Hadi Keramati, Samaneh Jazayeri
Abstract
We present a reinforcement learning (RL)-driven framework for optimizing block-preconditioner sizes in iterative solvers used in portfolio optimization and option pricing. The covariance matrix in portfolio optimization or the discretization of differential operators in option pricing models lead to large linear systems of the form $\mathbf{“A”}\textbf{“x”}=\textbf{“b”}$. Direct inversion of high-dimensional portfolio or fine-grid option pricing incurs a significant computational cost. Therefore, iterative methods are usually used for portfolios in real-world situations. Ill-conditioned systems, however, suffer from slow convergence. Traditional preconditioning techniques often require problem-specific parameter tuning. To overcome this limitation, we rely on RL to dynamically adjust the block-preconditioner sizes and accelerate iterative solver convergence. Evaluations on a suite of real-world portfolio optimization matrices demonstrate that our RL framework can be used to adjust preconditioning and significantly accelerate convergence and reduce computational cost. The proposed accelerated solver supports faster decision-making in dynamic portfolio allocation and real-time option pricing.
Keywords: portfolio optimization, reinforcement learning, iterative solvers, preconditioning, option pricing
Complexity vs Empirical Score
- Math Complexity: 7.5/10
- Empirical Rigor: 6.0/10
- Quadrant: Holy Grail
- Why: The paper employs advanced mathematical formulations including KKT conditions, finite difference discretizations of PDEs, and iterative solvers like FGMRES, indicating high math complexity. While it references real-world portfolio matrices and computational cost reductions, the lack of detailed backtest metrics, specific datasets, or implementation code places it in the moderate empirical rigor range.
flowchart TD
A["Research Goal: Accelerate iterative solvers<br>for portfolio & option pricing<br>using RL-driven preconditioning"] --> B["Methodology: RL Agent for<br>dynamic block-preconditioner sizing"]
B --> C["Data Inputs: Real-world portfolio matrices &<br>option pricing discretization systems"]
C --> D["Computational Process: RL-controlled<br>iterative solver with adaptive preconditioning"]
D --> E{"Key Outcomes"}
E --> F["Accelerated convergence of solvers"]
E --> G["Reduced computational cost & time"]
E --> H["Enhanced decision-making speed<br>for portfolio allocation & option pricing"]