false

Error Propagation in Dynamic Programming: From Stochastic Control to Option Pricing

Error Propagation in Dynamic Programming: From Stochastic Control to Option Pricing ArXiv ID: 2509.20239 “View on arXiv” Authors: Andrea Della Vecchia, Damir Filipović Abstract This paper investigates theoretical and methodological foundations for stochastic optimal control (SOC) in discrete time. We start formulating the control problem in a general dynamic programming framework, introducing the mathematical structure needed for a detailed convergence analysis. The associate value function is estimated through a sequence of approximations combining nonparametric regression methods and Monte Carlo subsampling. The regression step is performed within reproducing kernel Hilbert spaces (RKHSs), exploiting the classical KRR algorithm, while Monte Carlo sampling methods are introduced to estimate the continuation value. To assess the accuracy of our value function estimator, we propose a natural error decomposition and rigorously control the resulting error terms at each time step. We then analyze how this error propagates backward in time-from maturity to the initial stage-a relatively underexplored aspect of the SOC literature. Finally, we illustrate how our analysis naturally applies to a key financial application: the pricing of American options. ...

September 24, 2025 · 2 min · Research Team

American Option Pricing Under Time-Varying Rough Volatility: A Signature-Based Hybrid Framework

American Option Pricing Under Time-Varying Rough Volatility: A Signature-Based Hybrid Framework ArXiv ID: 2508.07151 “View on arXiv” Authors: Roshan Shah Abstract We introduce a modular framework that extends the signature method to handle American option pricing under evolving volatility roughness. Building on the signature-pricing framework of Bayer et al. (2025), we add three practical innovations. First, we train a gradient-boosted ensemble to estimate the time-varying Hurst parameter H(t) from rolling windows of recent volatility data. Second, we feed these forecasts into a regime switch that chooses either a rough Bergomi or a calibrated Heston simulator, depending on the predicted roughness. Third, we accelerate signature-kernel evaluations with Random Fourier Features (RFF), cutting computational cost while preserving accuracy. Empirical tests on S&P 500 equity-index options reveal that the assumption of persistent roughness is frequently violated, particularly during stable market regimes when H(t) approaches or exceeds 0.5. The proposed hybrid framework provides a flexible structure that adapts to changing volatility roughness, improving performance over fixed-roughness baselines and reducing duality gaps in some regimes. By integrating a dynamic Hurst parameter estimation pipeline with efficient kernel approximations, we propose to enable tractable, real-time pricing of American options in dynamic volatility environments. ...

August 10, 2025 · 2 min · Research Team

Deep Penalty Methods: A Class of Deep Learning Algorithms for Solving High Dimensional Optimal Stopping Problems

Deep Penalty Methods: A Class of Deep Learning Algorithms for Solving High Dimensional Optimal Stopping Problems ArXiv ID: 2405.11392 “View on arXiv” Authors: Unknown Abstract We propose a deep learning algorithm for high dimensional optimal stopping problems. Our method is inspired by the penalty method for solving free boundary PDEs. Within our approach, the penalized PDE is approximated using the Deep BSDE framework proposed by \cite{“weinan2017deep”}, which leads us to coin the term “Deep Penalty Method (DPM)” to refer to our algorithm. We show that the error of the DPM can be bounded by the loss function and $O(\frac{“1”}λ)+O(λh) +O(\sqrt{“h”})$, where $h$ is the step size in time and $λ$ is the penalty parameter. This finding emphasizes the need for careful consideration when selecting the penalization parameter and suggests that the discretization error converges at a rate of order $\frac{“1”}{“2”}$. We validate the efficacy of the DPM through numerical tests conducted on a high-dimensional optimal stopping model in the area of American option pricing. The numerical tests confirm both the accuracy and the computational efficiency of our proposed algorithm. ...

May 18, 2024 · 2 min · Research Team

On an Optimal Stopping Problem with a Discontinuous Reward

On an Optimal Stopping Problem with a Discontinuous Reward ArXiv ID: 2311.03538 “View on arXiv” Authors: Unknown Abstract We study an optimal stopping problem with an unbounded, time-dependent and discontinuous reward function. This problem is motivated by the pricing of a variable annuity contract with guaranteed minimum maturity benefit, under the assumption that the policyholder’s surrender behaviour maximizes the risk-neutral value of the contract. We consider a general fee and surrender charge function, and give a condition under which optimal stopping always occurs at maturity. Using an alternative representation for the value function of the optimization problem, we study its analytical properties and the resulting surrender (or exercise) region. In particular, we show that the non-emptiness and the shape of the surrender region are fully characterized by the fee and the surrender charge functions, which provides a powerful tool to understand their interrelation and how it affects early surrenders and the optimal surrender boundary. Under certain conditions on these two functions, we develop three representations for the value function; two are analogous to their American option counterpart, and one is new to the actuarial and American option pricing literature. ...

November 6, 2023 · 2 min · Research Team