false

Deep Learning for Continuous-time Stochastic Control with Jumps

Deep Learning for Continuous-time Stochastic Control with Jumps ArXiv ID: 2505.15602 “View on arXiv” Authors: Patrick Cheridito, Jean-Loup Dupret, Donatien Hainaut Abstract In this paper, we introduce a model-based deep-learning approach to solve finite-horizon continuous-time stochastic control problems with jumps. We iteratively train two neural networks: one to represent the optimal policy and the other to approximate the value function. Leveraging a continuous-time version of the dynamic programming principle, we derive two different training objectives based on the Hamilton-Jacobi-Bellman equation, ensuring that the networks capture the underlying stochastic dynamics. Empirical evaluations on different problems illustrate the accuracy and scalability of our approach, demonstrating its effectiveness in solving complex, high-dimensional stochastic control tasks. ...

May 21, 2025 · 2 min · Research Team

A Simple Strategy to Deal with Toxic Flow

A Simple Strategy to Deal with Toxic Flow ArXiv ID: 2503.18005 “View on arXiv” Authors: Unknown Abstract We model the trading activity between a broker and her clients (informed and uninformed traders) as an infinite-horizon stochastic control problem. We derive the broker’s optimal dealing strategy in closed form and use this to introduce an algorithm that bypasses the need to calibrate individual parameters, so the dealing strategy can be executed in real-world trading environments. Finally, we characterise the discount in the price of liquidity a broker offers clients. The discount strikes the optimal balance between maximising the order flow from the broker’s clients and minimising adverse selection losses to the informed traders. ...

March 23, 2025 · 2 min · Research Team

Intraday Battery Dispatch for Hybrid Renewable Energy Assets

Intraday Battery Dispatch for Hybrid Renewable Energy Assets ArXiv ID: 2503.12305 “View on arXiv” Authors: Unknown Abstract We develop a mathematical model for intraday dispatch of co-located wind-battery energy assets. Focusing on the primary objective of firming grid-side actual production vis-a-vis the preset day-ahead hourly generation targets, we conduct a comprehensive study of the resulting stochastic control problem across different firming formulations and wind generation dynamics. Among others, we provide a closed-form solution in the special case of a quadratic objective and linear dynamics, as well as design a novel adaptation of a Gaussian Process-based Regression Monte Carlo algorithm for our setting. Extensions studied include an asymmetric loss function for peak shaving, capturing the cost of battery cycling, and the role of battery duration. In the applied portion of our work, we calibrate our model to a collection of 140+ wind-battery assets in Texas, benchmarking the economic benefits of firming based on outputs of a realistic unit commitment and economic dispatch solver. ...

March 16, 2025 · 2 min · Research Team

Multi-asset optimal trade execution with stochastic cross-effects: An Obizhaeva-Wang-type framework

Multi-asset optimal trade execution with stochastic cross-effects: An Obizhaeva-Wang-type framework ArXiv ID: 2503.05594 “View on arXiv” Authors: Unknown Abstract We analyze a continuous-time optimal trade execution problem in multiple assets where the price impact and the resilience can be matrix-valued stochastic processes that incorporate cross-impact effects. In addition, we allow for stochastic terminal and running targets. Initially, we formulate the optimal trade execution task as a stochastic control problem with a finite-variation control process that acts as an integrator both in the state dynamics and in the cost functional. We then extend this problem continuously to a stochastic control problem with progressively measurable controls. By identifying this extended problem as equivalent to a certain linear-quadratic stochastic control problem, we can use established results in linear-quadratic stochastic control to solve the extended problem. This work generalizes [“Ackermann, Kruse, Urusov; FinancStoch'24”] from the single-asset setting to the multi-asset case. In particular, we reveal cross-hedging effects, showing that it can be optimal to trade in an asset despite having no initial position. Moreover, as a subsetting we discuss a multi-asset variant of the model in [“Obizhaeva, Wang; JFinancMark'13”]. ...

March 7, 2025 · 2 min · Research Team

Risk Measures for DC Pension Plan Decumulation

Risk Measures for DC Pension Plan Decumulation ArXiv ID: 2502.16364 “View on arXiv” Authors: Unknown Abstract As the developed world replaces Defined Benefit (DB) pension plans with Defined Contribution (DC) plans, there is a need to develop decumulation strategies for DC plan holders. Optimal decumulation can be viewed as a problem in optimal stochastic control. Formulation as a control problem requires specification of an objective function, which in turn requires a definition of reward and risk. An intuitive specification of reward is the total withdrawals over the retirement period. Most retirees view risk as the possibility of running out of savings. This paper investigates several possible left tail risk measures, in conjunction with DC plan decumulation. The risk measures studied include (i) expected shortfall (ii) linear shortfall and (iii) probability of shortfall. We establish that, under certain assumptions, the set of optimal controls associated with all expected reward and expected shortfall Pareto efficient frontier curves is identical to the set of optimal controls for all expected reward and linear shortfall Pareto efficient frontier curves. Optimal efficient frontiers are determined computationally for each risk measure, based on a parametric market model. Robustness of these strategies is determined by testing the strategies out-of-sample using block bootstrapping of historical data. ...

February 22, 2025 · 2 min · Research Team

Optimal portfolio under ratio-type periodic evaluation in stochastic factor models under convex trading constraints

Optimal portfolio under ratio-type periodic evaluation in stochastic factor models under convex trading constraints ArXiv ID: 2411.13579 “View on arXiv” Authors: Unknown Abstract This paper studies a type of periodic utility maximization problems for portfolio management in incomplete stochastic factor models with convex trading constraints. The portfolio performance is periodically evaluated on the relative ratio of two adjacent wealth levels over an infinite horizon, featuring the dynamic adjustments in portfolio decision according to past achievements. Under power utility, we transform the original infinite horizon optimal control problem into an auxiliary terminal wealth optimization problem under a modified utility function. To cope with the convex trading constraints, we further introduce an auxiliary unconstrained optimization problem in a modified market model and develop the martingale duality approach to establish the existence of the dual minimizer such that the optimal unconstrained wealth process can be obtained using the dual representation. With the help of the duality results in the auxiliary problems, the relationship between the constrained and unconstrained models as well as some fixed point arguments, we finally derive and verify the optimal constrained portfolio process in a periodic manner for the original problem over an infinite horizon. ...

November 15, 2024 · 2 min · Research Team

Optimal Investment with Costly Expert Opinions

Optimal Investment with Costly Expert Opinions ArXiv ID: 2409.11569 “View on arXiv” Authors: Unknown Abstract We consider the Merton problem of optimizing expected power utility of terminal wealth in the case of an unobservable Markov-modulated drift. What makes the model special is that the agent is allowed to purchase costly expert opinions of varying quality on the current state of the drift, leading to a mixed stochastic control problem with regular and impulse controls involving random consequences. Using ideas from filtering theory, we first embed the original problem with unobservable drift into a full information problem on a larger state space. The value function of the full information problem is characterized as the unique viscosity solution of the dynamic programming PDE. This characterization is achieved by a new variant of the stochastic Perron’s method, which additionally allows us to show that, in between purchases of expert opinions, the problem reduces to an exit time control problem which is known to admit an optimal feedback control. Under the assumption of sufficient regularity of this feedback map, we are able to construct optimal trading and expert opinion strategies. ...

September 17, 2024 · 2 min · Research Team

Dynamic Pricing for Real Estate

Dynamic Pricing for Real Estate ArXiv ID: 2408.12553 “View on arXiv” Authors: Unknown Abstract We study a mathematical model for the optimization of the price of real estate (RE). This model can be characterised by a limited amount of goods, fixed sales horizon and presence of intermediate sales and revenue goals. We develop it as an enhancement and upgrade of the model presented by Besbes and Maglaras now also taking into account variable demand, time value of money, and growth of the objective value of Real Estate with the development stage. ...

August 22, 2024 · 2 min · Research Team

Portfolio and reinsurance optimization under unknown market price of risk

Portfolio and reinsurance optimization under unknown market price of risk ArXiv ID: 2408.07432 “View on arXiv” Authors: Unknown Abstract We investigate the optimal investment-reinsurance problem for insurance company with partial information on the market price of the risk. Through the use of filtering techniques we convert the original optimization problem involving different filtrations, into an equivalent stochastic control problem under the observation filtration only, the so-called separated problem. The Markovian structure of the separated problem allows us to apply a classical approach to stochastic optimization based on the Hamilton-Jacobi-Bellman equation, and to provide explicit formulas for the value function and the optimal investment-reinsurance strategy. We finally discuss some comparisons between the optimal strategies pursued by a partially informed insurer and that followed by a fully informed insurer, and we evaluate the value of information using the idea of indifference pricing. These results are also supported by numerical experiments. ...

August 14, 2024 · 2 min · Research Team

Optimal risk mitigation by deep reinsurance

Optimal risk mitigation by deep reinsurance ArXiv ID: 2408.06168 “View on arXiv” Authors: Unknown Abstract We consider an insurance company which faces financial risk in the form of insurance claims and market-dependent surplus fluctuations. The company aims to simultaneously control its terminal wealth (e.g. at the end of an accounting period) and the ruin probability in a finite time interval by purchasing reinsurance. The target functional is given by the expected utility of terminal wealth perturbed by a modified Gerber-Shiu penalty function. We solve the problem of finding the optimal reinsurance strategy and the corresponding maximal target functional via neural networks. The procedure is illustrated by a numerical example, where the surplus process is given by a Cramér-Lundberg model perturbed by a mean-reverting Ornstein-Uhlenbeck process. ...

August 12, 2024 · 2 min · Research Team