Neural Networks

Designing an attack-defense game: how to increase robustness of financial transaction models via a competition

Designing an attack-defense game: how to increase robustness of financial transaction models via a competition ArXiv ID: 2308.11406 “View on arXiv” Authors: Unknown Abstract Banks routinely use neural networks to make decisions. While these models offer higher accuracy, they are susceptible to adversarial attacks, a risk often overlooked in the context of event sequences, particularly sequences of financial transactions, as most works consider computer vision and NLP modalities. We propose a thorough approach to studying these risks: a novel type of competition that allows a realistic and detailed investigation of problems in financial transaction data. The participants directly oppose each other, proposing attacks and defenses – so they are examined in close-to-real-life conditions. The paper outlines our unique competition structure with direct opposition of participants, presents results for several different top submissions, and analyzes the competition results. We also introduce a new open dataset featuring financial transactions with credit default labels, enhancing the scope for practical research and development. ...

D-TIPO: Deep time-inconsistent portfolio optimization with stocks and options

D-TIPO: Deep time-inconsistent portfolio optimization with stocks and options ArXiv ID: 2308.10556 “View on arXiv” Authors: Unknown Abstract In this paper, we propose a machine learning algorithm for time-inconsistent portfolio optimization. The proposed algorithm builds upon neural network based trading schemes, in which the asset allocation at each time point is determined by a a neural network. The loss function is given by an empirical version of the objective function of the portfolio optimization problem. Moreover, various trading constraints are naturally fulfilled by choosing appropriate activation functions in the output layers of the neural networks. Besides this, our main contribution is to add options to the portfolio of risky assets and a risk-free bond and using additional neural networks to determine the amount allocated into the options as well as their strike prices. We consider objective functions more in line with the rational preference of an investor than the classical mean-variance, apply realistic trading constraints and model the assets with a correlated jump-diffusion SDE. With an incomplete market and a more involved objective function, we show that it is beneficial to add options to the portfolio. Moreover, it is shown that adding options leads to a more constant stock allocation with less demand for drastic re-allocations. ...

A discretization scheme for path-dependent FBSDEs and PDEs

A discretization scheme for path-dependent FBSDEs and PDEs ArXiv ID: 2308.07029 “View on arXiv” Authors: Unknown Abstract This study develops a numerical scheme for path-dependent FBSDEs and PDEs. We introduce a Picard iteration method for solving path-dependent FBSDEs, prove its convergence to the true solution, and establish its rate of convergence. A key contribution of our approach is a novel estimator for the martingale integrand in the FBSDE, specifically designed to handle path-dependence more reliably than existing methods. We derive a concentration inequality that quantifies the statistical error of this estimator in a Monte Carlo framework. Based on these results, we investigate a supervised learning method with neural networks for solving path-dependent PDEs. The proposed algorithm is fully implementable and adaptable to a broad class of path-dependent problems. ...

Modeling Inverse Demand Function with Explainable Dual Neural Networks

Modeling Inverse Demand Function with Explainable Dual Neural Networks ArXiv ID: 2307.14322 “View on arXiv” Authors: Unknown Abstract Financial contagion has been widely recognized as a fundamental risk to the financial system. Particularly potent is price-mediated contagion, wherein forced liquidations by firms depress asset prices and propagate financial stress, enabling crises to proliferate across a broad spectrum of seemingly unrelated entities. Price impacts are currently modeled via exogenous inverse demand functions. However, in real-world scenarios, only the initial shocks and the final equilibrium asset prices are typically observable, leaving actual asset liquidations largely obscured. This missing data presents significant limitations to calibrating the existing models. To address these challenges, we introduce a novel dual neural network structure that operates in two sequential stages: the first neural network maps initial shocks to predicted asset liquidations, and the second network utilizes these liquidations to derive resultant equilibrium prices. This data-driven approach can capture both linear and non-linear forms without pre-specifying an analytical structure; furthermore, it functions effectively even in the absence of observable liquidation data. Experiments with simulated datasets demonstrate that our model can accurately predict equilibrium asset prices based solely on initial shocks, while revealing a strong alignment between predicted and true liquidations. Our explainable framework contributes to the understanding and modeling of price-mediated contagion and provides valuable insights for financial authorities to construct effective stress tests and regulatory policies. ...

Machine learning for option pricing: an empirical investigation of network architectures

Machine learning for option pricing: an empirical investigation of network architectures ArXiv ID: 2307.07657 “View on arXiv” Authors: Unknown Abstract We consider the supervised learning problem of learning the price of an option or the implied volatility given appropriate input data (model parameters) and corresponding output data (option prices or implied volatilities). The majority of articles in this literature considers a (plain) feed forward neural network architecture in order to connect the neurons used for learning the function mapping inputs to outputs. In this article, motivated by methods in image classification and recent advances in machine learning methods for PDEs, we investigate empirically whether and how the choice of network architecture affects the accuracy and training time of a machine learning algorithm. We find that the generalized highway network architecture achieves the best performance, when considering the mean squared error and the training time as criteria, within the considered parameter budgets for the Black-Scholes and Heston option pricing problems. Considering the transformed implied volatility problem, a simplified DGM variant achieves the lowest error among the tested architectures. We also carry out a capacity-normalised comparison for completeness, where all architectures are evaluated with an equal number of parameters. Finally, for the implied volatility problem, we additionally include experiments using real market data. ...

Deep calibration with random grids

Deep calibration with random grids ArXiv ID: 2306.11061 “View on arXiv” Authors: Unknown Abstract We propose a neural network-based approach to calibrating stochastic volatility models, which combines the pioneering grid approach by Horvath et al. (2021) with the pointwise two-stage calibration of Bayer et al. (2018) and Liu et al. (2019). Our methodology inherits robustness from the former while not suffering from the need for interpolation/extrapolation techniques, a clear advantage ensured by the pointwise approach. The crucial point to the entire procedure is the generation of implied volatility surfaces on random grids, which one dispenses to the network in the training phase. We support the validity of our calibration technique with several empirical and Monte Carlo experiments for the rough Bergomi and Heston models under a simple but effective parametrization of the forward variance curve. The approach paves the way for valuable applications in financial engineering - for instance, pricing under local stochastic volatility models - and extensions to the fast-growing field of path-dependent volatility models. ...

Neural networks can detect model-free static arbitrage strategies

Neural networks can detect model-free static arbitrage strategies ArXiv ID: 2306.16422 “View on arXiv” Authors: Unknown Abstract In this paper we demonstrate both theoretically as well as numerically that neural networks can detect model-free static arbitrage opportunities whenever the market admits some. Due to the use of neural networks, our method can be applied to financial markets with a high number of traded securities and ensures almost immediate execution of the corresponding trading strategies. To demonstrate its tractability, effectiveness, and robustness we provide examples using real financial data. From a technical point of view, we prove that a single neural network can approximately solve a class of convex semi-infinite programs, which is the key result in order to derive our theoretical results that neural networks can detect model-free static arbitrage strategies whenever the financial market admits such opportunities. ...

Machine Learning and Hamilton-Jacobi-Bellman Equation for Optimal Decumulation: a Comparison Study

Machine Learning and Hamilton-Jacobi-Bellman Equation for Optimal Decumulation: a Comparison Study ArXiv ID: 2306.10582 “View on arXiv” Authors: Unknown Abstract We propose a novel data-driven neural network (NN) optimization framework for solving an optimal stochastic control problem under stochastic constraints. Customized activation functions for the output layers of the NN are applied, which permits training via standard unconstrained optimization. The optimal solution yields a multi-period asset allocation and decumulation strategy for a holder of a defined contribution (DC) pension plan. The objective function of the optimal control problem is based on expected wealth withdrawn (EW) and expected shortfall (ES) that directly targets left-tail risk. The stochastic bound constraints enforce a guaranteed minimum withdrawal each year. We demonstrate that the data-driven approach is capable of learning a near-optimal solution by benchmarking it against the numerical results from a Hamilton-Jacobi-Bellman (HJB) Partial Differential Equation (PDE) computational framework. ...

Optimal Portfolio Execution in a Regime-switching Market with Non-linear Impact Costs: Combining Dynamic Program and Neural Network

Optimal Portfolio Execution in a Regime-switching Market with Non-linear Impact Costs: Combining Dynamic Program and Neural Network ArXiv ID: 2306.08809 “View on arXiv” Authors: Unknown Abstract Optimal execution of a portfolio have been a challenging problem for institutional investors. Traders face the trade-off between average trading price and uncertainty, and traditional methods suffer from the curse of dimensionality. Here, we propose a four-step numerical framework for the optimal portfolio execution problem where multiple market regimes exist, with the underlying regime switching based on a Markov process. The market impact costs are modelled with a temporary part and a permanent part, where the former affects only the current trade while the latter persists. Our approach accepts impact cost functions in generic forms. First, we calculate the approximated orthogonal portfolios based on estimated impact cost functions; second, we employ dynamic program to learn the optimal selling schedule of each approximated orthogonal portfolio; third, weights of a neural network are pre-trained with the strategy suggested by previous step; last, we train the neural network to optimize on the original trading model. In our experiment of a 10-asset liquidation example with quadratic impact costs, the proposed combined method provides promising selling strategy for both CRRA (constant relative risk aversion) and mean-variance objectives. The running time is linear in the number of risky assets in the portfolio as well as in the number of trading periods. Possible improvements in running time are discussed for potential large-scale usages. ...

Swing contract pricing: with and without Neural Networks

Swing contract pricing: with and without Neural Networks ArXiv ID: 2306.03822 “View on arXiv” Authors: Unknown Abstract We propose two parametric approaches to evaluate swing contracts with firm constraints. Our objective is to define approximations for the optimal control, which represents the amounts of energy purchased throughout the contract. The first approach involves approximating the optimal control by means of an explicit parametric function, where the parameters are determined using stochastic gradient descent based algorithms. The second approach builds on the first one, where we replace parameters in the first approach by the output of a neural network. Our numerical experiments demonstrate that by using Langevin based algorithms, both parameterizations provide, in a short computation time, better prices compared to state-of-the-art methods. ...