false

Transfer Learning Across Fixed-Income Product Classes

Transfer Learning Across Fixed-Income Product Classes ArXiv ID: 2505.07676 “View on arXiv” Authors: Nicolas Camenzind, Damir Filipovic Abstract We propose a framework for transfer learning of discount curves across different fixed-income product classes. Motivated by challenges in estimating discount curves from sparse or noisy data, we extend kernel ridge regression (KR) to a vector-valued setting, formulating a convex optimization problem in a vector-valued reproducing kernel Hilbert space (RKHS). Each component of the solution corresponds to the discount curve implied by a specific product class. We introduce an additional regularization term motivated by economic principles, promoting smoothness of spread curves between product classes, and show that it leads to a valid separable kernel structure. A main theoretical contribution is a decomposition of the vector-valued RKHS norm induced by separable kernels. We further provide a Gaussian process interpretation of vector-valued KR, enabling quantification of estimation uncertainty. Illustrative examples demonstrate that transfer learning significantly improves extrapolation performance and tightens confidence intervals compared to single-curve estimation. ...

May 12, 2025 · 2 min · Research Team

Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?

Can LLM-based Financial Investing Strategies Outperform the Market in Long Run? ArXiv ID: 2505.07078 “View on arXiv” Authors: Weixian Waylon Li, Hyeonjun Kim, Mihai Cucuringu, Tiejun Ma Abstract Large Language Models (LLMs) have recently been leveraged for asset pricing tasks and stock trading applications, enabling AI agents to generate investment decisions from unstructured financial data. However, most evaluations of LLM timing-based investing strategies are conducted on narrow timeframes and limited stock universes, overstating effectiveness due to survivorship and data-snooping biases. We critically assess their generalizability and robustness by proposing FINSABER, a backtesting framework evaluating timing-based strategies across longer periods and a larger universe of symbols. Systematic backtests over two decades and 100+ symbols reveal that previously reported LLM advantages deteriorate significantly under broader cross-section and over a longer-term evaluation. Our market regime analysis further demonstrates that LLM strategies are overly conservative in bull markets, underperforming passive benchmarks, and overly aggressive in bear markets, incurring heavy losses. These findings highlight the need to develop LLM strategies that are able to prioritise trend detection and regime-aware risk controls over mere scaling of framework complexity. ...

May 11, 2025 · 2 min · Research Team

Copula Analysis of Risk: A Multivariate Risk Analysis for VaR and CoVaR using Copulas and DCC-GARCH

Copula Analysis of Risk: A Multivariate Risk Analysis for VaR and CoVaR using Copulas and DCC-GARCH ArXiv ID: 2505.06950 “View on arXiv” Authors: Aryan Singh, Paul O Reilly, Daim Sharif, Patrick Haughey, Eoghan McCarthy, Sathvika Thorali Suresh, Aakhil Anvar, Adarsh Sajeev Kumar Abstract A multivariate risk analysis for VaR and CVaR using different copula families is performed on historical financial time series fitted with DCC-GARCH models. A theoretical background is provided alongside a comparison of goodness-of-fit across different copula families to estimate the validity and effectiveness of approaches discussed. ...

May 11, 2025 · 1 min · Research Team

NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks

NewsNet-SDF: Stochastic Discount Factor Estimation with Pretrained Language Model News Embeddings via Adversarial Networks ArXiv ID: 2505.06864 “View on arXiv” Authors: Shunyao Wang, Ming Cheng, Christina Dan Wang Abstract Stochastic Discount Factor (SDF) models provide a unified framework for asset pricing and risk assessment, yet traditional formulations struggle to incorporate unstructured textual information. We introduce NewsNet-SDF, a novel deep learning framework that seamlessly integrates pretrained language model embeddings with financial time series through adversarial networks. Our multimodal architecture processes financial news using GTE-multilingual models, extracts temporal patterns from macroeconomic data via LSTM networks, and normalizes firm characteristics, fusing these heterogeneous information sources through an innovative adversarial training mechanism. Our dataset encompasses approximately 2.5 million news articles and 10,000 unique securities, addressing the computational challenges of processing and aligning text data with financial time series. Empirical evaluations on U.S. equity data (1980-2022) demonstrate NewsNet-SDF substantially outperforms alternatives with a Sharpe ratio of 2.80. The model shows a 471% improvement over CAPM, over 200% improvement versus traditional SDF implementations, and a 74% reduction in pricing errors compared to the Fama-French five-factor model. In comprehensive comparisons, our deep learning approach consistently outperforms traditional, modern, and other neural asset pricing models across all key metrics. Ablation studies confirm that text embeddings contribute significantly more to model performance than macroeconomic features, with news-derived principal components ranking among the most influential determinants of SDF dynamics. These results validate the effectiveness of our multimodal deep learning approach in integrating unstructured text with traditional financial data for more accurate asset pricing, providing new insights for digital intelligent decision-making in financial technology. ...

May 11, 2025 · 2 min · Research Team

Beyond the Mean: Limit Theory and Tests for Infinite-Mean Autoregressive Conditional Durations

Beyond the Mean: Limit Theory and Tests for Infinite-Mean Autoregressive Conditional Durations ArXiv ID: 2505.06190 “View on arXiv” Authors: Giuseppe Cavaliere, Thomas Mikosch, Anders Rahbek, Frederik Vilandt Abstract Integrated autoregressive conditional duration (ACD) models serve as natural counterparts to the well-known integrated GARCH models used for financial returns. However, despite their resemblance, asymptotic theory for ACD is challenging and also not complete, in particular for integrated ACD. Central challenges arise from the facts that (i) integrated ACD processes imply durations with infinite expectation, and (ii) even in the non-integrated case, conventional asymptotic approaches break down due to the randomness in the number of durations within a fixed observation period. Addressing these challenges, we provide here unified asymptotic theory for the (quasi-) maximum likelihood estimator for ACD models; a unified theory which includes integrated ACD models. Based on the new results, we also provide a novel framework for hypothesis testing in duration models, enabling inference on a key empirical question: whether durations possess a finite or infinite expectation. We apply our results to high-frequency cryptocurrency ETF trading data. Motivated by parameter estimates near the integrated ACD boundary, we assess whether durations between trades in these markets have finite expectation, an assumption often made implicitly in the literature on point process models. Our empirical findings indicate infinite-mean durations for all the five cryptocurrencies examined, with the integrated ACD hypothesis rejected – against alternatives with tail index less than one – for four out of the five cryptocurrencies considered. ...

May 9, 2025 · 2 min · Research Team

FlowHFT: Imitation Learning via Flow Matching Policy for Optimal High-Frequency Trading under Diverse Market Conditions

FlowHFT: Imitation Learning via Flow Matching Policy for Optimal High-Frequency Trading under Diverse Market Conditions ArXiv ID: 2505.05784 “View on arXiv” Authors: Yang Li, Zhi Chen, Steve Yang Abstract High-frequency trading (HFT) is an investing strategy that continuously monitors market states and places bid and ask orders at millisecond speeds. Traditional HFT approaches fit models with historical data and assume that future market states follow similar patterns. This limits the effectiveness of any single model to the specific conditions it was trained for. Additionally, these models achieve optimal solutions only under specific market conditions, such as assumptions about stock price’s stochastic process, stable order flow, and the absence of sudden volatility. Real-world markets, however, are dynamic, diverse, and frequently volatile. To address these challenges, we propose the FlowHFT, a novel imitation learning framework based on flow matching policy. FlowHFT simultaneously learns strategies from numerous expert models, each proficient in particular market scenarios. As a result, our framework can adaptively adjust investment decisions according to the prevailing market state. Furthermore, FlowHFT incorporates a grid-search fine-tuning mechanism. This allows it to refine strategies and achieve superior performance even in complex or extreme market scenarios where expert strategies may be suboptimal. We test FlowHFT in multiple market environments. We first show that flow matching policy is applicable in stochastic market environments, thus enabling FlowHFT to learn trading strategies under different market conditions. Notably, our single framework consistently achieves performance superior to the best expert for each market condition. ...

May 9, 2025 · 2 min · Research Team

The bias of IID resampled backtests for rolling-window mean-variance portfolios

The bias of IID resampled backtests for rolling-window mean-variance portfolios ArXiv ID: 2505.06383 “View on arXiv” Authors: Andrew Paskaramoorthy, Terence van Zyl, Tim Gebbie Abstract Backtests on historical data are the basis for practical evaluations of portfolio selection rules, but their reliability is often limited by reliance on a single sample path. This can lead to high estimation variance. Resampling techniques offer a potential solution by increasing the effective sample size, but can disrupt the temporal ordering inherent in financial data and introduce significant bias. This paper investigates the critical questions: First, How large is this bias for Sharpe Ratio estimates?, and then, second: What are its primary drivers?. We focus on the canonical rolling-window mean-variance portfolio rule. Our contributions are identifying the bias mechanism, and providing a practical heuristic for gauging bias severity. We show that the bias arises from the disruption of train-test dependence linked to the return auto-covariance structure and derive bounds for the bias which show a strong dependence on the observable first-lag autocorrelation. Using simulations to confirm these findings, it is revealed that the resulting Sharpe Ratio bias is often a fraction of a typical backtest’s estimation noise, benefiting from partial offsetting of component biases. Empirical analysis further illustrates that differences between IID-resampled and standard backtests align qualitatively with these drivers. Surprisingly, our results suggest that while IID resampling can disrupt temporal dependence, its resulting bias can often be tolerable. However, we highlight the need for structure-preserving resampling methods. ...

May 9, 2025 · 2 min · Research Team

Comparative Evaluation of VaR Models: Historical Simulation, GARCH-Based Monte Carlo, and Filtered Historical Simulation

Comparative Evaluation of VaR Models: Historical Simulation, GARCH-Based Monte Carlo, and Filtered Historical Simulation ArXiv ID: 2505.05646 “View on arXiv” Authors: Xin Tian Abstract This report presents a comprehensive evaluation of three Value-at-Risk (VaR) modeling approaches: Historical Simulation (HS), GARCH with Normal approximation (GARCH-N), and GARCH with Filtered Historical Simulation (FHS), using both in-sample and multi-day forecasting frameworks. We compute daily 5 percent VaR estimates using each method and assess their accuracy via empirical breach frequencies and visual breach indicators. Our findings reveal severe miscalibration in the HS and GARCH-N models, with empirical breach rates far exceeding theoretical levels. In contrast, the FHS method consistently aligns with theoretical expectations and exhibits desirable statistical and visual behavior. We further simulate 5-day cumulative returns under both GARCH-N and GARCH-FHS frameworks to compute multi-period VaR and Expected Shortfall. Results show that GARCH-N underestimates tail risk due to its reliance on the Gaussian assumption, whereas GARCH-FHS provides more robust and conservative tail estimates. Overall, the study demonstrates that the GARCH-FHS model offers superior performance in capturing fat-tailed risks and provides more reliable short-term risk forecasts. ...

May 8, 2025 · 2 min · Research Team

Error Analysis of Deep PDE Solvers for Option Pricing

Error Analysis of Deep PDE Solvers for Option Pricing ArXiv ID: 2505.05121 “View on arXiv” Authors: Jasper Rou Abstract Option pricing often requires solving partial differential equations (PDEs). Although deep learning-based PDE solvers have recently emerged as quick solutions to this problem, their empirical and quantitative accuracy remain not well understood, hindering their real-world applicability. In this research, our aim is to offer actionable insights into the utility of deep PDE solvers for practical option pricing implementation. Through comparative experiments in both the Black–Scholes and the Heston model, we assess the empirical performance of two neural network algorithms to solve PDEs: the Deep Galerkin Method and the Time Deep Gradient Flow method (TDGF). We determine their empirical convergence rates and training time as functions of (i) the number of sampling stages, (ii) the number of samples, (iii) the number of layers, and (iv) the number of nodes per layer. For the TDGF, we also consider the order of the discretization scheme and the number of time steps. ...

May 8, 2025 · 2 min · Research Team

Impact of Tariff Wars on Global Economy

Impact of Tariff Wars on Global Economy ArXiv ID: 2505.05576 “View on arXiv” Authors: N. S. Gonchar, O. P. Dovzhyk, A. S. Zhokhin, W. H. Kozyrsky, A. P. Makhort Abstract The Ricardian model of world trade based on comparative advantage is not sufficient to justify equal trade relations.The existing model of trade relations does not explain the distribution of income among trading countries. This paper presents a method for building equitable trade relations. Its essence is to present an algorithm for building such trade relations, based on the previously proposed model of world trade, that the trade balance of each country would be equal to zero. Under such conditions, tariff wars would become impossible. It is proved that, provided that the supply structure is consistent with the demand structure, it is always possible to build an equilibrium price vector for which the trade balance of each country is zero. This state of economic equilibrium is called ideal. The article presents an algorithm to build an export structure based on the structure of imports. This algorithm is quite simple and allows for a wide range of applications. Under fairly simple realistic assumptions about the behaviour of countries trading with each other that are subject to tariff restrictions, it is proved that this leads to an increase in the prices of the goods traded by these countries. Among the equilibrium states, there are also those called oversupply states. The latter describes the phenomenon of recession. This contributes to a fall in stock market indices. ...

May 8, 2025 · 2 min · Research Team