false

Machine Learning Enhanced Multi-Factor Quantitative Trading: A Cross-Sectional Portfolio Optimization Approach with Bias Correction

Machine Learning Enhanced Multi-Factor Quantitative Trading: A Cross-Sectional Portfolio Optimization Approach with Bias Correction ArXiv ID: 2507.07107 “View on arXiv” Authors: Yimin Du Abstract This paper presents a comprehensive machine learning framework for quantitative trading that achieves superior risk-adjusted returns through systematic factor engineering, real-time computation optimization, and cross-sectional portfolio construction. Our approach integrates multi-factor alpha discovery with bias correction techniques, leveraging PyTorch-accelerated factor computation and advanced portfolio optimization. The system processes 500-1000 factors derived from open-source alpha101 extensions and proprietary market microstructure signals. Key innovations include tensor-based factor computation acceleration, geometric Brownian motion data augmentation, and cross-sectional neutralization strategies. Empirical validation on Chinese A-share markets (2010-2024) demonstrates annualized returns of $20%$ with Sharpe ratios exceeding 2.0, significantly outperforming traditional approaches. Our analysis reveals the critical importance of bias correction in factor construction and the substantial impact of cross-sectional portfolio optimization on strategy performance. Code and experimental implementations are available at: https://github.com/initial-d/ml-quant-trading ...

June 2, 2025 · 2 min · Research Team

Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST100

Explainable-AI powered stock price prediction using time series transformers: A Case Study on BIST100 ArXiv ID: 2506.06345 “View on arXiv” Authors: Sukru Selim Calik, Andac Akyuz, Zeynep Hilal Kilimci, Kerem Colak Abstract Financial literacy is increasingly dependent on the ability to interpret complex financial data and utilize advanced forecasting tools. In this context, this study proposes a novel approach that combines transformer-based time series models with explainable artificial intelligence (XAI) to enhance the interpretability and accuracy of stock price predictions. The analysis focuses on the daily stock prices of the five highest-volume banks listed in the BIST100 index, along with XBANK and XU100 indices, covering the period from January 2015 to March 2025. Models including DLinear, LTSNet, Vanilla Transformer, and Time Series Transformer are employed, with input features enriched by technical indicators. SHAP and LIME techniques are used to provide transparency into the influence of individual features on model outputs. The results demonstrate the strong predictive capabilities of transformer models and highlight the potential of interpretable machine learning to empower individuals in making informed investment decisions and actively engaging in financial markets. ...

June 1, 2025 · 2 min · Research Team

A Causation-Based Framework for Pricing and Cost Allocation of Energy, Reserves, and Transmission in Modern Power Systems

A Causation-Based Framework for Pricing and Cost Allocation of Energy, Reserves, and Transmission in Modern Power Systems ArXiv ID: 2505.24159 “View on arXiv” Authors: Luiza Ribeiro, Alexandre Street, Jose Manuel Arroyo, Rodrigo Moreno Abstract The increasing vulnerability of power systems has heightened the need for operating reserves to manage contingencies such as generator outages, line failures, and sudden load variations. Unlike energy costs, driven by consumer demand, operating reserve costs arise from addressing the most critical credible contingencies - prompting the question: how should these costs be allocated through efficient pricing mechanisms? As an alternative to previously reported schemes, this paper presents a new causation-based pricing framework for electricity markets based on contingency-constrained energy and reserve scheduling models. Major salient features include a novel security charge mechanism along with the explicit definition of prices for up-spinning reserves, down-spinning reserves, and transmission services. These features ensure more comprehensive and efficient cost-reflective market operations. Moreover, the proposed nodal pricing scheme yields revenue adequacy and neutrality while promoting reliability incentives for generators based on the cost-causation principle. An additional salient aspect of the proposed framework is the economic incentive for transmission assets, which are remunerated based on their use to deliver energy and reserves across all contingency states. Numerical results from two case studies illustrate the performance of the proposed pricing scheme. ...

May 30, 2025 · 2 min · Research Team

Optimising cryptocurrency portfolios through stable clustering of price correlation networks

Optimising cryptocurrency portfolios through stable clustering of price correlation networks ArXiv ID: 2505.24831 “View on arXiv” Authors: Ruixue Jing, Ryota Kobayashi, Luis Enrique Correa Rocha Abstract The emerging cryptocurrency market presents unique challenges for investment due to its unregulated nature and inherent volatility. However, collective price movements can be explored to maximise profits with minimal risk using investment portfolios. In this paper, we develop a technical framework that utilises historical data on daily closing prices and integrates network analysis, price forecasting, and portfolio theory to identify cryptocurrencies for building profitable portfolios under uncertainty. Our method utilises the Louvain network community algorithm and consensus clustering to detect robust and temporally stable clusters of highly correlated cryptocurrencies, from which the chosen cryptocurrencies are selected. A price prediction step using the ARIMA model guarantees that the portfolio performs well for up to 14 days in the investment horizon. Empirical analysis over a 5-year period shows that despite the high volatility in the crypto market, hidden price patterns can be effectively utilised to generate consistently profitable, time-agnostic cryptocurrency portfolios. ...

May 30, 2025 · 2 min · Research Team

Path-dependent option pricing with two-dimensional PDE using MPDATA

Path-dependent option pricing with two-dimensional PDE using MPDATA ArXiv ID: 2505.24435 “View on arXiv” Authors: Paweł Magnuszewski, Sylwester Arabas Abstract In this paper, we discuss a simple yet robust PDE method for evaluating path-dependent Asian-style options using the non-oscillatory forward-in-time second-order MPDATA finite-difference scheme. The valuation methodology involves casting the Black-Merton-Scholes equation as a transport problem by first transforming it into a homogeneous advection-diffusion PDE via variable substitution, and then expressing the diffusion term as an advective flux using the pseudo-velocity technique. As a result, all terms of the Black-Merton-Sholes equation are consistently represented using a single high-order numerical scheme for the advection operator. We detail the additional steps required to solve the two-dimensional valuation problem compared to MPDATA valuations of vanilla instruments documented in a prior study. Using test cases employing fixed-strike instruments, we validate the solutions against Monte Carlo valuations, as well as against an approximate analytical solution in which geometric instead of arithmetic averaging is used. The analysis highlights the critical importance of the MPDATA corrective steps that improve the solution over the underlying first-order “upwind” step. The introduced valuation scheme is robust: conservative, non-oscillatory, and positive-definite; yet lucid: explicit in time, engendering intuitive stability-condition interpretation and inflow/outflow boundary-condition heuristics. MPDATA is particularly well suited for two-dimensional problems as it is not a dimensionally split scheme. The documented valuation workflow also constitutes a useful two-dimensional case for testing advection schemes featuring both Monte Carlo solutions and analytic bounds. An implementation of the introduced valuation workflow, based on the PyMPDATA package and the Numba Just-In-Time compiler for Python, is provided as free and open source software. ...

May 30, 2025 · 2 min · Research Team

The Hype Index: an NLP-driven Measure of Market News Attention

The Hype Index: an NLP-driven Measure of Market News Attention ArXiv ID: 2506.06329 “View on arXiv” Authors: Zheng Cao, Wanchaloem Wunkaew, Helyette Geman Abstract This paper introduces the Hype Index as a novel metric to quantify media attention toward large-cap equities, leveraging advances in Natural Language Processing (NLP) for extracting predictive signals from financial news. Using the S&P 100 as the focus universe, we first construct a News Count-Based Hype Index, which measures relative media exposure by computing the share of news articles referencing each stock or sector. We then extend it to the Capitalization Adjusted Hype Index, adjusts for economic size by taking the ratio of a stock’s or sector’s media weight to its market capitalization weight within its industry or sector. We compute both versions of the Hype Index at the stock and sector levels, and evaluate them through multiple lenses: (1) their classification into different hype groups, (2) their associations with returns, volatility, and VIX index at various lags, (3) their signaling power for short-term market movements, and (4) their empirical properties including correlations, samplings, and trends. Our findings suggest that the Hype Index family provides a valuable set of tools for stock volatility analysis, market signaling, and NLP extensions in Finance. ...

May 30, 2025 · 2 min · Research Team

TIP-Search: Time-Predictable Inference Scheduling for Market Prediction under Uncertain Load

TIP-Search: Time-Predictable Inference Scheduling for Market Prediction under Uncertain Load ArXiv ID: 2506.08026 “View on arXiv” Authors: Xibai Wang Abstract This paper proposes TIP-Search, a time-predictable inference scheduling framework for real-time market prediction under uncertain workloads. Motivated by the strict latency demands in high-frequency financial systems, TIP-Search dynamically selects a deep learning model from a heterogeneous pool, aiming to maximize predictive accuracy while satisfying per-task deadline constraints. Our approach profiles latency and generalization performance offline, then performs online task-aware selection without relying on explicit input domain labels. We evaluate TIP-Search on three real-world limit order book datasets (FI-2010, Binance BTC/USDT, LOBSTER AAPL) and demonstrate that it outperforms static baselines with up to 8.5% improvement in accuracy and 100% deadline satisfaction. Our results highlight the effectiveness of TIP-Search in robust low-latency financial inference under uncertainty. ...

May 30, 2025 · 2 min · Research Team

Critical Dynamics of Random Surfaces and Multifractal Scaling

Critical Dynamics of Random Surfaces and Multifractal Scaling ArXiv ID: 2505.23928 “View on arXiv” Authors: Christof Schmidhuber Abstract The critical dynamics of conformal field theories on random surfaces is investigated beyond the previously studied dynamics of the overall area and the genus. It is found that the evolution of the order parameter in physical time performs a generalization of the multifractal random walk. Accordingly, the higher moments of time variations of the order parameter exhibit multifractal scaling. The series of Hurst exponents is computed and illustrated at the examples of the Ising-, 3-state-Potts-, and general minimal models as well as $c=1$ models on a random surface. It is noted that some of these models can replicate the observed multifractal scaling in financial markets. ...

May 29, 2025 · 2 min · Research Team

Model-Free Deep Hedging with Transaction Costs and Light Data Requirements

Model-Free Deep Hedging with Transaction Costs and Light Data Requirements ArXiv ID: 2505.22836 “View on arXiv” Authors: Pierre Brugière, Gabriel Turinici Abstract Option pricing theory, such as the Black and Scholes (1973) model, provides an explicit solution to construct a strategy that perfectly hedges an option in a continuous-time setting. In practice, however, trading occurs in discrete time and often involves transaction costs, making the direct application of continuous-time solutions potentially suboptimal. Previous studies, such as those by Buehler et al. (2018), Buehler et al. (2019) and Cao et al. (2019), have shown that deep learning or reinforcement learning can be used to derive better hedging strategies than those based on continuous-time models. However, these approaches typically rely on a large number of trajectories (of the order of $10^5$ or $10^6$) to train the model. In this work, we show that using as few as 256 trajectories is sufficient to train a neural network that significantly outperforms, in the Geometric Brownian Motion framework, both the classical Black & Scholes formula and the Leland model, which is arguably one of the most effective explicit alternatives for incorporating transaction costs. The ability to train neural networks with such a small number of trajectories suggests the potential for more practical and simple implementation on real-time financial series. ...

May 28, 2025 · 2 min · Research Team

Multi-period Mean-Buffered Probability of Exceedance in Defined Contribution Portfolio Optimization

Multi-period Mean-Buffered Probability of Exceedance in Defined Contribution Portfolio Optimization ArXiv ID: 2505.22121 “View on arXiv” Authors: Duy-Minh Dang, Chang Chen Abstract We investigate multi-period mean-risk portfolio optimization for long-horizon Defined Contribution plans, focusing on buffered Probability of Exceedance (bPoE), a more intuitive, dollar-based alternative to Conditional Value-at-Risk (CVaR). We formulate both pre-commitment and time-consistent Mean-bPoE and Mean-CVaR portfolio optimization problems under realistic investment constraints (e.g., no leverage, no short selling) and jump-diffusion dynamics. These formulations are naturally framed as bilevel optimization problems, with an outer search over the shortfall threshold and an inner optimization over rebalancing decisions. We establish an equivalence between the pre-commitment formulations through a one-to-one correspondence of their scalarization optimal sets, while showing that no such equivalence holds in the time-consistent setting. We develop provably convergent numerical schemes for the value functions associated with both pre-commitment and time-consistent formulations of these mean-risk control problems. Using nearly a century of market data, we find that time-consistent Mean-bPoE strategies closely resemble their pre-commitment counterparts. In particular, they maintain alignment with investors’ preferences for a minimum acceptable terminal wealth level-unlike time-consistent Mean-CVaR, which often leads to counterintuitive control behavior. We further show that bPoE, as a strictly tail-oriented measure, prioritizes guarding against catastrophic shortfalls while allowing meaningful upside exposure, making it especially appealing for long-horizon wealth security. These findings highlight bPoE’s practical advantages for Defined Contribution investment planning. ...

May 28, 2025 · 2 min · Research Team