false

QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning ArXiv ID: 2508.20467 “View on arXiv” Authors: Xiangdong Liu, Jiahao Chen Abstract In the highly volatile and uncertain global financial markets, traditional quantitative trading models relying on statistical modeling or empirical rules often fail to adapt to dynamic market changes and black swan events due to rigid assumptions and limited generalization. To address these issues, this paper proposes QTMRL (Quantitative Trading Multi-Indicator Reinforcement Learning), an intelligent trading agent combining multi-dimensional technical indicators with reinforcement learning (RL) for adaptive and stable portfolio management. We first construct a comprehensive multi-indicator dataset using 23 years of S&P 500 daily OHLCV data (2000-2022) for 16 representative stocks across 5 sectors, enriching raw data with trend, volatility, and momentum indicators to capture holistic market dynamics. Then we design a lightweight RL framework based on the Advantage Actor-Critic (A2C) algorithm, including data processing, A2C algorithm, and trading agent modules to support policy learning and actionable trading decisions. Extensive experiments compare QTMRL with 9 baselines (e.g., ARIMA, LSTM, moving average strategies) across diverse market regimes, verifying its superiority in profitability, risk adjustment, and downside risk control. The code of QTMRL is publicly available at https://github.com/ChenJiahaoJNU/QTMRL.git ...

August 28, 2025 · 2 min · Research Team

FinCast: A Foundation Model for Financial Time-Series Forecasting

FinCast: A Foundation Model for Financial Time-Series Forecasting ArXiv ID: 2508.19609 “View on arXiv” Authors: Zhuohang Zhu, Haodong Chen, Qiang Qu, Vera Chung Abstract Financial time-series forecasting is critical for maintaining economic stability, guiding informed policymaking, and promoting sustainable investment practices. However, it remains challenging due to various underlying pattern shifts. These shifts arise primarily from three sources: temporal non-stationarity (distribution changes over time), multi-domain diversity (distinct patterns across financial domains such as stocks, commodities, and futures), and varying temporal resolutions (patterns differing across per-second, hourly, daily, or weekly indicators). While recent deep learning methods attempt to address these complexities, they frequently suffer from overfitting and typically require extensive domain-specific fine-tuning. To overcome these limitations, we introduce FinCast, the first foundation model specifically designed for financial time-series forecasting, trained on large-scale financial datasets. Remarkably, FinCast exhibits robust zero-shot performance, effectively capturing diverse patterns without domain-specific fine-tuning. Comprehensive empirical and qualitative evaluations demonstrate that FinCast surpasses existing state-of-the-art methods, highlighting its strong generalization capabilities. ...

August 27, 2025 · 2 min · Research Team

Optimal Quoting under Adverse Selection and Price Reading

Optimal Quoting under Adverse Selection and Price Reading ArXiv ID: 2508.20225 “View on arXiv” Authors: Alexander Barzykin, Philippe Bergault, Olivier Guéant, Malo Lemmel Abstract Over the past decade, many dealers have implemented algorithmic models to automatically respond to RFQs and manage flows originating from their electronic platforms. In parallel, building on the foundational work of Ho and Stoll, and later Avellaneda and Stoikov, the academic literature on market making has expanded to address trade size distributions, client tiering, complex price dynamics, alpha signals, and the internalization versus externalization dilemma in markets with dealer-to-client and interdealer-broker segments. In this paper, we tackle two critical dimensions: adverse selection, arising from the presence of informed traders, and price reading, whereby the market maker’s own quotes inadvertently reveal the direction of their inventory. These risks are well known to practitioners, who routinely face informed flows and algorithms capable of extracting signals from quoting behavior. Yet they have received limited attention in the quantitative finance literature, beyond stylized toy models with limited actionability. Extending the existing literature, we propose a tractable and implementable framework that enables market makers to adjust their quotes with greater awareness of informational risk. ...

August 27, 2025 · 2 min · Research Team

Combined machine learning for stock selection strategy based on dynamic weighting methods

Combined machine learning for stock selection strategy based on dynamic weighting methods ArXiv ID: 2508.18592 “View on arXiv” Authors: Lin Cai, Zhiyang He, Caiya Zhang Abstract This paper proposes a novel stock selection strategy framework based on combined machine learning algorithms. Two types of weighting methods for three representative machine learning algorithms are developed to predict the returns of the stock selection strategy. One is static weighting based on model evaluation metrics, the other is dynamic weighting based on Information Coefficients (IC). Using CSI 300 index data, we empirically evaluate the strategy’ s backtested performance and model predictive accuracy. The main results are as follows: (1) The strategy by combined machine learning algorithms significantly outperforms single-model approaches in backtested returns. (2) IC-based weighting (particularly IC_Mean) demonstrates greater competitiveness than evaluation-metric-based weighting in both backtested returns and predictive performance. (3) Factor screening substantially enhances the performance of combined machine learning strategies. ...

August 26, 2025 · 2 min · Research Team

Asymmetric super-Heston-rough volatility model with Zumbach effect as scaling limit of quadratic Hawkes processes

Asymmetric super-Heston-rough volatility model with Zumbach effect as scaling limit of quadratic Hawkes processes ArXiv ID: 2508.16566 “View on arXiv” Authors: Priyanka Chudasama, Srikanth Krishnan Iyer Abstract Hawkes processes were first introduced to obtain microscopic models for the rough volatility observed in asset prices. Scaling limits of such processes leads to the rough-Heston model that describes the macroscopic behavior. Blanc et al. (2017) show that Time-reversal asymmetry (TRA) or the Zumbach effect can be modeled using Quadratic Hawkes (QHawkes) processes. Dandapani et al. (2021) obtain a super-rough-Heston model as scaling limit of QHawkes processes in the case where the impact of buying and selling actions are symmetric. To model asymmetry in buying and selling actions, we propose a bivariate QHawkes process and derive a super-rough-Heston model as scaling limits for the price process in the stable and near-unstable regimes that preserves TRA. A new feature of the limiting process in the near-unstable regime is that the two driving Brownian motions exhibit a stochastic covariation that depends on the spot volatility. ...

August 22, 2025 · 2 min · Research Team

Deep Learning for Short Term Equity Trend Forecasting: A Behavior Driven Multi Factor Approach

Deep Learning for Short Term Equity Trend Forecasting: A Behavior Driven Multi Factor Approach ArXiv ID: 2508.14656 “View on arXiv” Authors: Yuqi Luan Abstract This study proposes a behaviorally-informed multi-factor stock selection framework that integrates short-cycle technical alpha signals with deep learning. We design a dual-task multilayer perceptron (MLP) that jointly predicts five-day future returns and directional price movements, thereby capturing nonlinear market behaviors such as volume-price divergence, momentum-driven herding, and bottom reversals. The model is trained on 40 carefully constructed factors derived from price-volume patterns and behavioral finance insights. Empirical evaluation demonstrates that the dual-task MLP achieves superior and stable performance across both predictive accuracy and economic relevance, as measured by information coefficient (IC), information ratio (IR), and portfolio backtesting results. Comparative experiments further show that deep learning methods outperform linear baselines by effectively capturing structural interactions between factors. This work highlights the potential of structure-aware deep learning in enhancing multi-factor modeling and provides a practical framework for short-horizon quantitative investment strategies. ...

August 20, 2025 · 2 min · Research Team

Generative Neural Operators of Log-Complexity Can Simultaneously Solve Infinitely Many Convex Programs

Generative Neural Operators of Log-Complexity Can Simultaneously Solve Infinitely Many Convex Programs ArXiv ID: 2508.14995 “View on arXiv” Authors: Anastasis Kratsios, Ariel Neufeld, Philipp Schmocker Abstract Neural operators (NOs) are a class of deep learning models designed to simultaneously solve infinitely many related problems by casting them into an infinite-dimensional space, whereon these NOs operate. A significant gap remains between theory and practice: worst-case parameter bounds from universal approximation theorems suggest that NOs may require an unrealistically large number of parameters to solve most operator learning problems, which stands in direct opposition to a slew of experimental evidence. This paper closes that gap for a specific class of {“NOs”}, generative {“equilibrium operators”} (GEOs), using (realistic) finite-dimensional deep equilibrium layers, when solving families of convex optimization problems over a separable Hilbert space $X$. Here, the inputs are smooth, convex loss functions on $X$, and outputs are the associated (approximate) solutions to the optimization problem defined by each input loss. We show that when the input losses lie in suitable infinite-dimensional compact sets, our GEO can uniformly approximate the corresponding solutions to arbitrary precision, with rank, depth, and width growing only logarithmically in the reciprocal of the approximation error. We then validate both our theoretical results and the trainability of GEOs on three applications: (1) nonlinear PDEs, (2) stochastic optimal control problems, and (3) hedging problems in mathematical finance under liquidity constraints. ...

August 20, 2025 · 2 min · Research Team

Variable selection for minimum-variance portfolios

Variable selection for minimum-variance portfolios ArXiv ID: 2508.14986 “View on arXiv” Authors: Guilherme V. Moura, André P. Santos, Hudson S. Torrent Abstract Machine learning (ML) methods have been successfully employed in identifying variables that can predict the equity premium of individual stocks. In this paper, we investigate if ML can also be helpful in selecting variables relevant for optimal portfolio choice. To address this question, we parameterize minimum-variance portfolio weights as a function of a large pool of firm-level characteristics as well as their second-order and cross-product transformations, yielding a total of 4,610 predictors. We find that the gains from employing ML to select relevant predictors are substantial: minimum-variance portfolios achieve lower risk relative to sparse specifications commonly considered in the literature, especially when non-linear terms are added to the predictor space. Moreover, some of the selected predictors that help decreasing portfolio risk also increase returns, leading to minimum-variance portfolios with good performance in terms of Shape ratios in some situations. Our evidence suggests that ad-hoc sparsity can be detrimental to the performance of minimum-variance characteristics-based portfolios. ...

August 20, 2025 · 2 min · Research Team

AlphaX: An AI-Based Value Investing Strategy for the Brazilian Stock Market

AlphaX: An AI-Based Value Investing Strategy for the Brazilian Stock Market ArXiv ID: 2508.13429 “View on arXiv” Authors: Paulo André Lima de Castro Abstract Autonomous trading strategies have been a subject of research within the field of artificial intelligence (AI) for aconsiderable period. Various AI techniques have been explored to develop autonomous agents capable of trading financial assets. These approaches encompass traditional methods such as neural networks, fuzzy logic, and reinforcement learning, as well as more recent advancements, including deep neural networks and deep reinforcement learning. Many developers report success in creating strategies that exhibit strong performance during simulations using historical price data, a process commonly referred to as backtesting. However, when these strategies are deployed in real markets, their performance often deteriorates, particularly in terms of risk-adjusted returns. In this study, we propose an AI-based strategy inspired by a classical investment paradigm: Value Investing. Financial AI models are highly susceptible to lookahead bias and other forms of bias that can significantly inflate performance in backtesting compared to live trading conditions. To address this issue, we conducted a series of computational simulations while controlling for these biases, thereby reducing the risk of overfitting. Our results indicate that the proposed approach outperforms major Brazilian market benchmarks. Moreover, the strategy, named AlphaX, demonstrated superior performance relative to widely used technical indicators such as the Relative Strength Index (RSI) and Money Flow Index (MFI), with statistically significant results. Finally, we discuss several open challenges and highlight emerging technologies in qualitative analysis that may contribute to the development of a comprehensive AI-based Value Investing framework in the future ...

August 19, 2025 · 2 min · Research Team

Optimal Portfolio Construction -- A Reinforcement Learning Embedded Bayesian Hierarchical Risk Parity (RL-BHRP) Approach

Optimal Portfolio Construction – A Reinforcement Learning Embedded Bayesian Hierarchical Risk Parity (RL-BHRP) Approach ArXiv ID: 2508.11856 “View on arXiv” Authors: Shaofeng Kang, Zeying Tian Abstract We propose a two-level, learning-based portfolio method (RL-BHRP) that spreads risk across sectors and stocks, and adjusts exposures as market conditions change. Using U.S. Equities from 2012 to mid-2025, we design the model using 2012 to 2019 data, and evaluate it out-of-sample from 2020 to 2025 against a sector index built from exchange-traded funds and a static risk-balanced portfolio. Over the test window, the adaptive portfolio compounds wealth by approximately 120 percent, compared with 101 percent for the static comparator and 91 percent for the sector benchmark. The average annual growth is roughly 15 percent, compared to 13 percent and 12 percent, respectively. Gains are achieved without significant deviations from the benchmark and with peak-to-trough losses comparable to those of the alternatives, indicating that the method adds value while remaining diversified and investable. Weight charts show gradual shifts rather than abrupt swings, reflecting disciplined rebalancing and the cost-aware design. Overall, the results support risk-balanced, adaptive allocation as a practical approach to achieving stronger and more stable long-term performance. ...

August 16, 2025 · 2 min · Research Team