false

DeltaHedge: A Multi-Agent Framework for Portfolio Options Optimization

DeltaHedge: A Multi-Agent Framework for Portfolio Options Optimization ArXiv ID: 2509.12753 “View on arXiv” Authors: Feliks Bańka, Jarosław A. Chudziak Abstract In volatile financial markets, balancing risk and return remains a significant challenge. Traditional approaches often focus solely on equity allocation, overlooking the strategic advantages of options trading for dynamic risk hedging. This work presents DeltaHedge, a multi-agent framework that integrates options trading with AI-driven portfolio management. By combining advanced reinforcement learning techniques with an ensembled options-based hedging strategy, DeltaHedge enhances risk-adjusted returns and stabilizes portfolio performance across varying market conditions. Experimental results demonstrate that DeltaHedge outperforms traditional strategies and standalone models, underscoring its potential to transform practical portfolio management in complex financial environments. Building on these findings, this paper contributes to the fields of quantitative finance and AI-driven portfolio optimization by introducing a novel multi-agent system for integrating options trading strategies, addressing a gap in the existing literature. ...

September 16, 2025 · 2 min · Research Team

Myopic Optimality: why reinforcement learning portfolio management strategies lose money

Myopic Optimality: why reinforcement learning portfolio management strategies lose money ArXiv ID: 2509.12764 “View on arXiv” Authors: Yuming Ma Abstract Myopic optimization (MO) outperforms reinforcement learning (RL) in portfolio management: RL yields lower or negative returns, higher variance, larger costs, heavier CVaR, lower profitability, and greater model risk. We model execution/liquidation frictions with mark-to-market accounting. Using Malliavin calculus (Clark-Ocone/BEL), we derive policy gradients and risk shadow price, unifying HJB and KKT. This gives dual gap and convergence results: geometric MO vs. RL floors. We quantify phantom profit in RL via Malliavin policy-gradient contamination analysis and define a control-affects-dynamics (CAD) premium of RL indicating plausibly positive. ...

September 16, 2025 · 2 min · Research Team

QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning

QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning ArXiv ID: 2508.20467 “View on arXiv” Authors: Xiangdong Liu, Jiahao Chen Abstract In the highly volatile and uncertain global financial markets, traditional quantitative trading models relying on statistical modeling or empirical rules often fail to adapt to dynamic market changes and black swan events due to rigid assumptions and limited generalization. To address these issues, this paper proposes QTMRL (Quantitative Trading Multi-Indicator Reinforcement Learning), an intelligent trading agent combining multi-dimensional technical indicators with reinforcement learning (RL) for adaptive and stable portfolio management. We first construct a comprehensive multi-indicator dataset using 23 years of S&P 500 daily OHLCV data (2000-2022) for 16 representative stocks across 5 sectors, enriching raw data with trend, volatility, and momentum indicators to capture holistic market dynamics. Then we design a lightweight RL framework based on the Advantage Actor-Critic (A2C) algorithm, including data processing, A2C algorithm, and trading agent modules to support policy learning and actionable trading decisions. Extensive experiments compare QTMRL with 9 baselines (e.g., ARIMA, LSTM, moving average strategies) across diverse market regimes, verifying its superiority in profitability, risk adjustment, and downside risk control. The code of QTMRL is publicly available at https://github.com/ChenJiahaoJNU/QTMRL.git ...

August 28, 2025 · 2 min · Research Team

AlphaAgents: Large Language Model based Multi-Agents for Equity Portfolio Constructions

AlphaAgents: Large Language Model based Multi-Agents for Equity Portfolio Constructions ArXiv ID: 2508.11152 “View on arXiv” Authors: Tianjiao Zhao, Jingrao Lyu, Stokes Jones, Harrison Garber, Stefano Pasquali, Dhagash Mehta Abstract The field of artificial intelligence (AI) agents is evolving rapidly, driven by the capabilities of Large Language Models (LLMs) to autonomously perform and refine tasks with human-like efficiency and adaptability. In this context, multi-agent collaboration has emerged as a promising approach, enabling multiple AI agents to work together to solve complex challenges. This study investigates the application of role-based multi-agent systems to support stock selection in equity research and portfolio management. We present a comprehensive analysis performed by a team of specialized agents and evaluate their stock-picking performance against established benchmarks under varying levels of risk tolerance. Furthermore, we examine the advantages and limitations of employing multi-agent frameworks in equity analysis, offering critical insights into their practical efficacy and implementation challenges. ...

August 15, 2025 · 2 min · Research Team

Time-Varying Factor-Augmented Models for Volatility Forecasting

Time-Varying Factor-Augmented Models for Volatility Forecasting ArXiv ID: 2508.01880 “View on arXiv” Authors: Duo Zhang, Jiayu Li, Junyi Mo, Elynn Chen Abstract Accurate volatility forecasts are vital in modern finance for risk management, portfolio allocation, and strategic decision-making. However, existing methods face key limitations. Fully multivariate models, while comprehensive, are computationally infeasible for realistic portfolios. Factor models, though efficient, primarily use static factor loadings, failing to capture evolving volatility co-movements when they are most critical. To address these limitations, we propose a novel, model-agnostic Factor-Augmented Volatility Forecast framework. Our approach employs a time-varying factor model to extract a compact set of dynamic, cross-sectional factors from realized volatilities with minimal computational cost. These factors are then integrated into both statistical and AI-based forecasting models, enabling a unified system that jointly models asset-specific dynamics and evolving market-wide co-movements. Our framework demonstrates strong performance across two prominent asset classes-large-cap U.S. technology equities and major cryptocurrencies-over both short-term (1-day) and medium-term (7-day) horizons. Using a suite of linear and non-linear AI-driven models, we consistently observe substantial improvements in predictive accuracy and economic value. Notably, a practical pairs-trading strategy built on our forecasts delivers superior risk-adjusted returns and profitability, particularly under adverse market conditions. ...

August 3, 2025 · 2 min · Research Team

Two Stochastic Control Methods for Mean-Variance Portfolio Selection of Jump Diffusions and Their Relationship

Two Stochastic Control Methods for Mean-Variance Portfolio Selection of Jump Diffusions and Their Relationship ArXiv ID: 2508.01138 “View on arXiv” Authors: Qiyue Zhang, Jingtao Shi Abstract This paper is concerned with the maximum principle and dynamic programming principle for mean-variance portfolio selection of jump diffusions and their relationship. First, the optimal portfolio and efficient frontier of the problem are obtained using both methods. Furthermore, the relationship between these two methods is investigated. Specially, the connections between the adjoint processes and value function are given. ...

August 2, 2025 · 1 min · Research Team

Is Causality Necessary for Efficient Portfolios? A Computational Perspective on Predictive Validity and Model Misspecification

Is Causality Necessary for Efficient Portfolios? A Computational Perspective on Predictive Validity and Model Misspecification ArXiv ID: 2507.23138 “View on arXiv” Authors: Alejandro Rodriguez Dominguez Abstract A recent line of research has argued that causal factor models are necessary for portfolio optimization, claiming that structurally misspecified models inevitably produce inverted signals and nonviable frontiers. This paper challenges that view. We show, through theoretical analysis, simulation counterexamples, and empirical validation, that predictive models can remain operationally valid even when structurally incorrect. Our contributions are fourfold. First, we distinguish between directional agreement, ranking, and calibration, proving that sign alignment alone does not ensure efficiency when signals are mis-scaled. Second, we establish that structurally misspecified signals can still yield convex and viable efficient frontiers provided they maintain directional alignment with true returns. Third, we derive and empirically confirm a quantitative scaling law that shows how Sharpe ratios contract smoothly with declining alignment, thereby clarifying the role of calibration within the efficient set. Fourth, we validate these results on real financial data, demonstrating that predictive signals, despite structural imperfections, can support coherent frontiers. These findings refine the debate on causality in portfolio modeling. While causal inference remains valuable for interpretability and risk attribution, it is not a prerequisite for optimization efficiency. Ultimately, what matters is the directional fidelity and calibration of predictive signals in relation to their intended use in robust portfolio construction. ...

July 30, 2025 · 2 min · Research Team

Pathwise analysis of log-optimal portfolios

Pathwise analysis of log-optimal portfolios ArXiv ID: 2507.18232 “View on arXiv” Authors: Andrew L. Allan, Anna P. Kwossek, Chong Liu, David J. Prömel Abstract Based on the theory of càdlàg rough paths, we develop a pathwise approach to analyze stability and approximation properties of portfolios along individual price trajectories generated by standard models of financial markets. As a prototypical example from portfolio theory, we study the log-optimal portfolio in a classical investment-consumption optimization problem on a frictionless financial market modelled by an Itô diffusion process. We identify a fully deterministic framework that enables a pathwise construction of the log-optimal portfolio, for which we then establish pathwise stability estimates with respect to the underlying model parameters. We also derive pathwise error estimates arising from the time-discretization of the log-optimal portfolio and its associated capital process. ...

July 24, 2025 · 2 min · Research Team

Deep Declarative Risk Budgeting Portfolios

Deep Declarative Risk Budgeting Portfolios ArXiv ID: 2504.19980 “View on arXiv” Authors: Manuel Parra-Diaz, Carlos Castro-Iragorri Abstract Recent advances in deep learning have spurred the development of end-to-end frameworks for portfolio optimization that utilize implicit layers. However, many such implementations are highly sensitive to neural network initialization, undermining performance consistency. This research introduces a robust end-to-end framework tailored for risk budgeting portfolios that effectively reduces sensitivity to initialization. Importantly, this enhanced stability does not compromise portfolio performance, as our framework consistently outperforms the risk parity benchmark. ...

April 28, 2025 · 1 min · Research Team

Improving Bayesian Optimization for Portfolio Management with an Adaptive Scheduling

Improving Bayesian Optimization for Portfolio Management with an Adaptive Scheduling ArXiv ID: 2504.13529 “View on arXiv” Authors: Unknown Abstract Existing black-box portfolio management systems are prevalent in the financial industry due to commercial and safety constraints, though their performance can fluctuate dramatically with changing market regimes. Evaluating these non-transparent systems is computationally expensive, as fixed budgets limit the number of possible observations. Therefore, achieving stable and sample-efficient optimization for these systems has become a critical challenge. This work presents a novel Bayesian optimization framework (TPE-AS) that improves search stability and efficiency for black-box portfolio models under these limited observation budgets. Standard Bayesian optimization, which solely maximizes expected return, can yield erratic search trajectories and misalign the surrogate model with the true objective, thereby wasting the limited evaluation budget. To mitigate these issues, we propose a weighted Lagrangian estimator that leverages an adaptive schedule and importance sampling. This estimator dynamically balances exploration and exploitation by incorporating both the maximization of model performance and the minimization of the variance of model observations. It guides the search from broad, performance-seeking exploration towards stable and desirable regions as the optimization progresses. Extensive experiments and ablation studies, which establish our proposed method as the primary approach and other configurations as baselines, demonstrate its effectiveness across four backtest settings with three distinct black-box portfolio management models. ...

April 18, 2025 · 2 min · Research Team