ChatGPT in Systematic Investing – Enhancing Risk-Adjusted Returns with LLMs
ArXiv ID: 2510.26228 “View on arXiv”
Authors: Nikolas Anic, Andrea Barbon, Ralf Seiz, Carlo Zarattini
Abstract
This paper investigates whether large language models (LLMs) can improve cross-sectional momentum strategies by extracting predictive signals from firm-specific news. We combine daily U.S. equity returns for S&P 500 constituents with high-frequency news data and use prompt-engineered queries to ChatGPT that inform the model when a stock is about to enter a momentum portfolio. The LLM evaluates whether recent news supports a continuation of past returns, producing scores that condition both stock selection and portfolio weights. An LLM-enhanced momentum strategy outperforms a standard long-only momentum benchmark, delivering higher Sharpe and Sortino ratios both in-sample and in a truly out-of-sample period after the model’s pre-training cut-off. These gains are robust to transaction costs, prompt design, and portfolio constraints, and are strongest for concentrated, high-conviction portfolios. The results suggest that LLMs can serve as effective real-time interpreters of financial news, adding incremental value to established factor-based investment strategies.
Keywords: large language models, momentum strategy, prompt engineering, ChatGPT, news analytics, Equities (S&P 500)
Complexity vs Empirical Score
- Math Complexity: 2.5/10
- Empirical Rigor: 9.0/10
- Quadrant: Street Traders
- Why: The paper’s core mathematical complexity is moderate, focusing on standard momentum signal construction and portfolio weighting, with no advanced statistical theory or heavy derivations. However, it demonstrates high empirical rigor by utilizing high-frequency news and equity datasets, conducting in-sample and out-of-sample backtests with robustness checks against transaction costs and prompt variations, and reporting performance metrics like Sharpe and Sortino ratios.
flowchart TD
A["Research Goal<br>Can LLMs enhance<br>cross-sectional momentum<br>strategies using news?"]
B["Data & Inputs<br>Daily S&P 500 Returns +<br>High-Frequency News Data"]
C["Core Methodology<br>ChatGPT Prompt Engineering<br>Stock enters momentum portfolio?"]
D["Computational Process<br>LLM Evaluation Score<br>Conditions Selection & Weights"]
E["Key Outcome 1<br>LLM-Enhanced Strategy<br>Outperforms Benchmark"]
F["Key Outcome 2<br>Higher Sharpe & Sortino Ratios<br>Robust Out-of-Sample Results"]
A --> B
B --> C
C --> D
D --> E
D --> F