StockGPT: A GenAI Model for Stock Prediction and Trading
ArXiv ID: 2404.05101 “View on arXiv”
Authors: Unknown
Abstract
This paper introduces StockGPT, an autoregressive ``number’’ model trained and tested on 70 million daily U.S.\ stock returns over nearly 100 years. Treating each return series as a sequence of tokens, StockGPT automatically learns the hidden patterns predictive of future returns via its attention mechanism. On a held-out test sample from 2001 to 2023, daily and monthly rebalanced long-short portfolios formed from StockGPT predictions yield strong performance. The StockGPT-based portfolios span momentum and long-/short-term reversals, eliminating the need for manually crafted price-based strategies, and yield highly significant alphas against leading stock market factors, suggesting a novel AI pricing effect. This highlights the immense promise of generative AI in surpassing human in making complex financial investment decisions.
Keywords: Autoregressive Model, Attention Mechanism, Predictive Modeling, Generative AI, Long-Short Portfolios, Equity (Stocks)
Complexity vs Empirical Score
- Math Complexity: 7.5/10
- Empirical Rigor: 9.0/10
- Quadrant: Holy Grail
- Why: The paper employs advanced generative AI/transformer architectures with attention mechanisms (high math complexity) and demonstrates rigorous out-of-sample backtesting across 23 years with 70M data points, detailed portfolio construction, and statistical significance testing (high empirical rigor).
flowchart TD
A["Research Goal: Can Generative AI<br>surpass human stock prediction?"] --> B["Methodology: Autoregressive<br>Attention-Based Model"]
B --> C["Input: 70M Daily US Stock<br>Returns (1926-2023)"]
C --> D["Process: Train StockGPT to<br>predict next return tokens"]
D --> E["Output: Daily/Monthly<br>Long-Short Portfolios"]
E --> F["Key Findings:<br>Strong Out-of-Sample Returns<br>Significant Alphas vs. Factors<br>Novel AI Pricing Effect"]