RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search

ArXiv ID: 2402.07080 “View on arXiv”

Authors: Unknown

Abstract

The formulaic alphas are mathematical formulas that transform raw stock data into indicated signals. In the industry, a collection of formulaic alphas is combined to enhance modeling accuracy. Existing alpha mining only employs the neural network agent, unable to utilize the structural information of the solution space. Moreover, they didn’t consider the correlation between alphas in the collection, which limits the synergistic performance. To address these problems, we propose a novel alpha mining framework, which formulates the alpha mining problems as a reward-dense Markov Decision Process (MDP) and solves the MDP by the risk-seeking Monte Carlo Tree Search (MCTS). The MCTS-based agent fully exploits the structural information of discrete solution space and the risk-seeking policy explicitly optimizes the best-case performance rather than average outcomes. Comprehensive experiments are conducted to demonstrate the efficiency of our framework. Our method outperforms all state-of-the-art benchmarks on two real-world stock sets under various metrics. Backtest experiments show that our alphas achieve the most profitable results under a realistic trading setting.

Keywords: Formulaic Alphas, Monte Carlo Tree Search (MCTS), Markov Decision Process (MDP), Reinforcement Learning, Quantitative Trading, Equities

Complexity vs Empirical Score

  • Math Complexity: 7.5/10
  • Empirical Rigor: 8.0/10
  • Quadrant: Holy Grail
  • Why: The paper employs advanced concepts like MCTS, reward-dense MDPs, and risk-seeking policy optimization, indicating high math complexity, while demonstrating empirical rigor through comprehensive experiments on real-world datasets, backtests, and ablation studies.
  flowchart TD
    A["Research Goal: Discover formulaic alphas<br>that generate high returns<br>via Risk-Seeking MCTS"] --> B["Data Input:<br>Raw Stock Data<br>Preprocessing"]
    B --> C["Core Methodology:<br>Reward-Dense MDP Formulation"]
    C --> D["MCTS Agent:<br>Risk-Seeking Policy<br>Optimizes Best-Case Scenarios"]
    D --> E["Alpha Generation:<br>Formulaic Alphas<br>discovered by MCTS"]
    E --> F["Experiments & Backtest:<br>Performance Evaluation"]
    F --> G["Key Findings:<br>Outperforms SOTA benchmarks<br>Most profitable in trading settings"]