Hidden Order in Trades Predicts the Size of Price Moves

ArXiv ID: 2512.15720 “View on arXiv”

Authors: Mainak Singha

Abstract

Financial markets exhibit an apparent paradox: while directional price movements remain largely unpredictable–consistent with weak-form efficiency–the magnitude of price changes displays systematic structure. Here we demonstrate that real-time order-flow entropy, computed from a 15-state Markov transition matrix at second resolution, predicts the magnitude of intraday returns without providing directional information. Analysis of 38.5 million SPY trades over 36 trading days reveals that conditioning on entropy below the 5th percentile increases subsequent 5-minute absolute returns by a factor of 2.89 (t = 12.41, p < 0.0001), while directional accuracy remains at 45.0%–statistically indistinguishable from chance (p = 0.12). This decoupling arises from a fundamental symmetry: entropy is invariant under sign permutation, detecting the presence of informed trading without revealing its direction. Walk-forward validation across five non-overlapping test periods confirms out-of-sample predictability, and label-permutation placebo tests yield z = 14.4 against the null. These findings suggest that information-theoretic measures may serve as volatility state variables in market microstructure, though the limited sample (36 days, single instrument) requires extended validation.

Keywords: order-flow entropy, market microstructure, volatility prediction, Markov transition, information theory

Complexity vs Empirical Score

  • Math Complexity: 7.5/10
  • Empirical Rigor: 8.0/10
  • Quadrant: Holy Grail
  • Why: The paper uses advanced mathematical concepts like Markov chains, information theory, and stationary distributions, and includes formal theorems and derivations (math-heavy), while also presenting detailed backtest results, walk-forward validation, robustness checks, and specific trading rules (data/implementation-heavy).
  flowchart TD
    A["Research Goal:<br>Predict Price Move Magnitude<br>without Direction"] --> B["Data & Inputs<br>38.5M SPY Trades<br>36 Trading Days<br>1-Second Resolution"]
    B --> C["Key Methodology<br>15-State Markov Transition Matrix<br>Compute Order-Flow Entropy"]
    C --> D["Computational Process<br>Real-time Entropy Calculation"]
    D --> E["Analysis<br>Condition on Low Entropy<br>5th Percentile Threshold"]
    E --> F["Key Findings / Outcomes"]
    F --> G["Predictability: 5-Min Abs Return<br>2.89x Increase (t=12.41, p<0.0001)"]
    F --> H["Non-Predictability: Direction<br>45.0% Accuracy (Chance Level)"]
    F --> I["Robustness<br>Walk-forward validation passed<br>Placebo test z=14.4"]