HLOB – Information Persistence and Structure in Limit Order Books
ArXiv ID: 2405.18938 “View on arXiv”
Authors: Unknown
Abstract
We introduce a novel large-scale deep learning model for Limit Order Book mid-price changes forecasting, and we name it `HLOB’. This architecture (i) exploits the information encoded by an Information Filtering Network, namely the Triangulated Maximally Filtered Graph, to unveil deeper and non-trivial dependency structures among volume levels; and (ii) guarantees deterministic design choices to handle the complexity of the underlying system by drawing inspiration from the groundbreaking class of Homological Convolutional Neural Networks. We test our model against 9 state-of-the-art deep learning alternatives on 3 real-world Limit Order Book datasets, each including 15 stocks traded on the NASDAQ exchange, and we systematically characterize the scenarios where HLOB outperforms state-of-the-art architectures. Our approach sheds new light on the spatial distribution of information in Limit Order Books and on its degradation over increasing prediction horizons, narrowing the gap between microstructural modeling and deep learning-based forecasting in high-frequency financial markets.
Keywords: Limit Order Book, Mid-price Prediction, Deep Learning, High-Frequency Trading, Microstructure
Complexity vs Empirical Score
- Math Complexity: 8.5/10
- Empirical Rigor: 7.5/10
- Quadrant: Holy Grail
- Why: The paper introduces advanced concepts like Homological Convolutional Neural Networks and Triangulated Maximally Filtered Graphs, indicating high mathematical density, while it rigorously tests the model against 9 alternatives on 3 real-world NASDAQ datasets with systematic characterization, demonstrating strong empirical implementation and backtest-readiness.
flowchart TD
A["Research Goal<br>Mid-Price Change Forecasting"] --> B["Data Inputs<br>3 NASDAQ Limit Order Book Datasets"]
B --> C["Proposed Methodology<br>HLOB Architecture"]
C --> D["Component 1<br>Triangulated Maximally Filtered Graph<br>Information Filtering"]
C --> E["Component 2<br>Homological Convolutional Neural Networks<br>Spatial Processing"]
D & E --> F["Computational Process<br>Deep Learning Prediction"]
F --> G["Key Findings & Outcomes<br>State-of-the-Art Performance &<br>Information Structure Insight"]