Limit Order Book Simulation and Trade Evaluation with $K$-Nearest-Neighbor Resampling
ArXiv ID: 2409.06514 “View on arXiv”
Authors: Unknown
Abstract
In this paper, we show how $K$-nearest neighbor ($K$-NN) resampling, an off-policy evaluation method proposed in \cite{“giegrich2023k”}, can be applied to simulate limit order book (LOB) markets and how it can be used to evaluate and calibrate trading strategies. Using historical LOB data, we demonstrate that our simulation method is capable of recreating realistic LOB dynamics and that synthetic trading within the simulation leads to a market impact in line with the corresponding literature. Compared to other statistical LOB simulation methods, our algorithm has theoretical convergence guarantees under general conditions, does not require optimization, is easy to implement and computationally efficient. Furthermore, we show that in a benchmark comparison our method outperforms a deep learning-based algorithm for several key statistics. In the context of a LOB with pro-rata type matching, we demonstrate how our algorithm can calibrate the size of limit orders for a liquidation strategy. Finally, we describe how $K$-NN resampling can be modified for choices of higher dimensional state spaces.
Keywords: Limit Order Book (LOB) Simulation, K-Nearest Neighbor (K-NN) Resampling, Market Impact, Off-policy Evaluation, Trading Strategy Calibration, Equities
Complexity vs Empirical Score
- Math Complexity: 7.5/10
- Empirical Rigor: 8.0/10
- Quadrant: Holy Grail
- Why: The paper utilizes advanced mathematical concepts from stochastic calculus and metric space theory for K-NN resampling with convergence proofs, while demonstrating rigorous empirical validation using historical LOB data, outperforming deep learning benchmarks and providing specific calibration examples for trading strategies.
flowchart TD
A["Research Goal<br>Simulate LOB & Evaluate/Calibrate Trading<br>using K-NN Resampling"] --> B["Data & Inputs"]
B --> C["Methodology: K-NN Resampling"]
subgraph C ["Methodology: K-NN Resampling"]
direction LR
C1["Identify State Space"] --> C2["Find K-Nearest<br>Historical Neighbors"] --> C3["Resample Future<br>Transitions/Outcomes"]
end
C --> D["Computational Process"]
subgraph D ["Computational Process"]
direction LR
D1["Market Simulation<br>Recreate LOB Dynamics"] --> D2["Strategy Evaluation<br>Simulate Synthetic Trading"]
end
D --> E["Key Findings & Outcomes"]
subgraph E ["Key Findings & Outcomes"]
E1["Validated Market Impact<br>Matches Literature"]
E2["Outperformed Deep Learning<br>in Benchmark Stats"]
E3["Calibrated Liquidation<br>Strategy Order Sizes"]
E4["Extensible to<br>Higher Dimensional States"]
end