A Framework for Predictive Directional Trading Based on Volatility and Causal Inference
ArXiv ID: 2507.09347 “View on arXiv”
Authors: Ivan Letteri
Abstract
Purpose: This study introduces a novel framework for identifying and exploiting predictive lead-lag relationships in financial markets. We propose an integrated approach that combines advanced statistical methodologies with machine learning models to enhance the identification and exploitation of predictive relationships between equities. Methods: We employed a Gaussian Mixture Model (GMM) to cluster nine prominent stocks based on their mid-range historical volatility profiles over a three-year period. From the resulting clusters, we constructed a multi-stage causal inference pipeline, incorporating the Granger Causality Test (GCT), a customised Peter-Clark Momentary Conditional Independence (PCMCI) test, and Effective Transfer Entropy (ETE) to identify robust, predictive linkages. Subsequently, Dynamic Time Warping (DTW) and a K-Nearest Neighbours (KNN) classifier were utilised to determine the optimal time lag for trade execution. The resulting strategy was rigorously backtested. Results: The proposed volatility-based trading strategy, tested from 8 June 2023 to 12 August 2023, demonstrated substantial efficacy. The portfolio yielded a total return of 15.38%, significantly outperforming the 10.39% return of a comparative Buy-and-Hold strategy. Key performance metrics, including a Sharpe Ratio up to 2.17 and a win rate up to 100% for certain pairs, confirmed the strategy’s viability. Conclusion: This research contributes a systematic and robust methodology for identifying profitable trading opportunities derived from volatility-based causal relationships. The findings have significant implications for both academic research in financial modelling and the practical application of algorithmic trading, offering a structured approach to developing resilient, data-driven strategies.
Keywords: Volatility clustering, Granger causality, Dynamic Time Warping, Causal inference, Algorithmic trading
Complexity vs Empirical Score
- Math Complexity: 8.5/10
- Empirical Rigor: 7.0/10
- Quadrant: Holy Grail
- Why: The paper employs a sophisticated mathematical pipeline combining GMM clustering, multiple causal inference tests (Granger, PCMCI, Transfer Entropy), and DTW for lag optimization, indicating high mathematical density. While the backtest results show strong performance metrics and a defined testing period, the absence of publicly available code or datasets and reliance on a specific, short out-of-sample window slightly limits the empirical rigor compared to a fully reproducible study.
flowchart TD
A["Research Goal"] -->|Identify lead-lag relationships| B["Data Preprocessing"]
B --> C["Variance Reduction<br>GMM Clustering"]
C --> D["Reliability Assessment<br>Multi-stage Causal Pipeline"]
D -->|GCT, PCMCI, ETE| E["Trade Signal Generation<br>DTW + KNN"]
E --> F["Strategy Execution"]
F --> G["Performance Evaluation"]
style A fill:#e1f5fe
style G fill:#e8f5e8
style D fill:#fff3e0