false

From Data Acquisition to Lag Modeling: Quantitative Exploration of A-Share Market with Low-Coupling System Design

From Data Acquisition to Lag Modeling: Quantitative Exploration of A-Share Market with Low-Coupling System Design ArXiv ID: 2506.19255 “View on arXiv” Authors: Jianyong Fang, Sitong Wu, Junfan Tong Abstract We propose a novel two-stage framework to detect lead-lag relationships in the Chinese A-share market. First, long-term coupling between stocks is measured via daily data using correlation, dynamic time warping, and rank-based metrics. Then, high-frequency data (1-, 5-, and 15-minute) is used to detect statistically significant lead-lag patterns via cross-correlation, Granger causality, and regression models. Our low-coupling modular system supports scalable data processing and improves reproducibility. Results show that strongly coupled stock pairs often exhibit lead-lag effects, especially at finer time scales. These findings provide insights into market microstructure and quantitative trading opportunities. ...

June 24, 2025 · 2 min · Research Team

Causal Interventions in Bond Multi-Dealer-to-Client Platforms

Causal Interventions in Bond Multi-Dealer-to-Client Platforms ArXiv ID: 2506.18147 “View on arXiv” Authors: Paloma Marín, Sergio Ardanza-Trevijano, Javier Sabio Abstract The digitalization of financial markets has shifted trading from voice to electronic channels, with Multi-Dealer-to-Client (MD2C) platforms now enabling clients to request quotes (RfQs) for financial instruments like bonds from multiple dealers simultaneously. In this competitive landscape, dealers cannot see each other’s prices, making a rigorous analysis of the negotiation process crucial to ensure their profitability. This article introduces a novel general framework for analyzing the RfQ process using probabilistic graphical models and causal inference. Within this framework, we explore different inferential questions that are relevant for dealers participating in MD2C platforms, such as the computation of optimal prices, estimating potential revenues and the identification of clients that might be interested in trading the dealer’s axes. We then move into analyzing two different approaches for model specification: a generative model built on the work of (Fermanian, Guéant, & Pu, 2017); and discriminative models utilizing machine learning techniques. Our results show that generative models can match the predictive accuracy of leading discriminative algorithms such as LightGBM (ROC-AUC: 0.742 vs. 0.743) while simultaneously enforcing critical business requirements, notably spread monotonicity. ...

June 22, 2025 · 2 min · Research Team

Multi-dimensional queue-reactive model and signal-driven models: a unified framework

Multi-dimensional queue-reactive model and signal-driven models: a unified framework ArXiv ID: 2506.11843 “View on arXiv” Authors: Emmanouil Sfendourakis Abstract We present a Markovian market model driven by a hidden Brownian efficient price. In particular, we extend the queue-reactive model, making its dynamics dependent on the efficient price. Our study focuses on two sub-models: a signal-driven price model where the mid-price jump rates depend on the efficient price and an observable signal, and the usual queue-reactive model dependent on the efficient price via the intensities of the order arrivals. This way, we are able to correlate the evolution of limit order books of different stocks. We prove the stability of the observed mid-price around the efficient price under natural assumptions. Precisely, we show that at the macroscopic scale, prices behave as diffusions. We also develop a maximum likelihood estimation procedure for the model, and test it numerically. Our model is them used to backest trading strategies in a liquidation context. ...

June 13, 2025 · 2 min · Research Team

The Subtle Interplay between Square-root Impact, Order Imbalance & Volatility: A Unifying Framework

The Subtle Interplay between Square-root Impact, Order Imbalance & Volatility: A Unifying Framework ArXiv ID: 2506.07711 “View on arXiv” Authors: Guillaume Maitrier, Jean-Philippe Bouchaud Abstract In this work, we aim to reconcile several apparently contradictory observations in market microstructure: is the famous “square-root law” of metaorder impact, which decays with time, compatible with the random-walk nature of prices and the linear impact of order imbalances? Can one entirely explain the volatility of prices as resulting from the flow of uninformed metaorders that mechanically impact them? We introduce a new theoretical framework to describe metaorders with different signs, sizes and durations, which all impact prices as a square-root of volume but with a subsequent time decay. We show that, as in the original propagator model, price diffusion is ensured by the long memory of cross-correlations between metaorders. In order to account for the effect of strongly fluctuating volumes q of individual trades, we further introduce two q-dependent exponents, which allow us to describe how the moments of generalized volume imbalance and the correlation between price changes and generalized order flow imbalance scale with T. We predict in particular that the corresponding power-laws depend in a non-monotonic fashion on a parameter a, which allows one to put the same weight on all child orders or to overweight large ones, a behaviour that is clearly borne out by empirical data. We also predict that the correlation between price changes and volume imbalances should display a maximum as a function of a, which again matches observations. Such noteworthy agreement between theory and data suggests that our framework correctly captures the basic mechanism at the heart of price formation, namely the average impact of metaorders. We argue that our results support the “Order-Driven” theory of excess volatility, and are at odds with the idea that a “Fundamental” component accounts for a large share of the volatility of financial markets. ...

June 9, 2025 · 3 min · Research Team

Classifying and Clustering Trading Agents

Classifying and Clustering Trading Agents ArXiv ID: 2505.21662 “View on arXiv” Authors: Mateusz Wilinski, Anubha Goel, Alexandros Iosifidis, Juho Kanniainen Abstract The rapid development of sophisticated machine learning methods, together with the increased availability of financial data, has the potential to transform financial research, but also poses a challenge in terms of validation and interpretation. A good case study is the task of classifying financial investors based on their behavioral patterns. Not only do we have access to both classification and clustering tools for high-dimensional data, but also data identifying individual investors is finally available. The problem, however, is that we do not have access to ground truth when working with real-world data. This, together with often limited interpretability of modern machine learning methods, makes it difficult to fully utilize the available research potential. In order to deal with this challenge we propose to use a realistic agent-based model as a way to generate synthetic data. This way one has access to ground truth, large replicable data, and limitless research scenarios. Using this approach we show how, even when classifying trading agents in a supervised manner is relatively easy, a more realistic task of unsupervised clustering may give incorrect or even misleading results. We complete the results with investigating the details of how supervised techniques were able to successfully distinguish between different trading behaviors. ...

May 27, 2025 · 2 min · Research Team

Stochastic Price Dynamics in Response to Order Flow Imbalance: Evidence from CSI 300 Index Futures

Stochastic Price Dynamics in Response to Order Flow Imbalance: Evidence from CSI 300 Index Futures ArXiv ID: 2505.17388 “View on arXiv” Authors: Chen Hu, Kouxiao Zhang Abstract We conduct modeling of the price dynamics following order flow imbalance in market microstructure and apply the model to the analysis of Chinese CSI 300 Index Futures. There are three findings. The first is that the order flow imbalance is analogous to a shock to the market. Unlike the common practice of using Hawkes processes, we model the impact of order flow imbalance as an Ornstein-Uhlenbeck process with memory and mean-reverting characteristics driven by a jump-type Lévy process. Motivated by the empirically stable correlation between order flow imbalance and contemporaneous price changes, we propose a modified asset price model where the drift term of canonical geometric Brownian motion is replaced by an Ornstein-Uhlenbeck process. We establish stochastic differential equations and derive the logarithmic return process along with its mean and variance processes under initial boundary conditions, and evolution of cost-effectiveness ratio with order flow imbalance as the trading trigger point, termed as the quasi-Sharpe ratio or response ratio. Secondly, our results demonstrate horizon-dependent heterogeneity in how conventional metrics interact with order flow imbalance. This underscores the critical role of forecast horizon selection for strategies. Thirdly, we identify regime-dependent dynamics in the memory and forecasting power of order flow imbalance. This taxonomy provides both a screening protocol for existing indicators and an ex-ante evaluation paradigm for novel metrics. ...

May 23, 2025 · 2 min · Research Team

ClusterLOB: Enhancing Trading Strategies by Clustering Orders in Limit Order Books

ClusterLOB: Enhancing Trading Strategies by Clustering Orders in Limit Order Books ArXiv ID: 2504.20349 “View on arXiv” Authors: Yichi Zhang, Mihai Cucuringu, Alexander Y. Shestopaloff, Stefan Zohren Abstract In the rapidly evolving world of financial markets, understanding the dynamics of limit order book (LOB) is crucial for unraveling market microstructure and participant behavior. We introduce ClusterLOB as a method to cluster individual market events in a stream of market-by-order (MBO) data into different groups. To do so, each market event is augmented with six time-dependent features. By applying the K-means++ clustering algorithm to the resulting order features, we are then able to assign each new order to one of three distinct clusters, which we identify as directional, opportunistic, and market-making participants, each capturing unique trading behaviors. Our experimental results are performed on one year of MBO data containing small-tick, medium-tick, and large-tick stocks from NASDAQ. To validate the usefulness of our clustering, we compute order flow imbalances across each cluster within 30-minute buckets during the trading day. We treat each cluster’s imbalance as a signal that provides insights into trading strategies and participants’ responses to varying market conditions. To assess the effectiveness of these signals, we identify the trading strategy with the highest Sharpe ratio in the training dataset, and demonstrate that its performance in the test dataset is superior to benchmark trading strategies that do not incorporate clustering. We also evaluate trading strategies based on order flow imbalance decompositions across different market event types, including add, cancel, and trade events, to assess their robustness in various market conditions. This work establishes a robust framework for clustering market participant behavior, which helps us to better understand market microstructure, and inform the development of more effective predictive trading signals with practical applications in algorithmic trading and quantitative finance. ...

April 29, 2025 · 3 min · Research Team

Trading Graph Neural Network

Trading Graph Neural Network ArXiv ID: 2504.07923 “View on arXiv” Authors: Unknown Abstract This paper proposes a new algorithm – Trading Graph Neural Network (TGNN) that can structurally estimate the impact of asset features, dealer features and relationship features on asset prices in trading networks. It combines the strength of the traditional simulated method of moments (SMM) and recent machine learning techniques – Graph Neural Network (GNN). It outperforms existing reduced-form methods with network centrality measures in prediction accuracy. The method can be used on networks with any structure, allowing for heterogeneity among both traders and assets. ...

April 10, 2025 · 2 min · Research Team

Optimal Execution and Macroscopic Market Making

Optimal Execution and Macroscopic Market Making ArXiv ID: 2504.06717 “View on arXiv” Authors: Unknown Abstract We propose a stochastic game modelling the strategic interaction between market makers and traders of optimal execution type. For traders, the permanent price impact commonly attributed to them is replaced by quoting strategies implemented by market makers. For market makers, order flows become endogenous, driven by tactical traders rather than assumed exogenously. Using the forward-backward stochastic differential equation (FBSDE) characterization of Nash equilibria, we establish a local well-posedness result for the general game. In the specific Almgren-Chriss-Avellaneda-Stoikov model, a decoupling approach guarantees the global well-posedness of the FBSDE system via the well-posedness of an associated backward stochastic Riccati equation. Finally, by introducing small diffusion terms into the inventory processes, global well-posedness is achieved for the approximation game. ...

April 9, 2025 · 2 min · Research Team

Modeling metaorder impact with a Non-Markovian Zero Intelligence model

Modeling metaorder impact with a Non-Markovian Zero Intelligence model ArXiv ID: 2503.05254 “View on arXiv” Authors: Unknown Abstract Devising models of the limit order book that realistically reproduce the market response to exogenous trades is extremely challenging and fundamental in order to test trading strategies. We propose a novel explainable model for small tick assets, the Non-Markovian Zero Intelligence, which is a variant of the well-known Zero Intelligence model. The main modification is that the probability of limit orders’ signs (buy/sell) is not constant but is a function of the exponentially weighted mid-price return, representing the past price dynamics, and can be interpreted as the reaction of traders with reservation prices to the price trend. With numerical simulations and analytical arguments, we show that the model predicts a concave price path during a metaorder execution and to a price reversion after the execution ends, as empirically observed. We analyze in-depth the mechanism at the root of the arising concavity, the components which constitute the price impact in our model, and the dependence of the results on the two main parameters, namely the time scale and the strength of the reaction of traders to the price trend. ...

March 7, 2025 · 2 min · Research Team