false

When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making

When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making ArXiv ID: 2510.27334 “View on arXiv” Authors: Ali Raza Jafree, Konark Jain, Nick Firoozye Abstract We investigate the mechanisms by which medium-frequency trading agents are adversely selected by opportunistic high-frequency traders. We use reinforcement learning (RL) within a Hawkes Limit Order Book (LOB) model in order to replicate the behaviours of high-frequency market makers. In contrast to the classical models with exogenous price impact assumptions, the Hawkes model accounts for endogenous price impact and other key properties of the market (Jain et al. 2024a). Given the real-world impracticalities of the market maker updating strategies for every event in the LOB, we formulate the high-frequency market making agent via an impulse control reinforcement learning framework (Jain et al. 2025). The RL used in the simulation utilises Proximal Policy Optimisation (PPO) and self-imitation learning. To replicate the adverse selection phenomenon, we test the RL agent trading against a medium frequency trader (MFT) executing a meta-order and demonstrate that, with training against the MFT meta-order execution agent, the RL market making agent learns to capitalise on the price drift induced by the meta-order. Recent empirical studies have shown that medium-frequency traders are increasingly subject to adverse selection by high-frequency trading agents. As high-frequency trading continues to proliferate across financial markets, the slippage costs incurred by medium-frequency traders are likely to increase over time. However, we do not observe that increased profits for the market making RL agent necessarily cause significantly increased slippages for the MFT agent. ...

October 31, 2025 · 2 min · Research Team

Optimal Quoting under Adverse Selection and Price Reading

Optimal Quoting under Adverse Selection and Price Reading ArXiv ID: 2508.20225 “View on arXiv” Authors: Alexander Barzykin, Philippe Bergault, Olivier Guéant, Malo Lemmel Abstract Over the past decade, many dealers have implemented algorithmic models to automatically respond to RFQs and manage flows originating from their electronic platforms. In parallel, building on the foundational work of Ho and Stoll, and later Avellaneda and Stoikov, the academic literature on market making has expanded to address trade size distributions, client tiering, complex price dynamics, alpha signals, and the internalization versus externalization dilemma in markets with dealer-to-client and interdealer-broker segments. In this paper, we tackle two critical dimensions: adverse selection, arising from the presence of informed traders, and price reading, whereby the market maker’s own quotes inadvertently reveal the direction of their inventory. These risks are well known to practitioners, who routinely face informed flows and algorithms capable of extracting signals from quoting behavior. Yet they have received limited attention in the quantitative finance literature, beyond stylized toy models with limited actionability. Extending the existing literature, we propose a tractable and implementable framework that enables market makers to adjust their quotes with greater awareness of informational risk. ...

August 27, 2025 · 2 min · Research Team

Optimal Fees for Liquidity Provision in Automated Market Makers

Optimal Fees for Liquidity Provision in Automated Market Makers ArXiv ID: 2508.08152 “View on arXiv” Authors: Steven Campbell, Philippe Bergault, Jason Milionis, Marcel Nutz Abstract Passive liquidity providers (LPs) in automated market makers (AMMs) face losses due to adverse selection (LVR), which static trading fees often fail to offset in practice. We study the key determinants of LP profitability in a dynamic reduced-form model where an AMM operates in parallel with a centralized exchange (CEX), traders route their orders optimally to the venue offering the better price, and arbitrageurs exploit price discrepancies. Using large-scale simulations and real market data, we analyze how LP profits vary with market conditions such as volatility and trading volume, and characterize the optimal AMM fee as a function of these conditions. We highlight the mechanisms driving these relationships through extensive comparative statics, and confirm the model’s relevance through market data calibration. A key trade-off emerges: fees must be low enough to attract volume, yet high enough to earn sufficient revenues and mitigate arbitrage losses. We find that under normal market conditions, the optimal AMM fee is competitive with the trading cost on the CEX and remarkably stable, whereas in periods of very high volatility, a high fee protects passive LPs from severe losses. These findings suggest that a threshold-type dynamic fee schedule is both robust enough to market conditions and improves LP outcomes. ...

August 11, 2025 · 2 min · Research Team

A Simple Strategy to Deal with Toxic Flow

A Simple Strategy to Deal with Toxic Flow ArXiv ID: 2503.18005 “View on arXiv” Authors: Unknown Abstract We model the trading activity between a broker and her clients (informed and uninformed traders) as an infinite-horizon stochastic control problem. We derive the broker’s optimal dealing strategy in closed form and use this to introduce an algorithm that bypasses the need to calibrate individual parameters, so the dealing strategy can be executed in real-world trading environments. Finally, we characterise the discount in the price of liquidity a broker offers clients. The discount strikes the optimal balance between maximising the order flow from the broker’s clients and minimising adverse selection losses to the informed traders. ...

March 23, 2025 · 2 min · Research Team

Liquidity Competition Between Brokers and an Informed Trader

Liquidity Competition Between Brokers and an Informed Trader ArXiv ID: 2503.08287 “View on arXiv” Authors: Unknown Abstract We study a multi-agent setting in which brokers transact with an informed trader. Through a sequential Stackelberg-type game, brokers manage trading costs and adverse selection with an informed trader. In particular, supplying liquidity to the informed traders allows the brokers to speculate based on the flow information. They simultaneously attempt to minimize inventory risk and trading costs with the lit market based on the informed order flow, also known as the internalization-externalization strategy. We solve in closed form for the trading strategy that the informed trader uses with each broker and propose a system of equations which classify the equilibrium strategies of the brokers. By solving these equations numerically we may study the resulting strategies in equilibrium. Finally, we formulate a competitive game between brokers in order to determine the liquidity prices subject to precommitment supplied to the informed trader and provide a numerical example in which the resulting equilibrium is not Pareto efficient. ...

March 11, 2025 · 2 min · Research Team

Strategic Learning and Trading in Broker-Mediated Markets

Strategic Learning and Trading in Broker-Mediated Markets ArXiv ID: 2412.20847 “View on arXiv” Authors: Unknown Abstract We study strategic interactions in a broker-mediated market. A broker provides liquidity to an informed trader and to noise traders while managing inventory in the lit market. The broker and the informed trader maximise their trading performance while filtering each other’s private information; the trader estimates the broker’s trading activity in the lit market while the broker estimates the informed trader’s private signal. Brokers hold a strategic advantage over traders who rely solely on prices to filter information. We find that information leakage in the client’s trading flow yields an economic value to the broker that is comparable to transaction costs; she speculates profitably and mitigates risk effectively, which, in turn, adversely impacts the informed trader’s performance. In contrast, low signal-to-noise sources, such as prices, result in the broker’s trading performance being indistinguishable from that of a naive strategy that internalises noise flow, externalises informed flow, and offloads inventory at a constant rate. ...

December 30, 2024 · 2 min · Research Team

Broker-Trader Partial Information Nash-Equilibria

Broker-Trader Partial Information Nash-Equilibria ArXiv ID: 2412.17712 “View on arXiv” Authors: Unknown Abstract We study partial information Nash equilibrium between a broker and an informed trader. In this setting, the informed trader, who possesses knowledge of a trading signal, trades multiple assets with the broker in a dealer market. Simultaneously, the broker offloads these assets in a lit exchange where their actions impact the asset prices. The broker, however, only observes aggregate prices and cannot distinguish between underlying trends and volatility. Both the broker and the informed trader aim to maximize their penalized expected wealth. Using convex analysis, we characterize the Nash equilibrium and demonstrate its existence and uniqueness. Furthermore, we establish that this equilibrium corresponds to the solution of a nonstandard system of forward-backward stochastic differential equations (FBSDEs) that involves the two differing filtrations. For short enough time horizons, we prove that a unique solution of this system exists. Finally, under quite general assumptions, we show that the solution to the FBSDE system admits a polynomial approximation in the strength of the transient impact to arbitrary order, and prove that the error is controlled. ...

December 23, 2024 · 2 min · Research Team

Market Simulation under Adverse Selection

Market Simulation under Adverse Selection ArXiv ID: 2409.12721 “View on arXiv” Authors: Unknown Abstract In this paper, we study the effects of fill probabilities and adverse fills on the trading strategy simulation process. We specifically focus on a stochastic optimal control market-making problem and test the strategy on ES (E-mini S&P 500), NQ (E-mini Nasdaq 100), CL (Crude Oil) and ZN (10-Year Treasury Note), which are some of the most liquid futures contracts listed on the CME (Chicago Mercantile Exchange). We provide empirical evidence that shows how fill probabilities and adverse fills can significantly affect performance and propose a more prudent simulation framework to deal with this. Many previous works aim to measure different types of adverse selection in the limit order book (LOB), however, they often simulate price processes and market orders independently. This has the ability to largely inflate the performance of a short-term style trading strategy. Our studies show that using more realistic fill probabilities and tracking adverse fills in the strategy simulation process more accurately shows how these types of trading strategies would perform in reality. ...

September 19, 2024 · 2 min · Research Team

The Negative Drift of a Limit Order Fill

The Negative Drift of a Limit Order Fill ArXiv ID: 2407.16527 “View on arXiv” Authors: Unknown Abstract Market making refers to a form of trading in financial markets characterized by passive orders which add liquidity to limit order books. Market makers are important for the proper functioning of financial markets worldwide. Given the importance, financial mathematics has endeavored to derive optimal strategies for placing limit orders in this context. This paper identifies a key discrepancy between popular model assumptions and the realities of real markets, specifically regarding the dynamics around limit order fills. Traditionally, market making models rely on an assumption of low-cost random fills, when in reality we observe a high-cost non-random fill behavior. Namely, limit order fills are caused by and coincide with adverse price movements, which create a drag on the market maker’s profit and loss. We refer to this phenomenon as “the negative drift” associated with limit order fills. We describe a discrete market model and prove theoretically that the negative drift exists. We also provide a detailed empirical simulation using one of the most traded financial instruments in the world, the 10 Year US Treasury Bond futures, which also confirms its existence. To our knowledge, this is the first paper to describe and prove this phenomenon in such detail. ...

July 23, 2024 · 2 min · Research Team

Optimal Automated Market Makers: Differentiable Economics and Strong Duality

Optimal Automated Market Makers: Differentiable Economics and Strong Duality ArXiv ID: 2402.09129 “View on arXiv” Authors: Unknown Abstract The role of a market maker is to simultaneously offer to buy and sell quantities of goods, often a financial asset such as a share, at specified prices. An automated market maker (AMM) is a mechanism that offers to trade according to some predetermined schedule; the best choice of this schedule depends on the market maker’s goals. The literature on the design of AMMs has mainly focused on prediction markets with the goal of information elicitation. More recent work motivated by DeFi has focused instead on the goal of profit maximization, but considering only a single type of good (traded with a numeraire), including under adverse selection (Milionis et al. 2022). Optimal market making in the presence of multiple goods, including the possibility of complex bundling behavior, is not well understood. In this paper, we show that finding an optimal market maker is dual to an optimal transport problem, with specific geometric constraints on the transport plan in the dual. We show that optimal mechanisms for multiple goods and under adverse selection can take advantage of bundling, both improved prices for bundled purchases and sales as well as sometimes accepting payment “in kind.” We present conjectures of optimal mechanisms in additional settings which show further complex behavior. From a methodological perspective, we make essential use of the tools of differentiable economics to generate conjectures of optimal mechanisms, and give a proof-of-concept for the use of such tools in guiding theoretical investigations. ...

February 14, 2024 · 2 min · Research Team