LEMs: A Primer On Large Execution Models

ArXiv ID: 2509.25211 “View on arXiv”

Authors: Remi Genet, Hugo Inzirillo

Abstract

This paper introduces Large Execution Models (LEMs), a novel deep learning framework that extends transformer-based architectures to address complex execution problems with flexible time boundaries and multiple execution constraints. Building upon recent advances in neural VWAP execution strategies, LEMs generalize the approach from fixed-duration orders to scenarios where execution duration is bounded between minimum and maximum time horizons, similar to share buyback contract structures. The proposed architecture decouples market information processing from execution allocation decisions: a common feature extraction pipeline using Temporal Kolmogorov-Arnold Networks (TKANs), Variable Selection Networks (VSNs), and multi-head attention mechanisms processes market data to create informational context, while independent allocation networks handle the specific execution logic for different scenarios (fixed quantity vs. fixed notional, buy vs. sell orders). This architectural separation enables a unified model to handle diverse execution objectives while leveraging shared market understanding across scenarios. Through comprehensive empirical evaluation on intraday cryptocurrency markets and multi-day equity trading using DOW Jones constituents, we demonstrate that LEMs achieve superior execution performance compared to traditional benchmarks by dynamically optimizing execution paths within flexible time constraints. The unified model architecture enables deployment across different execution scenarios (buy/sell orders, varying duration boundaries, volume/notional targets) through a single framework, providing significant operational advantages over asset-specific approaches.

Keywords: Large Execution Models (LEMs), Temporal Kolmogorov-Arnold Networks (TKANs), Variable Selection Networks (VSNs), Transformer Architectures, VWAP Execution, Equity / Cryptocurrency

Complexity vs Empirical Score

  • Math Complexity: 7.5/10
  • Empirical Rigor: 7.0/10
  • Quadrant: Holy Grail
  • Why: The paper employs advanced mathematics including Temporal Kolmogorov-Arnold Networks (TKANs), multi-head attention, and optimal control formulations, while providing comprehensive empirical evaluation on real-world datasets (cryptocurrency and DOW Jones) with reported performance improvements.
  flowchart TD
    A["Research Goal: Generalize VWAP Execution<br>to Flexible Time Constraints"] --> B["Methodology: LEM Architecture Design"]
    
    B --> C["Data Inputs:<br>Intraday Crypto & Multi-Day Equity Markets"]
    
    C --> D["Core Computation: Market Information Processing<br>TKANs + VSNs + Multi-Head Attention"]
    
    D --> E["Parallel Execution Allocation Networks<br>Fixed Qty vs. Fixed Notional | Buy vs. Sell"]
    
    E --> F["Key Findings:<br>1. Superior execution vs. traditional benchmarks<br>2. Dynamic optimization in flexible time horizons<br>3. Unified model for multi-scenario deployment"]