TIP-Search: Time-Predictable Inference Scheduling for Market Prediction under Uncertain Load

ArXiv ID: 2506.08026 “View on arXiv”

Authors: Xibai Wang

Abstract

This paper proposes TIP-Search, a time-predictable inference scheduling framework for real-time market prediction under uncertain workloads. Motivated by the strict latency demands in high-frequency financial systems, TIP-Search dynamically selects a deep learning model from a heterogeneous pool, aiming to maximize predictive accuracy while satisfying per-task deadline constraints. Our approach profiles latency and generalization performance offline, then performs online task-aware selection without relying on explicit input domain labels. We evaluate TIP-Search on three real-world limit order book datasets (FI-2010, Binance BTC/USDT, LOBSTER AAPL) and demonstrate that it outperforms static baselines with up to 8.5% improvement in accuracy and 100% deadline satisfaction. Our results highlight the effectiveness of TIP-Search in robust low-latency financial inference under uncertainty.

Keywords: Real-time Inference, Deep Learning, Limit Order Book (LOB), High-Frequency Trading, Scheduling Framework, Equities

Complexity vs Empirical Score

Math Complexity: 6.0/10
Empirical Rigor: 7.5/10
Quadrant: Holy Grail
Why: The paper introduces a formal optimization formulation and a Pareto frontier concept for model selection under constraints, demonstrating moderate mathematical complexity. It provides strong empirical rigor by testing on multiple real-world LOB datasets, implementing dynamic scheduling, and reporting quantitative improvements (e.g., 8.5% accuracy gain, 100% deadline satisfaction) with ablation studies.

  flowchart TD
    A["Research Goal: Develop a time-predictable<br>inference scheduler for real-time<br>market prediction under uncertain load"] --> B["Data Preparation: Use Limit Order Book datasets<br>(FI-2010, Binance BTC/USDT, LOBSTER AAPL)"]
    B --> C["Offline Profiling: Characterize model latency<br>and generalization performance"]
    C --> D["Online Scheduling: Dynamic model selection<br>via TIP-Search to maximize accuracy<br>while meeting strict deadlines"]
    D --> E{"Evaluation & Outcomes"}
    E --> F["+8.5% Accuracy<br>vs. static baselines"]
    E --> G["100% Deadline Satisfaction<br>under uncertain load"]
    E --> H["Robust Low-Latency<br>Financial Inference"]

TIP-Search: Time-Predictable Inference Scheduling for Market Prediction under Uncertain Load#

Abstract#

Complexity vs Empirical Score#

TIP-Search: Time-Predictable Inference Scheduling for Market Prediction under Uncertain Load

Abstract

Complexity vs Empirical Score