false

An Efficient deep learning model to Predict Stock Price Movement Based on Limit Order Book

An Efficient deep learning model to Predict Stock Price Movement Based on Limit Order Book ArXiv ID: 2505.22678 “View on arXiv” Authors: Jiahao Yang, Ran Fang, Ming Zhang, Jun Zhou Abstract In high-frequency trading (HFT), leveraging limit order books (LOB) to model stock price movements is crucial for achieving profitable outcomes. However, this task is challenging due to the high-dimensional and volatile nature of the original data. Even recent deep learning models often struggle to capture price movement patterns effectively, particularly without well-designed features. We observed that raw LOB data exhibits inherent symmetry between the ask and bid sides, and the bid-ask differences demonstrate greater stability and lower complexity compared to the original data. Building on this insight, we propose a novel approach in which leverages the Siamese architecture to enhance the performance of existing deep learning models. The core idea involves processing the ask and bid sides separately using the same module with shared parameters. We applied our Siamese-based methods to several widely used strong baselines and validated their effectiveness using data from 14 military industry stocks in the Chinese A-share market. Furthermore, we integrated multi-head attention (MHA) mechanisms with the Long Short-Term Memory (LSTM) module to investigate its role in modeling stock price movements. Our experiments used raw data and widely used Order Flow Imbalance (OFI) features as input with some strong baseline models. The results show that our method improves the performance of strong baselines in over 75$% of cases, excluding the Multi-Layer Perception (MLP) baseline, which performed poorly and is not considered practical. Furthermore, we found that Multi-Head Attention can enhance model performance, particularly over shorter forecasting horizons. ...

May 14, 2025 · 2 min · Research Team

Beyond the Black Box: Interpretability of LLMs in Finance

Beyond the Black Box: Interpretability of LLMs in Finance ArXiv ID: 2505.24650 “View on arXiv” Authors: Hariom Tatsat, Ariye Shater Abstract Large Language Models (LLMs) exhibit remarkable capabilities across a spectrum of tasks in financial services, including report generation, chatbots, sentiment analysis, regulatory compliance, investment advisory, financial knowledge retrieval, and summarization. However, their intrinsic complexity and lack of transparency pose significant challenges, especially in the highly regulated financial sector, where interpretability, fairness, and accountability are critical. As far as we are aware, this paper presents the first application in the finance domain of understanding and utilizing the inner workings of LLMs through mechanistic interpretability, addressing the pressing need for transparency and control in AI systems. Mechanistic interpretability is the most intuitive and transparent way to understand LLM behavior by reverse-engineering their internal workings. By dissecting the activations and circuits within these models, it provides insights into how specific features or components influence predictions - making it possible not only to observe but also to modify model behavior. In this paper, we explore the theoretical aspects of mechanistic interpretability and demonstrate its practical relevance through a range of financial use cases and experiments, including applications in trading strategies, sentiment analysis, bias, and hallucination detection. While not yet widely adopted, mechanistic interpretability is expected to become increasingly vital as adoption of LLMs increases. Advanced interpretability tools can ensure AI systems remain ethical, transparent, and aligned with evolving financial regulations. In this paper, we have put special emphasis on how these techniques can help unlock interpretability requirements for regulatory and compliance purposes - addressing both current needs and anticipating future expectations from financial regulators globally. ...

May 14, 2025 · 2 min · Research Team

Fast Learning in Quantitative Finance with Extreme Learning Machine

Fast Learning in Quantitative Finance with Extreme Learning Machine ArXiv ID: 2505.09551 “View on arXiv” Authors: Liexin Cheng, Xue Cheng, Shuaiqiang Liu Abstract A critical factor in adopting machine learning for time-sensitive financial tasks is computational speed, including model training and inference. This paper demonstrates that a broad class of such problems, especially those previously addressed using deep neural networks, can be efficiently solved using single-layer neural networks without iterative gradient-based training. This is achieved through the extreme learning machine (ELM) framework. ELM utilizes a single-layer network with randomly initialized hidden nodes and output weights obtained via convex optimization, enabling rapid training and inference. We present various applications in both supervised and unsupervised learning settings, including option pricing, intraday return prediction, volatility surface fitting, and numerical solution of partial differential equations. Across these examples, ELM demonstrates notable improvements in computational efficiency while maintaining comparable accuracy and generalization compared to deep neural networks and classical machine learning methods. We also briefly discuss theoretical aspects of ELM implementation and its generalization capabilities. ...

May 14, 2025 · 2 min · Research Team

FLUXLAYER: High-Performance Design for Cross-chain Fragmented Liquidity

FLUXLAYER: High-Performance Design for Cross-chain Fragmented Liquidity ArXiv ID: 2505.09423 “View on arXiv” Authors: Xin Lao, Shiping Chen, Qin Wang Abstract Autonomous Market Makers (AMMs) rely on arbitrage to facilitate passive price updates. Liquidity fragmentation poses a complex challenge across different blockchain networks. This paper proposes FluxLayer, a solution to mitigate fragmented liquidity and capture the maximum extractable value (MEV) in a cross-chain environment. FluxLayer is a three-layer framework that integrates a settlement layer, an intent layer, and an under-collateralised leverage lending vault mechanism. Our evaluation demonstrates that FluxLayer can effectively enhance cross-chain MEV by capturing more arbitrage opportunities, reducing costs, and improving overall liquidity. ...

May 14, 2025 · 1 min · Research Team

Monte-Carlo Option Pricing in Quantum Parallel

Monte-Carlo Option Pricing in Quantum Parallel ArXiv ID: 2505.09459 “View on arXiv” Authors: Robert Scriba, Yuying Li, Jingbo B Wang Abstract Financial derivative pricing is a significant challenge in finance, involving the valuation of instruments like options based on underlying assets. While some cases have simple solutions, many require complex classical computational methods like Monte Carlo simulations and numerical techniques. However, as derivative complexities increase, these methods face limitations in computational power. Cases involving Non-Vanilla Basket pricing, American Options, and derivative portfolio risk analysis need extensive computations in higher-dimensional spaces, posing challenges for classical computers. Quantum computing presents a promising avenue by harnessing quantum superposition and entanglement, allowing the handling of high-dimensional spaces effectively. In this paper, we introduce a self-contained and all-encompassing quantum algorithm that operates without reliance on oracles or presumptions. More specifically, we develop an effective stochastic method for simulating exponentially many potential asset paths in quantum parallel, leading to a highly accurate final distribution of stock prices. Furthermore, we demonstrate how this algorithm can be extended to price more complex options and analyze risk within derivative portfolios. ...

May 14, 2025 · 2 min · Research Team

An Efficient Multi-scale Leverage Effect Estimator under Dependent Microstructure Noise

An Efficient Multi-scale Leverage Effect Estimator under Dependent Microstructure Noise ArXiv ID: 2505.08654 “View on arXiv” Authors: Ziyang Xiong, Zhao Chen, Christina Dan Wang Abstract Estimating the leverage effect from high-frequency data is vital but challenged by complex, dependent microstructure noise, often exhibiting non-Gaussian higher-order moments. This paper introduces a novel multi-scale framework for efficient and robust leverage effect estimation under such flexible noise structures. We develop two new estimators, the Subsampling-and-Averaging Leverage Effect (SALE) and the Multi-Scale Leverage Effect (MSLE), which adapt subsampling and multi-scale approaches holistically using a unique shifted window technique. This design simplifies the multi-scale estimation procedure and enhances noise robustness without requiring the pre-averaging approach. We establish central limit theorems and stable convergence, with MSLE achieving convergence rates of an optimal $n^{"-1/4"}$ and a near-optimal $n^{"-1/9"}$ for the noise-free and noisy settings, respectively. A cornerstone of our framework’s efficiency is a specifically designed MSLE weighting strategy that leverages covariance structures across scales. This significantly reduces asymptotic variance and, critically, yields substantially smaller finite-sample errors than existing methods under both noise-free and realistic noisy settings. Extensive simulations and empirical analyses confirm the superior efficiency, robustness, and practical advantages of our approach. ...

May 13, 2025 · 2 min · Research Team

Forecasting Intraday Volume in Equity Markets with Machine Learning

Forecasting Intraday Volume in Equity Markets with Machine Learning ArXiv ID: 2505.08180 “View on arXiv” Authors: Mihai Cucuringu, Kang Li, Chao Zhang Abstract This study focuses on forecasting intraday trading volumes, a crucial component for portfolio implementation, especially in high-frequency (HF) trading environments. Given the current scarcity of flexible methods in this area, we employ a suite of machine learning (ML) models enriched with numerous HF predictors to enhance the predictability of intraday trading volumes. Our findings reveal that intraday stock trading volume is highly predictable, especially with ML and considering commonality. Additionally, we assess the economic benefits of accurate volume forecasting through Volume Weighted Average Price (VWAP) strategies. The results demonstrate that precise intraday forecasting offers substantial advantages, providing valuable insights for traders to optimize their strategies. ...

May 13, 2025 · 2 min · Research Team

DeFi Liquidation Risk Modeling Using Geometric Brownian Motion

DeFi Liquidation Risk Modeling Using Geometric Brownian Motion ArXiv ID: 2505.08100 “View on arXiv” Authors: Timofei Belenko, Georgii Vosorov Abstract In this paper, we propose an analytical method to compute the collateral liquidation probability in decentralized finance (DeFi) stablecoin single-collateral lending. Our approach models the collateral exchange rate as a zero-drift geometric Brownian motion, and derives the probability of it crossing the liquidation threshold. Unlike most existing methods that rely on computationally intensive simulations such as Monte Carlo, our formula provides a lightweight, exact solution. This advancement offers a more efficient alternative for risk assessment in DeFi platforms. ...

May 12, 2025 · 2 min · Research Team

DELPHYNE: A Pre-Trained Model for General and Financial Time Series

DELPHYNE: A Pre-Trained Model for General and Financial Time Series ArXiv ID: 2506.06288 “View on arXiv” Authors: Xueying Ding, Aakriti Mittal, Achintya Gopal Abstract Time-series data is a vital modality within data science communities. This is particularly valuable in financial applications, where it helps in detecting patterns, understanding market behavior, and making informed decisions based on historical data. Recent advances in language modeling have led to the rise of time-series pre-trained models that are trained on vast collections of datasets and applied to diverse tasks across financial domains. However, across financial applications, existing time-series pre-trained models have not shown boosts in performance over simple finance benchmarks in both zero-shot and fine-tuning settings. This phenomenon occurs because of a i) lack of financial data within the pre-training stage, and ii) the negative transfer effect due to inherently different time-series patterns across domains. Furthermore, time-series data is continuous, noisy, and can be collected at varying frequencies and with varying lags across different variables, making this data more challenging to model than languages. To address the above problems, we introduce a Pre-trained MoDEL for FINance TimE-series (Delphyne). Delphyne achieves competitive performance to existing foundation and full-shot models with few fine-tuning steps on publicly available datasets, and also shows superior performances on various financial tasks. ...

May 12, 2025 · 2 min · Research Team

Revisiting the Excess Volatility Puzzle Through the Lens of the Chiarella Model

Revisiting the Excess Volatility Puzzle Through the Lens of the Chiarella Model ArXiv ID: 2505.07820 “View on arXiv” Authors: Jutta G. Kurth, Adam A. Majewski, Jean-Philippe Bouchaud Abstract We amend and extend the Chiarella model of financial markets to deal with arbitrary long-term value drifts in a consistent way. This allows us to improve upon existing calibration schemes, opening the possibility of calibrating individual monthly time series instead of classes of time series. The technique is employed on spot prices of four asset classes from ca. 1800 onward (stock indices, bonds, commodities, currencies). The so-called fundamental value is a direct output of the calibration, which allows us to (a) quantify the amount of excess volatility in these markets, which we find to be large (e.g. a factor $\approx$ 4 for stock indices) and consistent with previous estimates; and (b) determine the distribution of mispricings (i.e. the difference between market price and value), which we find in many cases to be bimodal. Both findings are strongly at odds with the Efficient Market Hypothesis. We also study in detail the ‘sloppiness’ of the calibration, that is, the directions in parameter space that are weakly constrained by data. The main conclusions of our study are remarkably consistent across different asset classes, and reinforce the hypothesis that the medium-term fate of financial markets is determined by a tug-of-war between trend followers and fundamentalists. ...

May 12, 2025 · 2 min · Research Team