false

Technical Analysis Meets Machine Learning: Bitcoin Evidence

Technical Analysis Meets Machine Learning: Bitcoin Evidence ArXiv ID: 2511.00665 “View on arXiv” Authors: José Ángel Islas Anguiano, Andrés García-Medina Abstract In this note, we compare Bitcoin trading performance using two machine learning models-Light Gradient Boosting Machine (LightGBM) and Long Short-Term Memory (LSTM)-and two technical analysis-based strategies: Exponential Moving Average (EMA) crossover and a combination of Moving Average Convergence/Divergence with the Average Directional Index (MACD+ADX). The objective is to evaluate how trading signals can be used to maximize profits in the Bitcoin market. This comparison was motivated by the U.S. Securities and Exchange Commission’s (SEC) approval of the first spot Bitcoin exchange-traded funds (ETFs) on 2024-01-10. Our results show that the LSTM model achieved a cumulative return of approximately 65.23% in under a year, significantly outperforming LightGBM, the EMA and MACD+ADX strategies, as well as the baseline buy-and-hold. This study highlights the potential for deeper integration of machine learning and technical analysis in the rapidly evolving cryptocurrency landscape. ...

November 1, 2025 · 2 min · Research Team

Deep reinforcement learning for optimal trading with partial information

Deep reinforcement learning for optimal trading with partial information ArXiv ID: 2511.00190 “View on arXiv” Authors: Andrea Macrì, Sebastian Jaimungal, Fabrizio Lillo Abstract Reinforcement Learning (RL) applied to financial problems has been the subject of a lively area of research. The use of RL for optimal trading strategies that exploit latent information in the market is, to the best of our knowledge, not widely tackled. In this paper we study an optimal trading problem, where a trading signal follows an Ornstein-Uhlenbeck process with regime-switching dynamics. We employ a blend of RL and Recurrent Neural Networks (RNN) in order to make the most at extracting underlying information from the trading signal with latent parameters. The latent parameters driving mean reversion, speed, and volatility are filtered from observations of the signal, and trading strategies are derived via RL. To address this problem, we propose three Deep Deterministic Policy Gradient (DDPG)-based algorithms that integrate Gated Recurrent Unit (GRU) networks to capture temporal dependencies in the signal. The first, a one -step approach (hid-DDPG), directly encodes hidden states from the GRU into the RL trader. The second and third are two-step methods: one (prob-DDPG) makes use of posterior regime probability estimates, while the other (reg-DDPG) relies on forecasts of the next signal value. Through extensive simulations with increasingly complex Markovian regime dynamics for the trading signal’s parameters, as well as an empirical application to equity pair trading, we find that prob-DDPG achieves superior cumulative rewards and exhibits more interpretable strategies. By contrast, reg-DDPG provides limited benefits, while hid-DDPG offers intermediate performance with less interpretable strategies. Our results show that the quality and structure of the information supplied to the agent are crucial: embedding probabilistic insights into latent regimes substantially improves both profitability and robustness of reinforcement learning-based trading strategies. ...

October 31, 2025 · 3 min · Research Team

Supervised Similarity for Firm Linkages

Supervised Similarity for Firm Linkages ArXiv ID: 2506.19856 “View on arXiv” Authors: Ryan Samson, Adrian Banner, Luca Candelori, Sebastien Cottrell, Tiziana Di Matteo, Paul Duchnowski, Vahagn Kirakosyan, Jose Marques, Kharen Musaelian, Stefano Pasquali, Ryan Stever, Dario Villani Abstract We introduce a novel proxy for firm linkages, Characteristic Vector Linkages (CVLs). We use this concept to estimate firm linkages, first through Euclidean similarity, and then by applying Quantum Cognition Machine Learning (QCML) to similarity learning. We demonstrate that both methods can be used to construct profitable momentum spillover trading strategies, but QCML similarity outperforms the simpler Euclidean similarity. ...

June 9, 2025 · 1 min · Research Team

Risk-aware Trading Portfolio Optimization

Risk-aware Trading Portfolio Optimization ArXiv ID: 2503.04662 “View on arXiv” Authors: Unknown Abstract We investigate portfolio optimization in financial markets from a trading and risk management perspective. We term this task Risk-Aware Trading Portfolio Optimization (RATPO), formulate the corresponding optimization problem, and propose an efficient Risk-Aware Trading Swarm (RATS) algorithm to solve it. The key elements of RATPO are a generic initial portfolio P, a specific set of Unique Eligible Instruments (UEIs), their combination into an Eligible Optimization Strategy (EOS), an objective function, and a set of constraints. RATS searches for an optimal EOS that, added to P, improves the objective function repecting the constraints. RATS is a specialized Particle Swarm Optimization method that leverages the parameterization of P in terms of UEIs, enables parallel computation with a large number of particles, and is fully general with respect to specific choices of the key elements, which can be customized to encode financial knowledge and needs of traders and risk managers. We showcase two RATPO applications involving a real trading portfolio made of hundreds of different financial instruments, an objective function combining both market risk (VaR) and profit&loss measures, constrains on market sensitivities and UEIs trading costs. In the case of small-sized EOS, RATS successfully identifies the optimal solution and demonstrates robustness with respect to hyper-parameters tuning. In the case of large-sized EOS, RATS markedly improves the portfolio objective value, optimizing risk and capital charge while respecting risk limits and preserving expected profits. Our work bridges the gap between the implementation of effective trading strategies and compliance with stringent regulatory and economic capital requirements, allowing a better alignment of business and risk management objectives. ...

March 6, 2025 · 2 min · Research Team

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies ArXiv ID: 2502.17518 “View on arXiv” Authors: Unknown Abstract This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our results demonstrate that ensemble methods consistently outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, we identify the sensitivity of ensemble performance to the choice of variance threshold τ, highlighting the importance of dynamic τ adjustment to achieve optimal performance. This study emphasizes the value of combining RL with classifiers for adaptive decision-making, with implications for financial trading, robotics, and other dynamic environments. ...

February 23, 2025 · 2 min · Research Team

Calculating Profits and Losses for Algorithmic Trading Strategies: A Short Guide

Calculating Profits and Losses for Algorithmic Trading Strategies: A Short Guide ArXiv ID: 2411.14068 “View on arXiv” Authors: Unknown Abstract We present a series of equations that track the total realized and unrealized profits and losses at any time, incorporating the spread. The resulting formalism is ideally suited to evaluate the performance of trading model algorithms. Keywords: realized profit/loss, unrealized profit/loss, spread, trading algorithms, performance evaluation, Trading Strategies Complexity vs Empirical Score Math Complexity: 3.5/10 Empirical Rigor: 2.0/10 Quadrant: Philosophers Why: The paper presents a series of algebraic equations to formalize profit and loss calculations, which is moderately math-intensive but lacks the deep stochastic calculus or advanced statistics often seen in quant finance research. Empirically, it is a theoretical guide with illustrative examples but no backtested performance, real-world datasets, or implementation code. flowchart TD A["Research Goal: Develop<br>algorithms to track<br>realized & unrealized PnL"] --> B["Key Methodology: Mathematical Formalism"] B --> C["Data/Inputs: Trades, Prices, Spread"] C --> D["Computational Process:<br>Equations for PnL Calculation"] D --> E["Key Findings: Robust<br>Performance Evaluation"]

November 21, 2024 · 1 min · Research Team

Reinforcement Learning Framework for Quantitative Trading

Reinforcement Learning Framework for Quantitative Trading ArXiv ID: 2411.07585 “View on arXiv” Authors: Unknown Abstract The inherent volatility and dynamic fluctuations within the financial stock market underscore the necessity for investors to employ a comprehensive and reliable approach that integrates risk management strategies, market trends, and the movement trends of individual securities. By evaluating specific data, investors can make more informed decisions. However, the current body of literature lacks substantial evidence supporting the practical efficacy of reinforcement learning (RL) agents, as many models have only demonstrated success in back testing using historical data. This highlights the urgent need for a more advanced methodology capable of addressing these challenges. There is a significant disconnect in the effective utilization of financial indicators to better understand the potential market trends of individual securities. The disclosure of successful trading strategies is often restricted within financial markets, resulting in a scarcity of widely documented and published strategies leveraging RL. Furthermore, current research frequently overlooks the identification of financial indicators correlated with various market trends and their potential advantages. This research endeavors to address these complexities by enhancing the ability of RL agents to effectively differentiate between positive and negative buy/sell actions using financial indicators. While we do not address all concerns, this paper provides deeper insights and commentary on the utilization of technical indicators and their benefits within reinforcement learning. This work establishes a foundational framework for further exploration and investigation of more complex scenarios. ...

November 12, 2024 · 2 min · Research Team

A Review of Reinforcement Learning in Financial Applications

A Review of Reinforcement Learning in Financial Applications ArXiv ID: 2411.12746 “View on arXiv” Authors: Unknown Abstract In recent years, there has been a growing trend of applying Reinforcement Learning (RL) in financial applications. This approach has shown great potential to solve decision-making tasks in finance. In this survey, we present a comprehensive study of the applications of RL in finance and conduct a series of meta-analyses to investigate the common themes in the literature, such as the factors that most significantly affect RL’s performance compared to traditional methods. Moreover, we identify challenges including explainability, Markov Decision Process (MDP) modeling, and robustness that hinder the broader utilization of RL in the financial industry and discuss recent advancements in overcoming these challenges. Finally, we propose future research directions, such as benchmarking, contextual RL, multi-agent RL, and model-based RL to address these challenges and to further enhance the implementation of RL in finance. ...

November 1, 2024 · 2 min · Research Team

Strong denoising of financial time-series

Strong denoising of financial time-series ArXiv ID: 2408.05690 “View on arXiv” Authors: Unknown Abstract In this paper we introduce a method for significantly improving the signal to noise ratio in financial data. The approach relies on combining a target variable with different context variables and use auto-encoders (AEs) to learn reconstructions of the combined inputs. The objective is to obtain agreement among pairs of AEs which are trained on related but different inputs and for which they are forced to find common ground. The training process is set up as a “conversation” where the models take turns at producing a prediction (speaking) and reconciling own predictions with the output of the other AE (listening), until an agreement is reached. This leads to a new way of constraining the complexity of the data representation generated by the AE. Unlike standard regularization whose strength needs to be decided by the designer, the proposed mutual regularization uses the partner network to detect and amend the lack of generality of the learned representation of the data. The integration of alternative perspectives enhances the de-noising capacity of a single AE and allows us to discover new regularities in financial time-series which can be converted into profitable trading strategies. ...

August 11, 2024 · 2 min · Research Team

Dynamic Time Warping for Lead-Lag Relationships in Lagged Multi-Factor Models

Dynamic Time Warping for Lead-Lag Relationships in Lagged Multi-Factor Models ArXiv ID: 2309.08800 “View on arXiv” Authors: Unknown Abstract In multivariate time series systems, lead-lag relationships reveal dependencies between time series when they are shifted in time relative to each other. Uncovering such relationships is valuable in downstream tasks, such as control, forecasting, and clustering. By understanding the temporal dependencies between different time series, one can better comprehend the complex interactions and patterns within the system. We develop a cluster-driven methodology based on dynamic time warping for robust detection of lead-lag relationships in lagged multi-factor models. We establish connections to the multireference alignment problem for both the homogeneous and heterogeneous settings. Since multivariate time series are ubiquitous in a wide range of domains, we demonstrate that our algorithm is able to robustly detect lead-lag relationships in financial markets, which can be subsequently leveraged in trading strategies with significant economic benefits. ...

September 15, 2023 · 2 min · Research Team