Modelling crypto markets by multi-agent reinforcement learning

ArXiv ID: 2402.10803 “View on arXiv”

Authors: Unknown

Abstract

Building on a previous foundation work (Lussange et al. 2020), this study introduces a multi-agent reinforcement learning (MARL) model simulating crypto markets, which is calibrated to the Binance’s daily closing prices of $153$ cryptocurrencies that were continuously traded between 2018 and 2022. Unlike previous agent-based models (ABM) or multi-agent systems (MAS) which relied on zero-intelligence agents or single autonomous agent methodologies, our approach relies on endowing agents with reinforcement learning (RL) techniques in order to model crypto markets. This integration is designed to emulate, with a bottom-up approach to complexity inference, both individual and collective agents, ensuring robustness in the recent volatile conditions of such markets and during the COVID-19 era. A key feature of our model also lies in the fact that its autonomous agents perform asset price valuation based on two sources of information: the market prices themselves, and the approximation of the crypto assets fundamental values beyond what those market prices are. Our MAS calibration against real market data allows for an accurate emulation of crypto markets microstructure and probing key market behaviors, in both the bearish and bullish regimes of that particular time period.

Keywords: Multi-Agent Reinforcement Learning, Crypto Markets, Market Microstructure, Agent-Based Modeling, Asset Valuation, Cryptocurrencies

Complexity vs Empirical Score

  • Math Complexity: 7.5/10
  • Empirical Rigor: 8.0/10
  • Quadrant: Holy Grail
  • Why: The paper employs advanced multi-agent reinforcement learning (MARL) and complex agent-based modeling to simulate crypto market microstructure, indicating high mathematical density. It is empirically rigorous, being directly calibrated on real-world Binance data for 153 cryptocurrencies over a 5-year period (2018-2022), covering both bearish and bullish regimes.
  flowchart TD
    A["Research Goal<br>Model crypto markets with<br>Multi-Agent Reinforcement Learning"] --> B{"Key Methodology"}
    B --> B1["Data: 153 Cryptocurrencies<br>Binance Daily Closing Prices<br>2018-2022"]
    B --> B2["Agents: RL-equipped<br>vs. Zero-Intelligence/Single Agent"]
    B --> B3["Information Sources<br>Market Prices & Fundamental Values"]
    B --> B4["Calibration<br>Emulate Market Microstructure"]
    B --> B5["Regime Analysis<br>Bearish vs. Bullish<br>COVID-19 Era"]
    B --> C["Computational Process"]
    C --> C1["Bottom-up Complexity Inference"]
    C --> C2["Multi-Agent Simulation<br>Interaction of Autonomous Agents"]
    C --> C3["Robustness Check<br>Market Volatility"]
    C --> D{"Outcomes"}
    D --> D1["Accurate Emulation of Crypto Markets"]
    D --> D2["Insights into Market Microstructure"]
    D --> D3["Behavioral Probing across Regimes"]
    D --> D4["Validation of RL approach<br>vs. Traditional ABM/MAS"]