false

Extracting the Structure of Press Releases for Predicting Earnings Announcement Returns

Extracting the Structure of Press Releases for Predicting Earnings Announcement Returns ArXiv ID: 2509.24254 “View on arXiv” Authors: Yuntao Wu, Ege Mert Akin, Charles Martineau, Vincent Grégoire, Andreas Veneris Abstract We examine how textual features in earnings press releases predict stock returns on earnings announcement days. Using over 138,000 press releases from 2005 to 2023, we compare traditional bag-of-words and BERT-based embeddings. We find that press release content (soft information) is as informative as earnings surprise (hard information), with FinBERT yielding the highest predictive power. Combining models enhances explanatory strength and interpretability of the content of press releases. Stock prices fully reflect the content of press releases at market open. If press releases are leaked, it offers predictive advantage. Topic analysis reveals self-serving bias in managerial narratives. Our framework supports real-time return prediction through the integration of online learning, provides interpretability and reveals the nuanced role of language in price formation. ...

September 29, 2025 · 2 min · Research Team

Language of Persuasion and Misrepresentation in Business Communication: A Textual Detection Approach

Language of Persuasion and Misrepresentation in Business Communication: A Textual Detection Approach ArXiv ID: 2508.09935 “View on arXiv” Authors: Sayem Hossen, Monalisa Moon Joti, Md. Golam Rashed Abstract Business communication digitisation has reorganised the process of persuasive discourse, which allows not only greater transparency but also advanced deception. This inquiry synthesises classical rhetoric and communication psychology with linguistic theory and empirical studies in the financial reporting, sustainability discourse, and digital marketing to explain how deceptive language can be systematically detected using persuasive lexicon. In controlled settings, detection accuracies of greater than 99% were achieved by using computational textual analysis as well as personalised transformer models. However, reproducing this performance in multilingual settings is also problematic and, to a large extent, this is because it is not easy to find sufficient data, and because few multilingual text-processing infrastructures are in place. This evidence shows that there has been an increasing gap between the theoretical representations of communication and those empirically approximated, and therefore, there is a need to have strong automatic text-identification systems where AI-based discourse is becoming more realistic in communicating with humans. ...

August 13, 2025 · 2 min · Research Team

HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization

HARLF: Hierarchical Reinforcement Learning and Lightweight LLM-Driven Sentiment Integration for Financial Portfolio Optimization ArXiv ID: 2507.18560 “View on arXiv” Authors: Benjamin Coriat, Eric Benhamou Abstract This paper presents a novel hierarchical framework for portfolio optimization, integrating lightweight Large Language Models (LLMs) with Deep Reinforcement Learning (DRL) to combine sentiment signals from financial news with traditional market indicators. Our three-tier architecture employs base RL agents to process hybrid data, meta-agents to aggregate their decisions, and a super-agent to merge decisions based on market data and sentiment analysis. Evaluated on data from 2018 to 2024, after training on 2000-2017, the framework achieves a 26% annualized return and a Sharpe ratio of 1.2, outperforming equal-weighted and S&P 500 benchmarks. Key contributions include scalable cross-modal integration, a hierarchical RL structure for enhanced stability, and open-source reproducibility. ...

July 24, 2025 · 2 min · Research Team

Optimal Trading under Instantaneous and Persistent Price Impact, Predictable Returns and Multiscale Stochastic Volatility

Optimal Trading under Instantaneous and Persistent Price Impact, Predictable Returns and Multiscale Stochastic Volatility ArXiv ID: 2507.17162 “View on arXiv” Authors: Patrick Chan, Ronnie Sircar, Iosif Zimbidis Abstract We consider a dynamic portfolio optimization problem that incorporates predictable returns, instantaneous transaction costs, price impact, and stochastic volatility, extending the classical results of Garleanu and Pedersen (2013), which assume constant volatility. Constructing the optimal portfolio strategy in this general setting is challenging due to the nonlinear nature of the resulting Hamilton-Jacobi-Bellman (HJB) equations. To address this, we propose a multi-scale volatility expansion that captures stochastic volatility dynamics across different time scales. Specifically, the analysis involves a singular perturbation for the fast mean-reverting volatility factor and a regular perturbation for the slow-moving factor. We also introduce an approximation for small price impact and demonstrate its numerical accuracy. We formally derive asymptotic approximations up to second order and use Monte Carlo simulations to show how incorporating these corrections improves the Profit and Loss (PnL) of the resulting portfolio strategy. ...

July 23, 2025 · 2 min · Research Team

Deep Learning Enhanced Multivariate GARCH

Deep Learning Enhanced Multivariate GARCH ArXiv ID: 2506.02796 “View on arXiv” Authors: Haoyuan Wang, Chen Liu, Minh-Ngoc Tran, Chao Wang Abstract This paper introduces a novel multivariate volatility modeling framework, named Long Short-Term Memory enhanced BEKK (LSTM-BEKK), that integrates deep learning into multivariate GARCH processes. By combining the flexibility of recurrent neural networks with the econometric structure of BEKK models, our approach is designed to better capture nonlinear, dynamic, and high-dimensional dependence structures in financial return data. The proposed model addresses key limitations of traditional multivariate GARCH-based methods, particularly in capturing persistent volatility clustering and asymmetric co-movement across assets. Leveraging the data-driven nature of LSTMs, the framework adapts effectively to time-varying market conditions, offering improved robustness and forecasting performance. Empirical results across multiple equity markets confirm that the LSTM-BEKK model achieves superior performance in terms of out-of-sample portfolio risk forecast, while maintaining the interpretability from the BEKK models. These findings highlight the potential of hybrid econometric-deep learning models in advancing financial risk management and multivariate volatility forecasting. ...

June 3, 2025 · 2 min · Research Team

Why is the volatility of single stocks so much rougher than that of the S&P500?

Why is the volatility of single stocks so much rougher than that of the S&P500? ArXiv ID: 2505.02678 “View on arXiv” Authors: Othmane Zarhali, Cecilia Aubrun, Emmanuel Bacry, Jean-Philippe Bouchaud, Jean-François Muzy Abstract The Nested factor model was introduced by Chicheportiche et al. to represent non-linear correlations between stocks. Stock returns are explained by a standard factor model, but the (log)-volatilities of factors and residuals are themselves decomposed into factor modes, with a common dominant volatility mode affecting both market and sector factors but also residuals. Here, we consider the case of a single factor where the only dominant log-volatility mode is rough, with a Hurst exponent $H \simeq 0.11$ and the log-volatility residuals are ‘‘super-rough’’ or ‘‘multifractal’’, with $H \simeq 0$. We demonstrate that such a construction naturally accounts for the somewhat surprising stylized fact reported by Wu et al. , where it has been observed that the Hurst exponents of stock indexes are large compared to those of individual stocks. We propose a statistical procedure to estimate the Hurst factor exponent from the stock returns dynamics together with theoretical guarantees of its consistency. We demonstrate the effectiveness of our approach through numerical experiments and apply it to daily stock data from the S&P500 index. The estimated roughness exponents for both the factor and idiosyncratic components validate the assumptions underlying our model. ...

May 5, 2025 · 2 min · Research Team

Latent Variable Estimation in Bayesian Black-Litterman Models

Latent Variable Estimation in Bayesian Black-Litterman Models ArXiv ID: 2505.02185 “View on arXiv” Authors: Thomas Y. L. Lin, Jerry Yao-Chieh Hu, Paul W. Chiou, Peter Lin Abstract We revisit the Bayesian Black-Litterman (BL) portfolio model and remove its reliance on subjective investor views. Classical BL requires an investor “view”: a forecast vector $q$ and its uncertainty matrix $Ω$ that describe how much a chosen portfolio should outperform the market. Our key idea is to treat $(q,Ω)$ as latent variables and learn them from market data within a single Bayesian network. Consequently, the resulting posterior estimation admits closed-form expression, enabling fast inference and stable portfolio weights. Building on these, we propose two mechanisms to capture how features interact with returns: shared-latent parametrization and feature-influenced views; both recover classical BL and Markowitz portfolios as special cases. Empirically, on 30-year Dow-Jones and 20-year sector-ETF data, we improve Sharpe ratios by 50% and cut turnover by 55% relative to Markowitz and the index baselines. This work turns BL into a fully data-driven, view-free, and coherent Bayesian framework for portfolio optimization. ...

May 4, 2025 · 2 min · Research Team

Financial resilience of agricultural and food production companies in Spain: A compositional cluster analysis of the impact of the Ukraine-Russia war (2021-2023)

Financial resilience of agricultural and food production companies in Spain: A compositional cluster analysis of the impact of the Ukraine-Russia war (2021-2023) ArXiv ID: 2504.05912 “View on arXiv” Authors: Unknown Abstract This study analyzes the financial resilience of agricultural and food production companies in Spain amid the Ukraine-Russia war using cluster analysis based on financial ratios. This research utilizes centered log-ratios to transform financial ratios for compositional data analysis. The dataset comprises financial information from 1197 firms in Spain’s agricultural and food sectors over the period 2021-2023. The analysis reveals distinct clusters of firms with varying financial performance, characterized by metrics of solvency and profitability. The results highlight an increase in resilient firms by 2023, underscoring sectoral adaptation to the conflict’s economic challenges. These findings together provide insights for stakeholders and policymakers to improve sectorial stability and strategic planning. ...

April 8, 2025 · 2 min · Research Team

Generating realistic metaorders from public data

Generating realistic metaorders from public data ArXiv ID: 2503.18199 “View on arXiv” Authors: Unknown Abstract This paper introduces a novel algorithm for generating realistic metaorders from public trade data, addressing a longstanding challenge in price impact research that has traditionally relied on proprietary datasets. Our method effectively recovers all established stylized facts of metaorders impact, such as the Square Root Law, the concave profile during metaorder execution, and the post-execution decay. This algorithm not only overcomes the dependence on proprietary data, a major barrier to research reproducibility, but also enables the creation of larger and more robust datasets that may increase the quality of empirical studies. Our findings strongly suggest that average realized short-term price impact is not due to information revelation (as in the Kyle framework) but has a mechanical origin which could explain the universality of the Square Root Law. ...

March 23, 2025 · 2 min · Research Team

Floating exercise boundaries for American options in time-inhomogeneous models

Floating exercise boundaries for American options in time-inhomogeneous models ArXiv ID: 2502.00740 “View on arXiv” Authors: Unknown Abstract This paper examines a semi-analytical approach for pricing American options in time-inhomogeneous models characterized by negative interest rates (for equity/FX) or negative convenience yields (for commodities/cryptocurrencies). Under such conditions, exercise boundaries may exhibit a “floating” structure - dynamically appearing and disappearing. For example, a second exercise boundary could emerge within the computational domain and subsequently both could collapse, demanding specialized pricing methodologies. ...

February 2, 2025 · 1 min · Research Team