false

Return Prediction for Mean-Variance Portfolio Selection: How Decision-Focused Learning Shapes Forecasting Models

Return Prediction for Mean-Variance Portfolio Selection: How Decision-Focused Learning Shapes Forecasting Models ArXiv ID: 2409.09684 “View on arXiv” Authors: Unknown Abstract Markowitz laid the foundation of portfolio theory through the mean-variance optimization (MVO) framework. However, the effectiveness of MVO is contingent on the precise estimation of expected returns, variances, and covariances of asset returns, which are typically uncertain. Machine learning models are becoming useful in estimating uncertain parameters, and such models are trained to minimize prediction errors, such as mean squared errors (MSE), which treat prediction errors uniformly across assets. Recent studies have pointed out that this approach would lead to suboptimal decisions and proposed Decision-Focused Learning (DFL) as a solution, integrating prediction and optimization to improve decision-making outcomes. While studies have shown DFL’s potential to enhance portfolio performance, the detailed mechanisms of how DFL modifies prediction models for MVO remain unexplored. This study investigates how DFL adjusts stock return prediction models to optimize decisions in MVO. Theoretically, we show that DFL’s gradient can be interpreted as tilting the MSE-based prediction errors by the inverse covariance matrix, effectively incorporating inter-asset correlations into the learning process, while MSE treats each asset’s error independently. This tilting mechanism leads to systematic prediction biases where DFL overestimates returns for assets included in portfolios while underestimating excluded assets. Our findings reveal why DFL achieves superior portfolio performance despite higher prediction errors. The strategic biases are features, not flaws. ...

September 15, 2024 · 2 min · Research Team

Disentangling the sources of cyber risk premia

Disentangling the sources of cyber risk premia ArXiv ID: 2409.08728 “View on arXiv” Authors: Unknown Abstract We use a methodology based on a machine learning algorithm to quantify firms’ cyber risks based on their disclosures and a dedicated cyber corpus. The model can identify paragraphs related to determined cyber-threat types and accordingly attribute several related cyber scores to the firm. The cyber scores are unrelated to other firms’ characteristics. Stocks with high cyber scores significantly outperform other stocks. The long-short cyber risk factors have positive risk premia, are robust to all factors’ benchmarks, and help price returns. Furthermore, we suggest the market does not distinguish between different types of cyber risks but instead views them as a single, aggregate cyber risk. ...

September 13, 2024 · 2 min · Research Team

Tuning into Climate Risks: Extracting Innovation from Television News for Clean Energy Firms

Tuning into Climate Risks: Extracting Innovation from Television News for Clean Energy Firms ArXiv ID: 2409.08701 “View on arXiv” Authors: Unknown Abstract This article develops multiple novel climate risk measures (or variables) based on the television news coverage by Bloomberg, CNBC, and Fox Business, and examines how they affect the systematic and idiosyncratic risks of clean energy firms in the United States. The measures are built on climate related keywords and cover the volume of coverage, type of coverage (climate crisis, renewable energy, and government & human initiatives), and media sentiments. We show that an increase in the aggregate measure of climate risk, as indicated by coverage volume, reduces idiosyncratic risk while increasing systematic risk. When climate risk is segregated, we find that systematic risk is positively affected by the physical risk of climate crises and transition risk from government & human initiatives, but no such impact is evident for idiosyncratic risk. Additionally, we observe an asymmetry in risk behavior: negative sentiments tend to decrease idiosyncratic risk and increase systematic risk, while positive sentiments have no significant impact. These findings remain robust to including print media and climate policy uncertainty variables, though some deviations are noted during the COVID-19 period. ...

September 13, 2024 · 2 min · Research Team

Market information of the fractional stochastic regularity model

Market information of the fractional stochastic regularity model ArXiv ID: 2409.07159 “View on arXiv” Authors: Unknown Abstract The Fractional Stochastic Regularity Model (FSRM) is an extension of Black-Scholes model describing the multifractal nature of prices. It is based on a multifractional process with a random Hurst exponent $H_t$, driven by a fractional Ornstein-Uhlenbeck (fOU) process. When the regularity parameter $H_t$ is equal to $1/2$, the efficient market hypothesis holds, but when $H_t\neq 1/2$ past price returns contain some information on a future trend or mean-reversion of the log-price process. In this paper, we investigate some properties of the fOU process and, thanks to information theory and Shannon’s entropy, we determine theoretically the serial information of the regularity process $H_t$ of the FSRM, giving some insight into one’s ability to forecast future price increments and to build statistical arbitrages with this model. ...

September 11, 2024 · 2 min · Research Team

Automate Strategy Finding with LLM in Quant Investment

Automate Strategy Finding with LLM in Quant Investment ArXiv ID: 2409.06289 “View on arXiv” Authors: Unknown Abstract We present a novel three-stage framework leveraging Large Language Models (LLMs) within a risk-aware multi-agent system for automate strategy finding in quantitative finance. Our approach addresses the brittleness of traditional deep learning models in financial applications by: employing prompt-engineered LLMs to generate executable alpha factor candidates across diverse financial data, implementing multimodal agent-based evaluation that filters factors based on market status, predictive quality while maintaining category balance, and deploying dynamic weight optimization that adapts to market conditions. Experimental results demonstrate the robust performance of the strategy in Chinese & US market regimes compared to established benchmarks. Our work extends LLMs capabilities to quantitative trading, providing a scalable architecture for financial signal extraction and portfolio construction. The overall framework significantly outperforms all benchmarks with 53.17% cumulative return on SSE50 (Jan 2023 to Jan 2024), demonstrating superior risk-adjusted performance and downside protection on the market. ...

September 10, 2024 · 2 min · Research Team

Limit Order Book Simulation and Trade Evaluation with $K$-Nearest-Neighbor Resampling

Limit Order Book Simulation and Trade Evaluation with $K$-Nearest-Neighbor Resampling ArXiv ID: 2409.06514 “View on arXiv” Authors: Unknown Abstract In this paper, we show how $K$-nearest neighbor ($K$-NN) resampling, an off-policy evaluation method proposed in \cite{“giegrich2023k”}, can be applied to simulate limit order book (LOB) markets and how it can be used to evaluate and calibrate trading strategies. Using historical LOB data, we demonstrate that our simulation method is capable of recreating realistic LOB dynamics and that synthetic trading within the simulation leads to a market impact in line with the corresponding literature. Compared to other statistical LOB simulation methods, our algorithm has theoretical convergence guarantees under general conditions, does not require optimization, is easy to implement and computationally efficient. Furthermore, we show that in a benchmark comparison our method outperforms a deep learning-based algorithm for several key statistics. In the context of a LOB with pro-rata type matching, we demonstrate how our algorithm can calibrate the size of limit orders for a liquidation strategy. Finally, we describe how $K$-NN resampling can be modified for choices of higher dimensional state spaces. ...

September 10, 2024 · 2 min · Research Team

MANA-Net: Mitigating Aggregated Sentiment Homogenization with News Weighting for Enhanced Market Prediction

MANA-Net: Mitigating Aggregated Sentiment Homogenization with News Weighting for Enhanced Market Prediction ArXiv ID: 2409.05698 “View on arXiv” Authors: Unknown Abstract It is widely acknowledged that extracting market sentiments from news data benefits market predictions. However, existing methods of using financial sentiments remain simplistic, relying on equal-weight and static aggregation to manage sentiments from multiple news items. This leads to a critical issue termed ``Aggregated Sentiment Homogenization’’, which has been explored through our analysis of a large financial news dataset from industry practice. This phenomenon occurs when aggregating numerous sentiments, causing representations to converge towards the mean values of sentiment distributions and thereby smoothing out unique and important information. Consequently, the aggregated sentiment representations lose much predictive value of news data. To address this problem, we introduce the Market Attention-weighted News Aggregation Network (MANA-Net), a novel method that leverages a dynamic market-news attention mechanism to aggregate news sentiments for market prediction. MANA-Net learns the relevance of news sentiments to price changes and assigns varying weights to individual news items. By integrating the news aggregation step into the networks for market prediction, MANA-Net allows for trainable sentiment representations that are optimized directly for prediction. We evaluate MANA-Net using the S&P 500 and NASDAQ 100 indices, along with financial news spanning from 2003 to 2018. Experimental results demonstrate that MANA-Net outperforms various recent market prediction methods, enhancing Profit & Loss by 1.1% and the daily Sharpe ratio by 0.252. ...

September 9, 2024 · 2 min · Research Team

Bellwether Trades: Characteristics of Trades influential in Predicting Future Price Movements in Markets

Bellwether Trades: Characteristics of Trades influential in Predicting Future Price Movements in Markets ArXiv ID: 2409.05192 “View on arXiv” Authors: Unknown Abstract In this study, we leverage powerful non-linear machine learning methods to identify the characteristics of trades that contain valuable information. First, we demonstrate the effectiveness of our optimized neural network predictor in accurately predicting future market movements. Then, we utilize the information from this successful neural network predictor to pinpoint the individual trades within each data point (trading window) that had the most impact on the optimized neural network’s prediction of future price movements. This approach helps us uncover important insights about the heterogeneity in information content provided by trades of different sizes, venues, trading contexts, and over time. ...

September 8, 2024 · 2 min · Research Team

Fundamental properties of linear factor models

Fundamental properties of linear factor models ArXiv ID: 2409.02521 “View on arXiv” Authors: Unknown Abstract We study conditional linear factor models in the context of asset pricing panels. Our analysis focuses on conditional means and covariances to characterize the cross-sectional and inter-temporal properties of returns and factors as well as their interrelationships. We also review the conditions outlined in Kozak and Nagel (2024) and show how the conditional mean-variance efficient portfolio of an unbalanced panel can be spanned by low-dimensional factor portfolios, even without assuming invertibility of the conditional covariance matrices. Our analysis provides a comprehensive foundation for the specification and estimation of conditional linear factor models. ...

September 4, 2024 · 2 min · Research Team

Leveraging RNNs and LSTMs for Synchronization Analysis in the Indian Stock Market: A Threshold-Based Classification Approach

Leveraging RNNs and LSTMs for Synchronization Analysis in the Indian Stock Market: A Threshold-Based Classification Approach ArXiv ID: 2409.06728 “View on arXiv” Authors: Unknown Abstract Our research presents a new approach for forecasting the synchronization of stock prices using machine learning and non-linear time-series analysis. To capture the complex non-linear relationships between stock prices, we utilize recurrence plots (RP) and cross-recurrence quantification analysis (CRQA). By transforming Cross Recurrence Plot (CRP) data into a time-series format, we enable the use of Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM) networks for predicting stock price synchronization through both regression and classification. We apply this methodology to a dataset of 20 highly capitalized stocks from the Indian market over a 21-year period. The findings reveal that our approach can predict stock price synchronization, with an accuracy of 0.98 and F1 score of 0.83 offering valuable insights for developing effective trading strategies and risk management tools. ...

August 27, 2024 · 2 min · Research Team