Narratives from GPT-derived Networks of News, and a link to Financial Markets Dislocations

ArXiv ID: 2311.14419 “View on arXiv”

Authors: Unknown

Abstract

Starting from a corpus of economic articles from The Wall Street Journal, we present a novel systematic way to analyse news content that evolves over time. We leverage on state-of-the-art natural language processing techniques (i.e. GPT3.5) to extract the most important entities of each article available, and aggregate co-occurrence of entities in a related graph at the weekly level. Network analysis techniques and fuzzy community detection are tested on the proposed set of graphs, and a framework is introduced that allows systematic but interpretable detection of topics and narratives. In parallel, we propose to consider the sentiment around main entities of an article as a more accurate proxy for the overall sentiment of such piece of text, and describe a case-study to motivate this choice. Finally, we design features that characterise the type and structure of news within each week, and map them to moments of financial markets dislocations. The latter are identified as dates with unusually high volatility across asset classes, and we find quantitative evidence that they relate to instances of high entropy in the high-dimensional space of interconnected news. This result further motivates the pursued efforts to provide a novel framework for the systematic analysis of narratives within news.

Keywords: natural language processing, graph analysis, topic detection, market volatility, news sentiment, Multi-Asset

Complexity vs Empirical Score

  • Math Complexity: 3.5/10
  • Empirical Rigor: 7.0/10
  • Quadrant: Street Traders
  • Why: The paper uses standard NLP and network analysis techniques (GPT extraction, graph theory, fuzzy clustering) with accessible formulas, lacking deep mathematical derivations. However, it demonstrates high empirical rigor with a substantial real-world dataset (Wall Street Journal, 4 years), multi-asset volatility indices, rolling z-score calculations, and a clear backtesting-ready framework for mapping news features to market dislocations.
  flowchart TD
    A["Research Goal:<br>Linking News Narratives to<br>Financial Market Dislocations"] --> B["Data Input:<br>Wall Street Journal<br>Economic Articles"]
    B --> C["Method: GPT3.5 Entity Extraction"]
    C --> D["Process: Weekly Co-occurrence<br>Graph Construction"]
    D --> E["Method: Fuzzy Community<br>Detection & Sentiment Analysis"]
    E --> F["Process: Feature Design<br>High Entropy & Topic Structure"]
    F --> G["Outcome: Quantitative Evidence<br>High News Entropy -><br>Market Volatility Dislocations"]