Large language models in finance : what is financial sentiment?

ArXiv ID: 2503.03612 “View on arXiv”

Authors: Unknown

Abstract

Financial sentiment has become a crucial yet complex concept in finance, increasingly used in market forecasting and investment strategies. Despite its growing importance, there remains a need to define and understand what financial sentiment truly represents and how it can be effectively measured. We explore the nature of financial sentiment and investigate how large language models (LLMs) contribute to its estimation. We trace the evolution of sentiment measurement in finance, from market-based and lexicon-based methods to advanced natural language processing techniques. The emergence of LLMs has significantly enhanced sentiment analysis, providing deeper contextual understanding and greater accuracy in extracting sentiment from financial text. We examine how BERT-based models, such as RoBERTa and FinBERT, are optimized for structured sentiment classification, while GPT-based models, including GPT-4, OPT, and LLaMA, excel in financial text generation and real-time sentiment interpretation. A comparative analysis of bidirectional and autoregressive transformer architectures highlights their respective roles in investor sentiment analysis, algorithmic trading, and financial decision-making. By exploring what financial sentiment is and how it is estimated within LLMs, we provide insights into the growing role of AI-driven sentiment analysis in finance.

Keywords: Financial Sentiment Analysis, Large Language Models (LLM), Transformer Architecture, FinBERT, Algorithmic Trading, General Financial Assets

Complexity vs Empirical Score

  • Math Complexity: 3.5/10
  • Empirical Rigor: 4.0/10
  • Quadrant: Philosophers
  • Why: The paper reviews existing LLM architectures and sentiment constructs with minimal novel mathematical derivation, relying instead on comparative model descriptions. While it references prior empirical studies and trading strategies, the provided excerpt lacks specific code, dataset details, or implementation-focused backtesting metrics.
  flowchart TD
    A["Research Goal: Defining & Measuring<br>Financial Sentiment with LLMs"] --> B["Data/Inputs:<br>Financial Texts & Market Data"]
    
    B --> C["Evolving Methodology:<br>Market/Lexicon → NLP → LLMs"]
    
    C --> D["LLM Architectures & Analysis"]
    
    D --> E["BERT-based Models<br>RoBERTa / FinBERT<br>Structured Classification"]
    
    D --> F["GPT-based Models<br>GPT-4 / OPT / LLaMA<br>Text Generation & Real-time"]
    
    E --> G["Key Outcomes:<br>Enhanced Sentiment Accuracy<br>Contextual Understanding<br>Investor Analysis & Trading"]
    
    F --> G