Multimodal Insights into Credit Risk Modelling: Integrating Climate and Text Data for Default Prediction

ArXiv ID: 2601.00478 “View on arXiv”

Authors: Zongxiao Wu, Ran Liu, Jiang Dai, Dan Luo

Abstract

Credit risk assessment increasingly relies on diverse sources of information beyond traditional structured financial data, particularly for micro and small enterprises (mSEs) with limited financial histories. This study proposes a multimodal framework that integrates structured credit variables, climate panel data, and unstructured textual narratives within a unified learning architecture. Specifically, we use long short-term memory (LSTM), the gated recurrent unit (GRU), and transformer models to analyse the interplay between these data modalities. The empirical results demonstrate that unimodal models based on climate or text data outperform those relying solely on structured data, while the integration of multiple data modalities yields significant improvements in credit default prediction. Using SHAP-based explainability methods, we find that physical climate risks play an important role in default prediction, with water-logging by rain emerging as the most influential factor. Overall, this study demonstrates the potential of multimodal approaches in AI-enabled decision-making, which provides robust tools for credit risk assessment while contributing to the broader integration of environmental and textual insights into predictive analytics.

Keywords: Multimodal Framework, Credit Risk Assessment, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), SHAP Explainability, Credit

Complexity vs Empirical Score

  • Math Complexity: 4.5/10
  • Empirical Rigor: 7.5/10
  • Quadrant: Street Traders
  • Why: The paper employs established deep learning architectures (LSTM, GRU, Transformer) with SHAP for explainability, representing moderate mathematical complexity, but the focus is heavily on a specific empirical dataset and backtesting methodology.
  flowchart TD
    A["Research Goal<br>Integrate climate & text data<br>for mSE default prediction"] --> B
    
    subgraph B ["Data Inputs"]
        B1["Structured Credit Data"]
        B2["Climate Panel Data<br>Physical Risks e.g., Water-logging"]
        B3["Unstructured Textual Narratives"]
    end

    B --> C["Computational Framework<br>Multimodal Learning Architecture"]
    
    C --> D{"Deep Learning Models"}
    D --> D1["LSTM"]
    D --> D2["GRU"]
    D --> D3["Transformer"]
    
    D --> E["Analysis & Explainability"]
    E --> F["Key Findings & Outcomes"]
    
    subgraph F ["Outcomes"]
        F1["Text & Climate models<br>outperform Structured alone"]
        F2["Multimodal integration<br>yields significant improvement"]
        F3["SHAP Analysis:<br>Physical climate risks crucial<br>Water-logging = Top factor"]
    end