{"title":"Fusing LLMs and KGs for Formal Causal Reasoning behind Financial Risk Contagion","authors":"Guanyuan Yu, Xv Wang, Qing Li, Yu Zhao","doi":"arxiv-2407.17190","DOIUrl":null,"url":null,"abstract":"Financial risks trend to spread from one entity to another, ultimately\nleading to systemic risks. The key to preventing such risks lies in\nunderstanding the causal chains behind risk contagion. Despite this, prevailing\napproaches primarily emphasize identifying risks, overlooking the underlying\ncausal analysis of risk. To address such an issue, we propose a Risk Contagion\nCausal Reasoning model called RC2R, which uses the logical reasoning\ncapabilities of large language models (LLMs) to dissect the causal mechanisms\nof risk contagion grounded in the factual and expert knowledge embedded within\nfinancial knowledge graphs (KGs). At the data level, we utilize financial KGs\nto construct causal instructions, empowering LLMs to perform formal causal\nreasoning on risk propagation and tackle the \"causal parrot\" problem of LLMs.\nIn terms of model architecture, we integrate a fusion module that aligns tokens\nand nodes across various granularities via multi-scale contrastive learning,\nfollowed by the amalgamation of textual and graph-structured data through soft\nprompt with cross multi-head attention mechanisms. To quantify risk contagion,\nwe introduce a risk pathway inference module for calculating risk scores for\neach node in the graph. Finally, we visualize the risk contagion pathways and\ntheir intensities using Sankey diagrams, providing detailed causal\nexplanations. Comprehensive experiments on financial KGs and supply chain\ndatasets demonstrate that our model outperforms several state-of-the-art models\nin prediction performance and out-of-distribution (OOD) generalization\ncapabilities. We will make our dataset and code publicly accessible to\nencourage further research and development in this field.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computational Engineering, Finance, and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.17190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Financial risks trend to spread from one entity to another, ultimately
leading to systemic risks. The key to preventing such risks lies in
understanding the causal chains behind risk contagion. Despite this, prevailing
approaches primarily emphasize identifying risks, overlooking the underlying
causal analysis of risk. To address such an issue, we propose a Risk Contagion
Causal Reasoning model called RC2R, which uses the logical reasoning
capabilities of large language models (LLMs) to dissect the causal mechanisms
of risk contagion grounded in the factual and expert knowledge embedded within
financial knowledge graphs (KGs). At the data level, we utilize financial KGs
to construct causal instructions, empowering LLMs to perform formal causal
reasoning on risk propagation and tackle the "causal parrot" problem of LLMs.
In terms of model architecture, we integrate a fusion module that aligns tokens
and nodes across various granularities via multi-scale contrastive learning,
followed by the amalgamation of textual and graph-structured data through soft
prompt with cross multi-head attention mechanisms. To quantify risk contagion,
we introduce a risk pathway inference module for calculating risk scores for
each node in the graph. Finally, we visualize the risk contagion pathways and
their intensities using Sankey diagrams, providing detailed causal
explanations. Comprehensive experiments on financial KGs and supply chain
datasets demonstrate that our model outperforms several state-of-the-art models
in prediction performance and out-of-distribution (OOD) generalization
capabilities. We will make our dataset and code publicly accessible to
encourage further research and development in this field.