{"title":"审计意见预测的双模型协同:一种协同LLM代理框架方法","authors":"Yi Lu, Jinxing Hao, Xuesong Tang","doi":"10.1016/j.iref.2025.104642","DOIUrl":null,"url":null,"abstract":"<div><div>By combining the capabilities of Moonshot and DeepSeek-R1, which respectively evaluate risk scores based on MD&A text information and financial data, we exploit the complementary strengths of long-context and reasoning large language models (LLMs) to help evaluate material misstatement risks in audit opinions. Our results suggest that both MD&A-based and financial-based risk evaluations effectively distinguish qualified and unqualified audit opinions, but combining them yields the best performance. In addition to the interpretative analysis text output, the combined LLM evaluation consistently outperforms logistic regression prediction that incorporates all indicators, and it achieves comparable performance compared to the sophisticated machine learning methods like gradient boosting regression and random forests.</div><div>Further analysis reveals that LLMs excel in high-risk scenarios: in firms with 1) high financial constraints, 2) low internal controls, 3) low audit quality, 4) low readability, or 5) negative tone of MD&A texts. A topic model analysis has shown clear difference in the MD&A emphasis for firms the qualified and unqualified opinions given by our framework. These findings shed light on the potential role of the collaborative LLM agent framework as a tool to help auditors and investors detect financial fraud.</div></div>","PeriodicalId":14444,"journal":{"name":"International Review of Economics & Finance","volume":"104 ","pages":"Article 104642"},"PeriodicalIF":5.6000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual-model synergy for audit opinion prediction: A collaborative LLM agent framework approach\",\"authors\":\"Yi Lu, Jinxing Hao, Xuesong Tang\",\"doi\":\"10.1016/j.iref.2025.104642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>By combining the capabilities of Moonshot and DeepSeek-R1, which respectively evaluate risk scores based on MD&A text information and financial data, we exploit the complementary strengths of long-context and reasoning large language models (LLMs) to help evaluate material misstatement risks in audit opinions. Our results suggest that both MD&A-based and financial-based risk evaluations effectively distinguish qualified and unqualified audit opinions, but combining them yields the best performance. In addition to the interpretative analysis text output, the combined LLM evaluation consistently outperforms logistic regression prediction that incorporates all indicators, and it achieves comparable performance compared to the sophisticated machine learning methods like gradient boosting regression and random forests.</div><div>Further analysis reveals that LLMs excel in high-risk scenarios: in firms with 1) high financial constraints, 2) low internal controls, 3) low audit quality, 4) low readability, or 5) negative tone of MD&A texts. A topic model analysis has shown clear difference in the MD&A emphasis for firms the qualified and unqualified opinions given by our framework. These findings shed light on the potential role of the collaborative LLM agent framework as a tool to help auditors and investors detect financial fraud.</div></div>\",\"PeriodicalId\":14444,\"journal\":{\"name\":\"International Review of Economics & Finance\",\"volume\":\"104 \",\"pages\":\"Article 104642\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Review of Economics & Finance\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1059056025008056\",\"RegionNum\":2,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Economics & Finance","FirstCategoryId":"96","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1059056025008056","RegionNum":2,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
Dual-model synergy for audit opinion prediction: A collaborative LLM agent framework approach
By combining the capabilities of Moonshot and DeepSeek-R1, which respectively evaluate risk scores based on MD&A text information and financial data, we exploit the complementary strengths of long-context and reasoning large language models (LLMs) to help evaluate material misstatement risks in audit opinions. Our results suggest that both MD&A-based and financial-based risk evaluations effectively distinguish qualified and unqualified audit opinions, but combining them yields the best performance. In addition to the interpretative analysis text output, the combined LLM evaluation consistently outperforms logistic regression prediction that incorporates all indicators, and it achieves comparable performance compared to the sophisticated machine learning methods like gradient boosting regression and random forests.
Further analysis reveals that LLMs excel in high-risk scenarios: in firms with 1) high financial constraints, 2) low internal controls, 3) low audit quality, 4) low readability, or 5) negative tone of MD&A texts. A topic model analysis has shown clear difference in the MD&A emphasis for firms the qualified and unqualified opinions given by our framework. These findings shed light on the potential role of the collaborative LLM agent framework as a tool to help auditors and investors detect financial fraud.
期刊介绍:
The International Review of Economics & Finance (IREF) is a scholarly journal devoted to the publication of high quality theoretical and empirical articles in all areas of international economics, macroeconomics and financial economics. Contributions that facilitate the communications between the real and the financial sectors of the economy are of particular interest.