Accounting fraud detection using contextual language learning

IF 6 3区管理学 Q2 BUSINESS

International Journal of Accounting Information Systems Pub Date : 2024-03-11 DOI:10.1016/j.accinf.2024.100682

Indranil Bhattacharya, Ana Mickovic

{"title":"Accounting fraud detection using contextual language learning","authors":"Indranil Bhattacharya, Ana Mickovic","doi":"10.1016/j.accinf.2024.100682","DOIUrl":null,"url":null,"abstract":"<div><p>Accounting fraud is a widespread problem that causes significant damage in the economic market. Detection and investigation of fraudulent firms require a large amount of time, money, and effort for corporate monitors and regulators. In this study, we explore how textual contents from financial reports help in detecting accounting fraud. Pre-trained contextual language learning models, such as BERT, have significantly advanced natural language processing in recent years. We fine-tune the BERT model on Management Discussion and Analysis (MD&A) sections of annual 10-K reports from the Securities and Exchange Commission (SEC) database. Our final model outperforms the textual benchmark model and the quantitative benchmark model from the previous literature by 15% and 12%, respectively. Further, our model identifies five times more fraudulent firm-year observations than the textual benchmark by investigating the same number of firms, and three times more than the quantitative benchmark. Optimizing this investigation process, where more fraudulent observations are detected in the same size of the investigation sample, would be of great economic significance for regulators, investors, financial analysts, and auditors.</p></div>","PeriodicalId":47170,"journal":{"name":"International Journal of Accounting Information Systems","volume":"53 ","pages":"Article 100682"},"PeriodicalIF":6.0000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1467089524000150/pdfft?md5=616e550aaf8ef152e5e5bc126c9c0fc5&pid=1-s2.0-S1467089524000150-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Accounting Information Systems","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1467089524000150","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS","Score":null,"Total":0}

引用次数: 0

Abstract

Accounting fraud is a widespread problem that causes significant damage in the economic market. Detection and investigation of fraudulent firms require a large amount of time, money, and effort for corporate monitors and regulators. In this study, we explore how textual contents from financial reports help in detecting accounting fraud. Pre-trained contextual language learning models, such as BERT, have significantly advanced natural language processing in recent years. We fine-tune the BERT model on Management Discussion and Analysis (MD&A) sections of annual 10-K reports from the Securities and Exchange Commission (SEC) database. Our final model outperforms the textual benchmark model and the quantitative benchmark model from the previous literature by 15% and 12%, respectively. Further, our model identifies five times more fraudulent firm-year observations than the textual benchmark by investigating the same number of firms, and three times more than the quantitative benchmark. Optimizing this investigation process, where more fraudulent observations are detected in the same size of the investigation sample, would be of great economic significance for regulators, investors, financial analysts, and auditors.

查看原文本刊更多论文

利用语境语言学习检测会计欺诈

会计欺诈是一个普遍存在的问题，给经济市场造成了重大损失。对舞弊公司的发现和调查需要企业监督者和监管者花费大量的时间、金钱和精力。在本研究中，我们探讨了财务报告中的文本内容如何帮助检测会计欺诈。近年来，预先训练的语境语言学习模型（如 BERT）极大地推动了自然语言处理的发展。我们根据美国证券交易委员会（SEC）数据库中 10-K 年度报告中的管理讨论与分析（MD&A）部分对 BERT 模型进行了微调。我们的最终模型比以往文献中的文本基准模型和定量基准模型分别高出 15% 和 12%。此外，通过调查相同数量的公司，我们的模型识别出的欺诈性公司年度观察结果是文本基准模型的五倍，是定量基准模型的三倍。优化这一调查过程，即在相同规模的调查样本中发现更多的欺诈性观察结果，对于监管机构、投资者、金融分析师和审计师来说具有重大的经济意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Accounting Information Systems Multiple-

CiteScore

9.00

自引率

6.50%

发文量

期刊介绍： The International Journal of Accounting Information Systems will publish thoughtful, well developed articles that examine the rapidly evolving relationship between accounting and information technology. Articles may range from empirical to analytical, from practice-based to the development of new techniques, but must be related to problems facing the integration of accounting and information technology. The journal will address (but will not limit itself to) the following specific issues: control and auditability of information systems; management of information technology; artificial intelligence research in accounting; development issues in accounting and information systems; human factors issues related to information technology; development of theories related to information technology; methodological issues in information technology research; information systems validation; human–computer interaction research in accounting information systems. The journal welcomes and encourages articles from both practitioners and academicians.