{"title":"Synergizing domain knowledge and machine learning: Intelligent early fraud detection enhanced by earnings management analysis","authors":"Shipei Zeng, Shan Dai","doi":"10.1111/irfi.70021","DOIUrl":null,"url":null,"abstract":"<p>This paper addresses firm fraud detection by synergizing domain knowledge with machine learning through heuristic and explainable feature construction. Unlike traditional approaches that focus on algorithmic improvements, our method introduces a set of features based on earnings management analysis, providing factors influencing firm fraudulent behavior. Empirical results using a firm-year dataset from China demonstrate better classification accuracy of fraud detection compared to machine learning models with raw financial statement features alone. Additionally, the results remain robust with different false positive rates, observation periods, and firm groups. This domain knowledge-enhanced machine learning method, with alternative features for fraud detection, leads to more transparent regulation and the potential for similar counterfeit detection applications in China and beyond.</p>","PeriodicalId":46664,"journal":{"name":"International Review of Finance","volume":"25 2","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Review of Finance","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/irfi.70021","RegionNum":4,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
引用次数: 0
Abstract
This paper addresses firm fraud detection by synergizing domain knowledge with machine learning through heuristic and explainable feature construction. Unlike traditional approaches that focus on algorithmic improvements, our method introduces a set of features based on earnings management analysis, providing factors influencing firm fraudulent behavior. Empirical results using a firm-year dataset from China demonstrate better classification accuracy of fraud detection compared to machine learning models with raw financial statement features alone. Additionally, the results remain robust with different false positive rates, observation periods, and firm groups. This domain knowledge-enhanced machine learning method, with alternative features for fraud detection, leads to more transparent regulation and the potential for similar counterfeit detection applications in China and beyond.
期刊介绍:
The International Review of Finance (IRF) publishes high-quality research on all aspects of financial economics, including traditional areas such as asset pricing, corporate finance, market microstructure, financial intermediation and regulation, financial econometrics, financial engineering and risk management, as well as new areas such as markets and institutions of emerging market economies, especially those in the Asia-Pacific region. In addition, the Letters Section in IRF is a premium outlet of letter-length research in all fields of finance. The length of the articles in the Letters Section is limited to a maximum of eight journal pages.