{"title":"A Predicting Model For Accounting Fraud Based On Ensemble Learning","authors":"Yunchuan Sun, Zixiu Ma, Xiaoping Zeng, Yao Guo","doi":"10.1109/INDIN45523.2021.9557545","DOIUrl":null,"url":null,"abstract":"Accounting fraud, usually difficult to detect, can cause significant harm to stakeholders and serious damage to the market. Effective methods of accounting fraud detection are needed for the prevention and governance of accounting fraud.In this study, we develop a novel accounting fraud prediction model using XGBoost, a powerful ensemble learning approach. We respectively select 12 financial ratios, 28 raw accounting numbers and 99 raw accounting numbers available from Chinese listed firms’ financial statements, as the model input. To assess the performance of fraud prediction models, we select two evaluation metrics - AUC and NDCG@k, and two benchmark models - the Dechow et al. (2011) logistic regression model based on financial ratios, and the Bao et al. (2020) AdaBoost model based on raw accounting numbers.Results show that: 1) our XGBoost-based prediction model outperforms two benchmark models by a large margin whatever model inputs and evaluation metrics; 2) the XGBoost-based prediction model with raw accounting numbers input outperforms the one with financial ratios input; 3) the XGoost-based prediction model with 99 raw accounting numbers input outperforms the one with 28 raw accounting numbers input.","PeriodicalId":370921,"journal":{"name":"2021 IEEE 19th International Conference on Industrial Informatics (INDIN)","volume":"27 20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 19th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN45523.2021.9557545","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Accounting fraud, usually difficult to detect, can cause significant harm to stakeholders and serious damage to the market. Effective methods of accounting fraud detection are needed for the prevention and governance of accounting fraud.In this study, we develop a novel accounting fraud prediction model using XGBoost, a powerful ensemble learning approach. We respectively select 12 financial ratios, 28 raw accounting numbers and 99 raw accounting numbers available from Chinese listed firms’ financial statements, as the model input. To assess the performance of fraud prediction models, we select two evaluation metrics - AUC and NDCG@k, and two benchmark models - the Dechow et al. (2011) logistic regression model based on financial ratios, and the Bao et al. (2020) AdaBoost model based on raw accounting numbers.Results show that: 1) our XGBoost-based prediction model outperforms two benchmark models by a large margin whatever model inputs and evaluation metrics; 2) the XGBoost-based prediction model with raw accounting numbers input outperforms the one with financial ratios input; 3) the XGoost-based prediction model with 99 raw accounting numbers input outperforms the one with 28 raw accounting numbers input.