使用监督机器学习方法预测税务欺诈

African Journal of Science, Technology, Innovation and Development Pub Date : 2023-04-09 DOI:10.1080/20421338.2023.2187930

Belle Fille Murorunkwere, D. Haughton, J. Nzabanita, Francis Kipkogei, I. Kabano

{"title":"使用监督机器学习方法预测税务欺诈","authors":"Belle Fille Murorunkwere, D. Haughton, J. Nzabanita, Francis Kipkogei, I. Kabano","doi":"10.1080/20421338.2023.2187930","DOIUrl":null,"url":null,"abstract":"With the advancement in technology, the tax base in Rwanda has become broader, and as a result, tax fraud is growing. Depending on the dataset used, fraud detection experts and researchers have used different methods to identify questionable cases. This paper aims to predict features of tax fraud using the most robust supervised machine-learning model. This research provides a context where a fraud expert can use a machine-learning model, and an implemented model offers instant feedback to the fraud expert. We evaluate supervised machine learning models such as Artificial Neural Network, Logistic Regression, Decision Tree, Random Forest, GaussianNB and XGBoost. Based on different evaluation metrics, Artificial Neural Network was the most robust model for predicting tax fraud. Findings reveal that the time of business that indicates the difference in time from when a business started and the time it was audited, the domestic businesses, taxpayers who import and export goods, those with no losses, those whose businesses are located in the eastern province, and those registered on withholding and Value Added Tax types are more susceptible to tax fraud. This study is among the few to evaluate the effectiveness of multiple supervised machine-learning models for identifying tax fraud factors on an accurate data set with numerous tax types. The evidence generated in the current study will serve as a valuable tool for both tax policymakers and auditors, as well as for enhancing awareness of more robust methods for predicting tax fraud.","PeriodicalId":7557,"journal":{"name":"African Journal of Science, Technology, Innovation and Development","volume":"60 1","pages":"731 - 742"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Predicting tax fraud using supervised machine learning approach\",\"authors\":\"Belle Fille Murorunkwere, D. Haughton, J. Nzabanita, Francis Kipkogei, I. Kabano\",\"doi\":\"10.1080/20421338.2023.2187930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advancement in technology, the tax base in Rwanda has become broader, and as a result, tax fraud is growing. Depending on the dataset used, fraud detection experts and researchers have used different methods to identify questionable cases. This paper aims to predict features of tax fraud using the most robust supervised machine-learning model. This research provides a context where a fraud expert can use a machine-learning model, and an implemented model offers instant feedback to the fraud expert. We evaluate supervised machine learning models such as Artificial Neural Network, Logistic Regression, Decision Tree, Random Forest, GaussianNB and XGBoost. Based on different evaluation metrics, Artificial Neural Network was the most robust model for predicting tax fraud. Findings reveal that the time of business that indicates the difference in time from when a business started and the time it was audited, the domestic businesses, taxpayers who import and export goods, those with no losses, those whose businesses are located in the eastern province, and those registered on withholding and Value Added Tax types are more susceptible to tax fraud. This study is among the few to evaluate the effectiveness of multiple supervised machine-learning models for identifying tax fraud factors on an accurate data set with numerous tax types. The evidence generated in the current study will serve as a valuable tool for both tax policymakers and auditors, as well as for enhancing awareness of more robust methods for predicting tax fraud.\",\"PeriodicalId\":7557,\"journal\":{\"name\":\"African Journal of Science, Technology, Innovation and Development\",\"volume\":\"60 1\",\"pages\":\"731 - 742\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"African Journal of Science, Technology, Innovation and Development\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/20421338.2023.2187930\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"African Journal of Science, Technology, Innovation and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/20421338.2023.2187930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

随着技术的进步，卢旺达的税基越来越广泛，因此，税务欺诈正在增加。根据所使用的数据集，欺诈检测专家和研究人员使用了不同的方法来识别可疑案例。本文旨在使用最强大的监督机器学习模型来预测税务欺诈的特征。这项研究为欺诈专家提供了一个可以使用机器学习模型的环境，并且实现的模型可以向欺诈专家提供即时反馈。我们评估了监督机器学习模型，如人工神经网络，逻辑回归，决策树，随机森林，GaussianNB和XGBoost。基于不同的评估指标，人工神经网络是预测税务欺诈最稳健的模型。调查结果显示，从开业时间到审计时间的时间差的营业时间、国内企业、进出口货物的纳税人、没有亏损的企业、位于东部省份的企业、以预扣税和增值税登记的企业更容易发生税务欺诈。本研究是为数不多的评估多个监督机器学习模型在具有多种税收类型的准确数据集上识别税务欺诈因素的有效性的研究之一。本研究中产生的证据将成为税收政策制定者和审计人员的宝贵工具，并有助于提高对预测税务欺诈的更有效方法的认识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Predicting tax fraud using supervised machine learning approach

With the advancement in technology, the tax base in Rwanda has become broader, and as a result, tax fraud is growing. Depending on the dataset used, fraud detection experts and researchers have used different methods to identify questionable cases. This paper aims to predict features of tax fraud using the most robust supervised machine-learning model. This research provides a context where a fraud expert can use a machine-learning model, and an implemented model offers instant feedback to the fraud expert. We evaluate supervised machine learning models such as Artificial Neural Network, Logistic Regression, Decision Tree, Random Forest, GaussianNB and XGBoost. Based on different evaluation metrics, Artificial Neural Network was the most robust model for predicting tax fraud. Findings reveal that the time of business that indicates the difference in time from when a business started and the time it was audited, the domestic businesses, taxpayers who import and export goods, those with no losses, those whose businesses are located in the eastern province, and those registered on withholding and Value Added Tax types are more susceptible to tax fraud. This study is among the few to evaluate the effectiveness of multiple supervised machine-learning models for identifying tax fraud factors on an accurate data set with numerous tax types. The evidence generated in the current study will serve as a valuable tool for both tax policymakers and auditors, as well as for enhancing awareness of more robust methods for predicting tax fraud.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

African Journal of Science, Technology, Innovation and Development

自引率

0.00%

发文量