{"title":"A Review of Machine Learning Applications for Credit Card Fraud Detection with A Case study","authors":"Zahra Faraji","doi":"10.33215/sjom.v5i1.770","DOIUrl":null,"url":null,"abstract":"Purpose - This paper aims to highlight the widely used supervised techniques applied for fraud detection. In addition, this paper aims to apply some techniques to evaluate their performance on real-world data and develop an ensemble model as a potential solution for this problem.\nDesign/Methodology- Different techniques applied in this study for fraud detection purposes are logistic regression, decision tree, random forest, KNN, and XGBoost. The confusion matrix gives information about the assignment of inputs to the different classes. This study uses precision and recall to evaluate the performance, calculated based on the confusion matrix.\nFindings- XGBoost is the fastest and is expected to have the best performance; however, it is only outperforming the random forest in terms of accuracy, precision, recall, and f1-score. In general, the KNN and logistic regression have better performance, which means they better detect fraudulent transactions.\nPractical Implications- The new model can be applied to new data instead of the previous techniques.","PeriodicalId":215982,"journal":{"name":"SEISENSE Journal of Management","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SEISENSE Journal of Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33215/sjom.v5i1.770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Purpose - This paper aims to highlight the widely used supervised techniques applied for fraud detection. In addition, this paper aims to apply some techniques to evaluate their performance on real-world data and develop an ensemble model as a potential solution for this problem.
Design/Methodology- Different techniques applied in this study for fraud detection purposes are logistic regression, decision tree, random forest, KNN, and XGBoost. The confusion matrix gives information about the assignment of inputs to the different classes. This study uses precision and recall to evaluate the performance, calculated based on the confusion matrix.
Findings- XGBoost is the fastest and is expected to have the best performance; however, it is only outperforming the random forest in terms of accuracy, precision, recall, and f1-score. In general, the KNN and logistic regression have better performance, which means they better detect fraudulent transactions.
Practical Implications- The new model can be applied to new data instead of the previous techniques.