{"title":"机器学习在信用卡欺诈检测中的应用综述(以案例为例)","authors":"Zahra Faraji","doi":"10.33215/sjom.v5i1.770","DOIUrl":null,"url":null,"abstract":"Purpose - This paper aims to highlight the widely used supervised techniques applied for fraud detection. In addition, this paper aims to apply some techniques to evaluate their performance on real-world data and develop an ensemble model as a potential solution for this problem.\nDesign/Methodology- Different techniques applied in this study for fraud detection purposes are logistic regression, decision tree, random forest, KNN, and XGBoost. The confusion matrix gives information about the assignment of inputs to the different classes. This study uses precision and recall to evaluate the performance, calculated based on the confusion matrix.\nFindings- XGBoost is the fastest and is expected to have the best performance; however, it is only outperforming the random forest in terms of accuracy, precision, recall, and f1-score. In general, the KNN and logistic regression have better performance, which means they better detect fraudulent transactions.\nPractical Implications- The new model can be applied to new data instead of the previous techniques.","PeriodicalId":215982,"journal":{"name":"SEISENSE Journal of Management","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A Review of Machine Learning Applications for Credit Card Fraud Detection with A Case study\",\"authors\":\"Zahra Faraji\",\"doi\":\"10.33215/sjom.v5i1.770\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose - This paper aims to highlight the widely used supervised techniques applied for fraud detection. In addition, this paper aims to apply some techniques to evaluate their performance on real-world data and develop an ensemble model as a potential solution for this problem.\\nDesign/Methodology- Different techniques applied in this study for fraud detection purposes are logistic regression, decision tree, random forest, KNN, and XGBoost. The confusion matrix gives information about the assignment of inputs to the different classes. This study uses precision and recall to evaluate the performance, calculated based on the confusion matrix.\\nFindings- XGBoost is the fastest and is expected to have the best performance; however, it is only outperforming the random forest in terms of accuracy, precision, recall, and f1-score. In general, the KNN and logistic regression have better performance, which means they better detect fraudulent transactions.\\nPractical Implications- The new model can be applied to new data instead of the previous techniques.\",\"PeriodicalId\":215982,\"journal\":{\"name\":\"SEISENSE Journal of Management\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SEISENSE Journal of Management\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33215/sjom.v5i1.770\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SEISENSE Journal of Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33215/sjom.v5i1.770","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Review of Machine Learning Applications for Credit Card Fraud Detection with A Case study
Purpose - This paper aims to highlight the widely used supervised techniques applied for fraud detection. In addition, this paper aims to apply some techniques to evaluate their performance on real-world data and develop an ensemble model as a potential solution for this problem.
Design/Methodology- Different techniques applied in this study for fraud detection purposes are logistic regression, decision tree, random forest, KNN, and XGBoost. The confusion matrix gives information about the assignment of inputs to the different classes. This study uses precision and recall to evaluate the performance, calculated based on the confusion matrix.
Findings- XGBoost is the fastest and is expected to have the best performance; however, it is only outperforming the random forest in terms of accuracy, precision, recall, and f1-score. In general, the KNN and logistic regression have better performance, which means they better detect fraudulent transactions.
Practical Implications- The new model can be applied to new data instead of the previous techniques.