Ensemble Techniques for Credit Card Fraud Detection

International Journal of Smart Business and Technology. Pub Date: 2021-09-30. DOI: 10.21742/ijsbt.2021.9.2.03

Satya Dileep Penmetsa, Sabah Mohammed


Citations: 1

Abstract

Credit card fraud is a growing problem with a huge impact on the financial sector. The main challenges in detecting it are the limited availability of public data, the high class imbalance in the data, and the changing nature of fraud. Over the years, ensemble learning has gained importance and has proved to give better performance. Here we carry out a comparative study of ensemble approaches built on various learning algorithms, applied to credit card fraud data balanced with the SMOTE technique, and compare the resulting models on several evaluation and performance metrics. The machine learning algorithms considered include several standard models: naive Bayes (NB), support vector machine (SVM), and deep learning (DL). A publicly available credit card data set was used to evaluate both individual (standard) models and hybrid models built with the AdaBoost and majority voting combination methods. The Matthews correlation coefficient (MCC) was adopted as the performance measure, as it takes into account true and false positives and negatives. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% was added to the data samples. The majority voting method yielded the best MCC score of 0.942 with 30% noise added to the data set, which shows that it offers robust performance in the presence of noise. The use of ensemble techniques is therefore significant in distinguishing fraudulent credit card transactions from normal ones.
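As a rough illustration of two ideas named in the abstract, the sketch below combines the label predictions of several classifiers by majority voting and scores the result with the MCC. The function names, the three stand-in models, and the toy labels are assumptions made up for this example and are not taken from the paper. SMOTE balancing, AdaBoost, and noise injection are not shown.

```python
from collections import Counter
from math import sqrt


def majority_vote(predictions):
    """Combine per-model label lists into one label per sample.

    `predictions` is a list of equal-length label lists, one per model.
    With an odd number of binary classifiers there are no ties.
    """
    combined = []
    for sample_votes in zip(*predictions):
        # most_common(1) returns the label with the highest vote count
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (1 = fraud, 0 = normal)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Common convention: report 0 when a row or column of the confusion matrix is empty
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy labels invented for illustration: 6 normal transactions, 2 fraudulent
y_true = [0, 0, 0, 0, 0, 0, 1, 1]
# Hypothetical outputs of three base models, each wrong on different samples
model_a = [0, 0, 0, 0, 0, 1, 1, 0]
model_b = [0, 0, 0, 0, 0, 0, 1, 1]
model_c = [0, 1, 0, 0, 0, 0, 0, 1]

y_vote = majority_vote([model_a, model_b, model_c])
print("model_a MCC:", round(mcc(y_true, model_a), 3))
print("voted   MCC:", round(mcc(y_true, y_vote), 3))
```

On this toy data the individual errors fall on different samples, so the vote corrects them and scores higher than `model_a` alone. MCC is used here rather than accuracy because, with so few fraud cases, a model that labels everything as normal would still look accurate while its MCC would be 0.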
