信用卡欺诈检测的集成技术

Satya Dileep Penmetsa, Sabah Mohammed
{"title":"信用卡欺诈检测的集成技术","authors":"Satya Dileep Penmetsa, Sabah Mohammed","doi":"10.21742/ijsbt.2021.9.2.03","DOIUrl":null,"url":null,"abstract":"Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.","PeriodicalId":448069,"journal":{"name":"International Journal of Smart Business and Technology","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Ensemble Techniques for Credit Card Fraud Detection\",\"authors\":\"Satya Dileep Penmetsa, Sabah Mohammed\",\"doi\":\"10.21742/ijsbt.2021.9.2.03\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.\",\"PeriodicalId\":448069,\"journal\":{\"name\":\"International Journal of Smart Business and Technology\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Smart Business and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21742/ijsbt.2021.9.2.03\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Smart Business and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/ijsbt.2021.9.2.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

信用卡诈骗是一个日益严重的问题,对金融行业产生了巨大的影响。信用卡诈骗面临的挑战是公共数据的可用性、数据的高度不平衡以及诈骗性质的波动性。多年来,集成学习越来越受到重视,并被证明具有更好的性能。在这里,我们尝试对使用信用卡欺诈数据的各种学习算法的各种集成方法进行比较研究,并使用SMOTE平衡技术来理解基于各种评估和性能指标的多个模型。机器学习算法提出了几种标准模型,包括NB、SVM和DL。他们使用了一个公开可用的信用卡数据集,使用个人(标准)模型和使用AdaBoost和多数投票组合方法的混合模型进行评估。MCC指标被用作绩效衡量标准,因为它考虑了预测结果的真、假阳性和阴性。最佳MCC得分为0.823,采用多数决法。使用AdaBoost和多数投票方法获得了1分的完美MCC得分。为了进一步评价混合模型,在数据样本中加入了10% ~ 30%的噪声。当数据集中加入30%的噪声时,多数投票法的MCC得分为0.942。这表明多数投票方法在存在噪声的情况下具有鲁棒性。集成技术的使用对于从正常的信用卡交易中预测错误的信用卡交易是非常重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ensemble Techniques for Credit Card Fraud Detection
Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信