{"title":"信用卡欺诈检测的集成技术","authors":"Satya Dileep Penmetsa, Sabah Mohammed","doi":"10.21742/ijsbt.2021.9.2.03","DOIUrl":null,"url":null,"abstract":"Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.","PeriodicalId":448069,"journal":{"name":"International Journal of Smart Business and Technology","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Ensemble Techniques for Credit Card Fraud Detection\",\"authors\":\"Satya Dileep Penmetsa, Sabah Mohammed\",\"doi\":\"10.21742/ijsbt.2021.9.2.03\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.\",\"PeriodicalId\":448069,\"journal\":{\"name\":\"International Journal of Smart Business and Technology\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Smart Business and Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21742/ijsbt.2021.9.2.03\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Smart Business and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21742/ijsbt.2021.9.2.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ensemble Techniques for Credit Card Fraud Detection
Credit card fraud is a problem that has grown by great danger and has a huge impact on the financial sector. The challenges of credit card fraud are the availability of public data, high imbalance in data, and volatility of the fraud nature. Over the years ensemble learning has gained more importance and proved to give better performance. Here we try to do a comparative study of various ensemble approaches using various learning algorithms on the credit card fraud data and to understand multiple models based on various evaluation and performance metrics using the SMOTE balancing technique. machine learning algorithms presented several standard models which include NB, SVM, and DL. They used a publicly available credit card data set has been used for evaluation using individual (standard) models and hybrid models using AdaBoost and majority voting combination methods. The MCC metric was adopted as a performance measure, as it takes into account the true and false positive and negative predicted outcomes. The best MCC score is 0.823, achieved using majority voting. A perfect MCC score of 1 was achieved using AdaBoost and majority voting methods. To further evaluate the hybrid models, noise from 10% to 30% has been added into the data samples. The majority voting method yielded the best MCC score of 0.942 for 30% noise added to the data set. This shows that the majority voting method offers robust performance in the presence of noise. The use of ensemble techniques is very significant in the prediction of faulty credit card transactions from normal credit card transactions.