基于信用卡欺诈数据的决策分析与预测

Deshan Huang, Yu Lin, Zhaoxing Weng, Jiajie Xiong
{"title":"基于信用卡欺诈数据的决策分析与预测","authors":"Deshan Huang, Yu Lin, Zhaoxing Weng, Jiajie Xiong","doi":"10.1145/3478301.3478305","DOIUrl":null,"url":null,"abstract":"With the common use of credit cards in today's transactions, the related fraudulent behavior inevitably occurs and causes considerable loss of money. To solve this problem, our work used a dataset that contains legal credit card transactions as well as fraud transactions to find an effective solution. In this paper, through processing and analyzing the transaction data, the data was discovered to be unbalanced, so stratified sampling and oversampling were performed to achieve a more reliable analysis of the unbalanced dataset. Meanwhile, due to the randomness of sampling, the cross-validated were used for the final model selection. Further, we utilized five algorithms to build models which contains statistical machine learning model and deep learning model. To obtain optimal performance of the models, hyperparameter tuning was performed for the five classifiers. Finally, the results indicate that the optimal model was XGBoost, and its performance can be verified in a real-life scenario in the future.","PeriodicalId":338866,"journal":{"name":"The 2nd European Symposium on Computer and Communications","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Decision Analysis and Prediction Based on Credit Card Fraud Data\",\"authors\":\"Deshan Huang, Yu Lin, Zhaoxing Weng, Jiajie Xiong\",\"doi\":\"10.1145/3478301.3478305\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the common use of credit cards in today's transactions, the related fraudulent behavior inevitably occurs and causes considerable loss of money. To solve this problem, our work used a dataset that contains legal credit card transactions as well as fraud transactions to find an effective solution. In this paper, through processing and analyzing the transaction data, the data was discovered to be unbalanced, so stratified sampling and oversampling were performed to achieve a more reliable analysis of the unbalanced dataset. Meanwhile, due to the randomness of sampling, the cross-validated were used for the final model selection. Further, we utilized five algorithms to build models which contains statistical machine learning model and deep learning model. To obtain optimal performance of the models, hyperparameter tuning was performed for the five classifiers. Finally, the results indicate that the optimal model was XGBoost, and its performance can be verified in a real-life scenario in the future.\",\"PeriodicalId\":338866,\"journal\":{\"name\":\"The 2nd European Symposium on Computer and Communications\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 2nd European Symposium on Computer and Communications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3478301.3478305\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2nd European Symposium on Computer and Communications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3478301.3478305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

随着信用卡在当今交易中的普遍使用,相关的欺诈行为不可避免地发生,并造成相当大的金钱损失。为了解决这个问题,我们的工作使用了一个包含合法信用卡交易和欺诈交易的数据集来找到一个有效的解决方案。本文通过对交易数据的处理和分析,发现数据不平衡,因此进行分层抽样和过抽样,以实现对不平衡数据集的更可靠的分析。同时,由于抽样的随机性,最终的模型选择采用交叉验证。在此基础上,利用五种算法构建了统计机器学习模型和深度学习模型。为了获得最优的模型性能,对五个分类器进行了超参数调优。最后,结果表明,最优模型为XGBoost,其性能可以在未来的实际场景中得到验证。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Decision Analysis and Prediction Based on Credit Card Fraud Data
With the common use of credit cards in today's transactions, the related fraudulent behavior inevitably occurs and causes considerable loss of money. To solve this problem, our work used a dataset that contains legal credit card transactions as well as fraud transactions to find an effective solution. In this paper, through processing and analyzing the transaction data, the data was discovered to be unbalanced, so stratified sampling and oversampling were performed to achieve a more reliable analysis of the unbalanced dataset. Meanwhile, due to the randomness of sampling, the cross-validated were used for the final model selection. Further, we utilized five algorithms to build models which contains statistical machine learning model and deep learning model. To obtain optimal performance of the models, hyperparameter tuning was performed for the five classifiers. Finally, the results indicate that the optimal model was XGBoost, and its performance can be verified in a real-life scenario in the future.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信