Predicting Credit Card Transaction Fraud Using Machine Learning Algorithms

Jiaxin Gao, Zirui Zhou, Jiangshan Ai, Bingxin Xia, Stephen Coggeshall
Journal of Intelligent Learning Systems and Applications, published 2019-08-14
DOI: 10.4236/JILSA.2019.113003 (https://doi.org/10.4236/JILSA.2019.113003)
Citations: 13

Abstract

Credit card fraud is a wide-ranging issue for financial institutions, involving theft and fraud committed using a payment card. In this paper, we explore the application of linear and nonlinear statistical modeling and machine learning models to real credit card transaction data. The models built are supervised fraud models that attempt to identify which transactions are most likely to be fraudulent. We discuss data exploration, data cleaning, variable creation, feature selection, model algorithms, and results. Five supervised models are explored and compared: logistic regression, neural networks, random forest, boosted tree, and support vector machines. The boosted tree model shows the best fraud detection result (FDR = 49.83%) for this particular data set. The resulting model can be used in a credit card fraud detection system. A similar model development process can be applied in related business domains, such as insurance and telecommunications, to avoid or detect fraudulent activity.
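The abstract compares models by an FDR figure but does not define how it is computed. The following is a minimal sketch of one common reading, assuming FDR means the share of all fraudulent transactions that fall within the highest-scoring fraction of records. The function name, the `cutoff` parameter and its default, and the toy scores are illustrative assumptions, not details taken from the paper.

```python
from typing import Sequence


def fraud_detection_rate(
    scores: Sequence[float],
    labels: Sequence[int],
    cutoff: float = 0.03,
) -> float:
    """Share of all fraud found in the top `cutoff` fraction of records by score.

    `cutoff` is an assumed review fraction; the abstract does not state
    the population on which the reported FDR was measured.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    total_fraud = sum(labels)
    if total_fraud == 0:
        raise ValueError("no fraud labels; FDR is undefined")
    # Review at least one record, even on tiny data sets.
    n_review = max(1, int(len(scores) * cutoff))
    # Rank records from most to least suspicious.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    caught = sum(label for _, label in ranked[:n_review])
    return caught / total_fraud


# Toy data (hypothetical): model scores and true labels, 1 = fraud.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]
toy_labels = [1, 0, 1, 0, 0, 1, 0, 0]

fdr_quarter = fraud_detection_rate(toy_scores, toy_labels, cutoff=0.25)
fdr_half = fraud_detection_rate(toy_scores, toy_labels, cutoff=0.5)
```

Under this reading, each of the five candidate models would be scored on the same held-out transactions and ranked by the resulting value, so a higher FDR at a fixed cutoff means more fraud caught for the same review effort.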