Predicting Credit Card Transaction Fraud Using Machine Learning Algorithms

智能学习系统与应用(英文) Pub Date : 2019-08-14 DOI:10.4236/JILSA.2019.113003

Jiaxin Gao, Zirui Zhou, Jiangshan Ai, Bingxin Xia, Stephen Coggeshall

引用次数: 13

Abstract

Credit card fraud is a wide-ranging issue for financial institutions, involving theft and fraud committed using a payment card. In this paper, we explore the application of linear and nonlinear statistical modeling and machine learning models on real credit card transaction data. The models built are supervised fraud models that attempt to identify which transactions are most likely fraudulent. We discuss the processes of data exploration, data cleaning, variable creation, feature selection, model algorithms, and results. Five different supervised models are explored and compared including logistic regression, neural networks, random forest, boosted tree and support vector machines. The boosted tree model shows the best fraud detection result (FDR = 49.83%) for this particular data set. The resulting model can be utilized in a credit card fraud detection system. A similar model development process can be performed in related business domains such as insurance and telecommunications, to avoid or detect fraudulent activity.

查看原文本刊更多论文

使用机器学习算法预测信用卡交易欺诈

信用卡诈骗是金融机构的一个广泛问题，涉及使用支付卡进行的盗窃和欺诈。在本文中，我们探索了线性和非线性统计建模以及机器学习模型在真实信用卡交易数据上的应用。建立的模型是有监督的欺诈模型，试图识别哪些交易最有可能是欺诈的。我们讨论了数据探索、数据清理、变量创建、特征选择、模型算法和结果的过程。探讨并比较了五种不同的监督模型，包括逻辑回归、神经网络、随机森林、增强树和支持向量机。对于这个特定的数据集，增强树模型显示了最佳的欺诈检测结果(FDR = 49.83%)。该模型可用于信用卡欺诈检测系统。类似的模型开发过程可以在相关的业务领域(如保险和电信)中执行，以避免或检测欺诈活动。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

智能学习系统与应用(英文)

自引率

0.00%

发文量

135