基于Logistic回归和随机森林的信用卡欺诈识别

Wasit Journal of Computer and Mathematics Science Pub Date : 2023-09-30 DOI:10.31185/wjcms.184

Wang Yundong, None Alexander Zhulev, None Omar G. Ahmed

{"title":"基于Logistic回归和随机森林的信用卡欺诈识别","authors":"Wang Yundong, None Alexander Zhulev, None Omar G. Ahmed","doi":"10.31185/wjcms.184","DOIUrl":null,"url":null,"abstract":"Fraud is an ancient yet ever-changing profession. Because of the digitization of money, financial transactions, banks, fraudsters now have a limitless number of possibilities to perpetrate crime from behind a screen, anywhere around the world. Fraud has a broad influence, with direct ramifications for business and the economy. It is of great worry to cybercrime organizations as recent studies have proven that ML algorithms may successfully be utilized to identify fraudulent transactions in massive amounts of payment data. Such techniques may identify fraudulent transactions in real time, which human auditors may miss. In this research, we apply supervised ML algorithms to the issue of fraud identification by analyzing simulated financial transaction data that is available to the public. Our aim is to show how supervised ML methods may be utilized to successfully identify data with extreme class disproportion. By way of example, we show how exploratory analysis may be utilized to identify fraudulent from real purchases. We also show that Random Forest outperform Logistic Regression when applied to a clearly distinguished dataset.","PeriodicalId":224730,"journal":{"name":"Wasit Journal of Computer and Mathematics Science","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Credit Card Fraud Identification using Logistic Regression and Random Forest\",\"authors\":\"Wang Yundong, None Alexander Zhulev, None Omar G. Ahmed\",\"doi\":\"10.31185/wjcms.184\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fraud is an ancient yet ever-changing profession. Because of the digitization of money, financial transactions, banks, fraudsters now have a limitless number of possibilities to perpetrate crime from behind a screen, anywhere around the world. Fraud has a broad influence, with direct ramifications for business and the economy. It is of great worry to cybercrime organizations as recent studies have proven that ML algorithms may successfully be utilized to identify fraudulent transactions in massive amounts of payment data. Such techniques may identify fraudulent transactions in real time, which human auditors may miss. In this research, we apply supervised ML algorithms to the issue of fraud identification by analyzing simulated financial transaction data that is available to the public. Our aim is to show how supervised ML methods may be utilized to successfully identify data with extreme class disproportion. By way of example, we show how exploratory analysis may be utilized to identify fraudulent from real purchases. We also show that Random Forest outperform Logistic Regression when applied to a clearly distinguished dataset.\",\"PeriodicalId\":224730,\"journal\":{\"name\":\"Wasit Journal of Computer and Mathematics Science\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Wasit Journal of Computer and Mathematics Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31185/wjcms.184\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Wasit Journal of Computer and Mathematics Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31185/wjcms.184","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

欺诈是一种古老而又不断变化的职业。由于货币、金融交易、银行的数字化，欺诈者现在有无限的可能性从屏幕后面实施犯罪，在世界任何地方。欺诈具有广泛的影响，对商业和经济产生直接影响。网络犯罪组织非常担心，因为最近的研究证明，机器学习算法可以成功地用于识别大量支付数据中的欺诈交易。这些技术可以实时识别欺诈性交易，而人类审计员可能会错过这些交易。在本研究中，我们通过分析可供公众使用的模拟金融交易数据，将监督ML算法应用于欺诈识别问题。我们的目的是展示如何利用监督机器学习方法成功识别具有极端类不比例的数据。通过示例，我们展示了如何利用探索性分析来识别真实购买中的欺诈行为。我们还表明，当应用于明确区分的数据集时，随机森林优于逻辑回归。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Credit Card Fraud Identification using Logistic Regression and Random Forest

Fraud is an ancient yet ever-changing profession. Because of the digitization of money, financial transactions, banks, fraudsters now have a limitless number of possibilities to perpetrate crime from behind a screen, anywhere around the world. Fraud has a broad influence, with direct ramifications for business and the economy. It is of great worry to cybercrime organizations as recent studies have proven that ML algorithms may successfully be utilized to identify fraudulent transactions in massive amounts of payment data. Such techniques may identify fraudulent transactions in real time, which human auditors may miss. In this research, we apply supervised ML algorithms to the issue of fraud identification by analyzing simulated financial transaction data that is available to the public. Our aim is to show how supervised ML methods may be utilized to successfully identify data with extreme class disproportion. By way of example, we show how exploratory analysis may be utilized to identify fraudulent from real purchases. We also show that Random Forest outperform Logistic Regression when applied to a clearly distinguished dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Wasit Journal of Computer and Mathematics Science

自引率

0.00%

发文量