{"title":"Securing Transactions: A Hybrid Dependable Ensemble Machine Learning Model using IHT-LR and Grid Search","authors":"Md. Alamin Talukder, Rakib Hossen, Md Ashraf Uddin, Mohammed Nasir Uddin, Uzzal Kumar Acharjee","doi":"arxiv-2402.14389","DOIUrl":null,"url":null,"abstract":"Financial institutions and businesses face an ongoing challenge from\nfraudulent transactions, prompting the need for effective detection methods.\nDetecting credit card fraud is crucial for identifying and preventing\nunauthorized transactions.Timely detection of fraud enables investigators to\ntake swift actions to mitigate further losses. However, the investigation\nprocess is often time-consuming, limiting the number of alerts that can be\nthoroughly examined each day. Therefore, the primary objective of a fraud\ndetection model is to provide accurate alerts while minimizing false alarms and\nmissed fraud cases. In this paper, we introduce a state-of-the-art hybrid\nensemble (ENS) dependable Machine learning (ML) model that intelligently\ncombines multiple algorithms with proper weighted optimization using Grid\nsearch, including Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor\n(KNN), and Multilayer Perceptron (MLP), to enhance fraud identification. To\naddress the data imbalance issue, we employ the Instant Hardness Threshold\n(IHT) technique in conjunction with Logistic Regression (LR), surpassing\nconventional approaches. Our experiments are conducted on a publicly available\ncredit card dataset comprising 284,807 transactions. The proposed model\nachieves impressive accuracy rates of 99.66%, 99.73%, 98.56%, and 99.79%, and a\nperfect 100% for the DT, RF, KNN, MLP and ENS models, respectively. The hybrid\nensemble model outperforms existing works, establishing a new benchmark for\ndetecting fraudulent transactions in high-frequency scenarios. The results\nhighlight the effectiveness and reliability of our approach, demonstrating\nsuperior performance metrics and showcasing its exceptional potential for\nreal-world fraud detection applications.","PeriodicalId":501372,"journal":{"name":"arXiv - QuantFin - General Finance","volume":"75 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - General Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2402.14389","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Financial institutions and businesses face an ongoing challenge from
fraudulent transactions, prompting the need for effective detection methods.
Detecting credit card fraud is crucial for identifying and preventing
unauthorized transactions.Timely detection of fraud enables investigators to
take swift actions to mitigate further losses. However, the investigation
process is often time-consuming, limiting the number of alerts that can be
thoroughly examined each day. Therefore, the primary objective of a fraud
detection model is to provide accurate alerts while minimizing false alarms and
missed fraud cases. In this paper, we introduce a state-of-the-art hybrid
ensemble (ENS) dependable Machine learning (ML) model that intelligently
combines multiple algorithms with proper weighted optimization using Grid
search, including Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor
(KNN), and Multilayer Perceptron (MLP), to enhance fraud identification. To
address the data imbalance issue, we employ the Instant Hardness Threshold
(IHT) technique in conjunction with Logistic Regression (LR), surpassing
conventional approaches. Our experiments are conducted on a publicly available
credit card dataset comprising 284,807 transactions. The proposed model
achieves impressive accuracy rates of 99.66%, 99.73%, 98.56%, and 99.79%, and a
perfect 100% for the DT, RF, KNN, MLP and ENS models, respectively. The hybrid
ensemble model outperforms existing works, establishing a new benchmark for
detecting fraudulent transactions in high-frequency scenarios. The results
highlight the effectiveness and reliability of our approach, demonstrating
superior performance metrics and showcasing its exceptional potential for
real-world fraud detection applications.