利用预测区间进行无监督的欺诈交易检测:一个案例研究

Asian Journal of Engineering and Applied Technology Pub Date : 2022-10-28 DOI:10.51983/ajeat-2022.11.2.3348

I. Hewapathirana

{"title":"利用预测区间进行无监督的欺诈交易检测:一个案例研究","authors":"I. Hewapathirana","doi":"10.51983/ajeat-2022.11.2.3348","DOIUrl":null,"url":null,"abstract":"Money laundering operations have a high negative impact on the growth of a country’s national economy. As all financial sectors are increasingly being integrated, it is vital to implement effective technological measures to address these fraudulent operations. Machine learning methods are widely used to classify an incoming transaction as fraudulent or non-fraudulent by analyzing the behaviour of past transactions. Unsupervised machine learning methods do not require label information on past transactions, and a classification is made solely based on the distribution of the transaction. This research presents three unsupervised classification methods: ordinary least squares regression-based (OLS) fraud detection, random forest-based (RF) fraud detection and dropout neural network-based (DNN) fraud detection. For each method, the goal is to classify an incoming transaction amount as fraudulent or non-fraudulent. The novelty in the proposed approach is the application of prediction interval calculation for automatically validating incoming transactions. The three methods are applied to a real-world dataset of credit card transactions. The fraud labels available for the dataset are removed during the model training phase but are later used to evaluate the performance of the final predictions. The performance of the proposed methods is further compared with two other unsupervised state-of-the-art methods. Based on the experimental results, the OLS and RF methods show the best performance in predicting the correct label of a transaction, while the DNN method is the most robust method for detecting fraudulent transactions. This novel concept of calculating prediction intervals for validating an incoming transaction introduces a new direction for unsupervised fraud detection. Since fraud labels on past transactions are not required for training, the proposed methods can be applied in an online setting to different areas, such as detecting money laundering activities, telecommunication fraud and intrusion detection.","PeriodicalId":8524,"journal":{"name":"Asian Journal of Engineering and Applied Technology","volume":"81 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Utilizing Prediction Intervals for Unsupervised Detection of Fraudulent Transactions: A Case Study\",\"authors\":\"I. Hewapathirana\",\"doi\":\"10.51983/ajeat-2022.11.2.3348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Money laundering operations have a high negative impact on the growth of a country’s national economy. As all financial sectors are increasingly being integrated, it is vital to implement effective technological measures to address these fraudulent operations. Machine learning methods are widely used to classify an incoming transaction as fraudulent or non-fraudulent by analyzing the behaviour of past transactions. Unsupervised machine learning methods do not require label information on past transactions, and a classification is made solely based on the distribution of the transaction. This research presents three unsupervised classification methods: ordinary least squares regression-based (OLS) fraud detection, random forest-based (RF) fraud detection and dropout neural network-based (DNN) fraud detection. For each method, the goal is to classify an incoming transaction amount as fraudulent or non-fraudulent. The novelty in the proposed approach is the application of prediction interval calculation for automatically validating incoming transactions. The three methods are applied to a real-world dataset of credit card transactions. The fraud labels available for the dataset are removed during the model training phase but are later used to evaluate the performance of the final predictions. The performance of the proposed methods is further compared with two other unsupervised state-of-the-art methods. Based on the experimental results, the OLS and RF methods show the best performance in predicting the correct label of a transaction, while the DNN method is the most robust method for detecting fraudulent transactions. This novel concept of calculating prediction intervals for validating an incoming transaction introduces a new direction for unsupervised fraud detection. Since fraud labels on past transactions are not required for training, the proposed methods can be applied in an online setting to different areas, such as detecting money laundering activities, telecommunication fraud and intrusion detection.\",\"PeriodicalId\":8524,\"journal\":{\"name\":\"Asian Journal of Engineering and Applied Technology\",\"volume\":\"81 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Engineering and Applied Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.51983/ajeat-2022.11.2.3348\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Engineering and Applied Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51983/ajeat-2022.11.2.3348","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

洗钱活动对一个国家的国民经济增长有很大的负面影响。随着所有金融部门日益一体化，实施有效的技术措施来解决这些欺诈操作至关重要。机器学习方法被广泛用于通过分析过去交易的行为来将传入的交易分类为欺诈或非欺诈。无监督机器学习方法不需要过去交易的标签信息，并且仅根据交易的分布进行分类。本研究提出了三种无监督分类方法:基于普通最小二乘回归(OLS)的欺诈检测、基于随机森林(RF)的欺诈检测和基于dropout神经网络(DNN)的欺诈检测。对于每种方法，目标是将传入的交易金额分类为欺诈性或非欺诈性。该方法的新颖之处在于应用预测区间计算来自动验证传入的事务。这三种方法应用于真实世界的信用卡交易数据集。可用于数据集的欺诈标签在模型训练阶段被删除，但随后用于评估最终预测的性能。该方法的性能进一步与其他两种无监督的最新方法进行了比较。基于实验结果，OLS和RF方法在预测交易的正确标签方面表现出最好的性能，而DNN方法在检测欺诈交易方面是最鲁棒的方法。这种计算验证传入事务的预测间隔的新概念为无监督欺诈检测引入了一个新的方向。由于培训不需要对过去的交易进行欺诈标记，因此建议的方法可以在在线环境中应用于不同领域，例如检测洗钱活动、电信欺诈和入侵检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Utilizing Prediction Intervals for Unsupervised Detection of Fraudulent Transactions: A Case Study

Money laundering operations have a high negative impact on the growth of a country’s national economy. As all financial sectors are increasingly being integrated, it is vital to implement effective technological measures to address these fraudulent operations. Machine learning methods are widely used to classify an incoming transaction as fraudulent or non-fraudulent by analyzing the behaviour of past transactions. Unsupervised machine learning methods do not require label information on past transactions, and a classification is made solely based on the distribution of the transaction. This research presents three unsupervised classification methods: ordinary least squares regression-based (OLS) fraud detection, random forest-based (RF) fraud detection and dropout neural network-based (DNN) fraud detection. For each method, the goal is to classify an incoming transaction amount as fraudulent or non-fraudulent. The novelty in the proposed approach is the application of prediction interval calculation for automatically validating incoming transactions. The three methods are applied to a real-world dataset of credit card transactions. The fraud labels available for the dataset are removed during the model training phase but are later used to evaluate the performance of the final predictions. The performance of the proposed methods is further compared with two other unsupervised state-of-the-art methods. Based on the experimental results, the OLS and RF methods show the best performance in predicting the correct label of a transaction, while the DNN method is the most robust method for detecting fraudulent transactions. This novel concept of calculating prediction intervals for validating an incoming transaction introduces a new direction for unsupervised fraud detection. Since fraud labels on past transactions are not required for training, the proposed methods can be applied in an online setting to different areas, such as detecting money laundering activities, telecommunication fraud and intrusion detection.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Asian Journal of Engineering and Applied Technology

自引率

0.00%

发文量