FiFrauD: Unsupervised Financial Fraud Detection in Dynamic Graph Streams

IF 4.8 3区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

ACM Transactions on Knowledge Discovery from Data Pub Date : 2024-01-27 DOI:10.1145/3641857

Samira Khodabandehlou, Alireza Hashemi Golpayegani

{"title":"FiFrauD: Unsupervised Financial Fraud Detection in Dynamic Graph Streams","authors":"Samira Khodabandehlou, Alireza Hashemi Golpayegani","doi":"10.1145/3641857","DOIUrl":null,"url":null,"abstract":"<p>Given a stream of financial transactions between traders in an e-market, how can we accurately detect fraudulent traders and suspicious behaviors in real time? Despite the efforts made in detecting these fraudsters, this field still faces serious challenges, including the ineffectiveness of existing methods for the complex and streaming environment of e-markets. As a result, it is still difficult to quickly and accurately detect suspected traders and behavior patterns in real-time transactions, and it is still considered an open problem. Therefore, to solve this problem and alleviate the existing challenges, in this paper, we propose FiFrauD, which is an unsupervised, scalable approach that depicts the behavior of manipulators in a transaction stream. In this approach, real-time transactions between traders are converted into a stream of graphs, and instead of using supervised and semi-supervised learning methods, fraudulent traders are detected precisely by exploiting density signals in graphs. Specifically, we reveal the traits of fraudulent traders in the market and propose a novel metric from this perspective, i.e., graph topology, time, and behavior. Then, we search for suspicious blocks by greedily optimizing the proposed metric. Theoretical analysis demonstrates upper bounds for FiFrauD's effectiveness in catching suspicious trades. Extensive experiments on five real-world datasets with both actual and synthetic labels demonstrate that FiFrauD achieves significant accuracy improvements compared to state-of-the-art fraud detection methods. Also, it can find various suspicious behavior patterns in a linear running time and provide interpretable results. Furthermore, FiFrauD is resistant to the camouflage tactics used by fraudulent traders.</p>","PeriodicalId":49249,"journal":{"name":"ACM Transactions on Knowledge Discovery from Data","volume":"35 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Knowledge Discovery from Data","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3641857","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Given a stream of financial transactions between traders in an e-market, how can we accurately detect fraudulent traders and suspicious behaviors in real time? Despite the efforts made in detecting these fraudsters, this field still faces serious challenges, including the ineffectiveness of existing methods for the complex and streaming environment of e-markets. As a result, it is still difficult to quickly and accurately detect suspected traders and behavior patterns in real-time transactions, and it is still considered an open problem. Therefore, to solve this problem and alleviate the existing challenges, in this paper, we propose FiFrauD, which is an unsupervised, scalable approach that depicts the behavior of manipulators in a transaction stream. In this approach, real-time transactions between traders are converted into a stream of graphs, and instead of using supervised and semi-supervised learning methods, fraudulent traders are detected precisely by exploiting density signals in graphs. Specifically, we reveal the traits of fraudulent traders in the market and propose a novel metric from this perspective, i.e., graph topology, time, and behavior. Then, we search for suspicious blocks by greedily optimizing the proposed metric. Theoretical analysis demonstrates upper bounds for FiFrauD's effectiveness in catching suspicious trades. Extensive experiments on five real-world datasets with both actual and synthetic labels demonstrate that FiFrauD achieves significant accuracy improvements compared to state-of-the-art fraud detection methods. Also, it can find various suspicious behavior patterns in a linear running time and provide interpretable results. Furthermore, FiFrauD is resistant to the camouflage tactics used by fraudulent traders.

查看原文本刊更多论文

FiFrauD：动态图流中的无监督金融欺诈检测

面对电子市场中交易者之间的金融交易流，我们如何才能准确地实时检测出欺诈交易者和可疑行为？尽管在侦测这些欺诈者方面做出了努力，但这一领域仍面临严峻挑战，包括现有方法在电子市场复杂的流式环境中效果不佳。因此，要在实时交易中快速准确地检测出可疑交易者和行为模式仍然十分困难，而且仍被认为是一个悬而未决的问题。因此，为了解决这一问题，缓解现有的挑战，我们在本文中提出了 FiFrauD，这是一种无监督、可扩展的方法，用于描述交易流中操纵者的行为。在这种方法中，交易者之间的实时交易被转换成图流，而不是使用监督和半监督学习方法，而是通过利用图中的密度信号来精确检测欺诈交易者。具体来说，我们揭示了市场中欺诈交易者的特征，并从这个角度提出了一种新的度量方法，即图拓扑、时间和行为。然后，我们通过贪婪地优化所提出的指标来搜索可疑区块。理论分析表明了 FiFrauD 在捕捉可疑交易方面的有效性上限。在实际和合成标签的五个真实数据集上进行的广泛实验表明，与最先进的欺诈检测方法相比，FiFrauD 的准确率有了显著提高。此外，它还能在线性运行时间内发现各种可疑行为模式，并提供可解释的结果。此外，FiFrauD 还能抵御欺诈交易者使用的伪装策略。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Knowledge Discovery from Data COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

6.70

自引率

5.60%

发文量

172

审稿时长

3 months

期刊介绍： TKDD welcomes papers on a full range of research in the knowledge discovery and analysis of diverse forms of data. Such subjects include, but are not limited to: scalable and effective algorithms for data mining and big data analysis, mining brain networks, mining data streams, mining multi-media data, mining high-dimensional data, mining text, Web, and semi-structured data, mining spatial and temporal data, data mining for community generation, social network analysis, and graph structured data, security and privacy issues in data mining, visual, interactive and online data mining, pre-processing and post-processing for data mining, robust and scalable statistical methods, data mining languages, foundations of data mining, KDD framework and process, and novel applications and infrastructures exploiting data mining technology including massively parallel processing and cloud computing platforms. TKDD encourages papers that explore the above subjects in the context of large distributed networks of computers, parallel or multiprocessing computers, or new data devices. TKDD also encourages papers that describe emerging data mining applications that cannot be satisfied by the current data mining technology.