Xinzhi Wang;Hang Yu;Jiayu Guo;Pengbo Li;Xiangfeng Luo
{"title":"Towards Fraud Detection via Fine-Grained Classification of User Behavior","authors":"Xinzhi Wang;Hang Yu;Jiayu Guo;Pengbo Li;Xiangfeng Luo","doi":"10.1109/TBDATA.2024.3517313","DOIUrl":null,"url":null,"abstract":"The mass volume of data in the modern business world requires fraud detection to be automated. Hence, some researchers constructed the fraud scenario into graph data and proposed graph-based fraud detection methods. These methods treat the problem of fraud detection as a binary node classification task. However, the differences between the nodes of the same class are ignored. In this paper, we try to distinguish differences in behavior among nodes of the same class to improve the model’s ability to detect deviation, i.e., we make a fine-grained classification of user behavior (called prototypes) and propose an adaptive prototype-based graph neural network (APGNN) for fraud detection. APGNN learns node behavior representations by extracting both neighborhood and global information, supplying preliminary knowledge for the adaptive creation of several prototypes, each representing a distinct behavior pattern. Subsequently, a new loss function is employed to enhance the prototypes’ capacity to capture these behavior patterns and to amplify the feature differences between different prototypes. Nodes are then projected onto these prototypes to derive the final behavior patterns. Extensive experiments on four real-world datasets show that this method can provide better fraud detection as well as a more understandable result.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 4","pages":"1994-2007"},"PeriodicalIF":5.7000,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10798511/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The mass volume of data in the modern business world requires fraud detection to be automated. Hence, some researchers constructed the fraud scenario into graph data and proposed graph-based fraud detection methods. These methods treat the problem of fraud detection as a binary node classification task. However, the differences between the nodes of the same class are ignored. In this paper, we try to distinguish differences in behavior among nodes of the same class to improve the model’s ability to detect deviation, i.e., we make a fine-grained classification of user behavior (called prototypes) and propose an adaptive prototype-based graph neural network (APGNN) for fraud detection. APGNN learns node behavior representations by extracting both neighborhood and global information, supplying preliminary knowledge for the adaptive creation of several prototypes, each representing a distinct behavior pattern. Subsequently, a new loss function is employed to enhance the prototypes’ capacity to capture these behavior patterns and to amplify the feature differences between different prototypes. Nodes are then projected onto these prototypes to derive the final behavior patterns. Extensive experiments on four real-world datasets show that this method can provide better fraud detection as well as a more understandable result.
期刊介绍:
The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.