{"title":"基于图的欺诈检测抗噪声模型","authors":"Zhengyang Liu, Hang Yu, Xiangfeng Luo","doi":"10.1016/j.ipm.2025.104198","DOIUrl":null,"url":null,"abstract":"<div><div>Graph-based fraud detection is a critical task that identifies anomalous nodes that deviate from the majority of normal nodes within a graph. It can be applied in various practical situations, including but not limited to fake review detection, fraud transaction detection, and bot account detection. Current graph fraud detection models leverage popular Graph Neural Networks (GNNs) as their foundation, achieving significant success from the view of homogeneous and heterogeneous edges. However, these methods assume a sufficient proportion of completely accurate labeled nodes, overlooking the issue of noisy labels present in real-world scenarios. This can lead to significant performance degradation of current graph fraud detection methods. To address this challenge, we propose a Noise-Resistant Model for Graph Fraud Detection. First, we design a foundational graph fraud detection model from a spectral perspective to capture both homogeneous and heterogeneous information of nodes. Based on a conditional variational autoencoder(CVAE), we are able to obtain node features augmented from different perspectives. Next, nodes with noisy labels are trained alongside nodes with clean labels. Utilizing a self-supervised approach, noisy nodes with high prediction confidence that align with their labels are gradually incorporated to the training set. For nodes with lower confidence, we aim to learn better representations and gradually include more of them into the training set. With the augmented features generated by the CVAE, combined with a support set constructed from clean labels, we compute the consistency loss with adversarial strategies to ensure that features augmented from both normal and anomalous perspectives are brought closer to the relevant categories within the support set. Extensive experiments comparing our method with twelve state-of-the-art baselines on six real-world datasets – Amazon, Yelp, Elliptic, FDCompCN, T-Finance, and T-Social – showcase the superiority of our model.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 5","pages":"Article 104198"},"PeriodicalIF":7.4000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Noise-Resistant Model for Graph-based Fraud Detection\",\"authors\":\"Zhengyang Liu, Hang Yu, Xiangfeng Luo\",\"doi\":\"10.1016/j.ipm.2025.104198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Graph-based fraud detection is a critical task that identifies anomalous nodes that deviate from the majority of normal nodes within a graph. It can be applied in various practical situations, including but not limited to fake review detection, fraud transaction detection, and bot account detection. Current graph fraud detection models leverage popular Graph Neural Networks (GNNs) as their foundation, achieving significant success from the view of homogeneous and heterogeneous edges. However, these methods assume a sufficient proportion of completely accurate labeled nodes, overlooking the issue of noisy labels present in real-world scenarios. This can lead to significant performance degradation of current graph fraud detection methods. To address this challenge, we propose a Noise-Resistant Model for Graph Fraud Detection. First, we design a foundational graph fraud detection model from a spectral perspective to capture both homogeneous and heterogeneous information of nodes. Based on a conditional variational autoencoder(CVAE), we are able to obtain node features augmented from different perspectives. Next, nodes with noisy labels are trained alongside nodes with clean labels. Utilizing a self-supervised approach, noisy nodes with high prediction confidence that align with their labels are gradually incorporated to the training set. For nodes with lower confidence, we aim to learn better representations and gradually include more of them into the training set. With the augmented features generated by the CVAE, combined with a support set constructed from clean labels, we compute the consistency loss with adversarial strategies to ensure that features augmented from both normal and anomalous perspectives are brought closer to the relevant categories within the support set. Extensive experiments comparing our method with twelve state-of-the-art baselines on six real-world datasets – Amazon, Yelp, Elliptic, FDCompCN, T-Finance, and T-Social – showcase the superiority of our model.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"62 5\",\"pages\":\"Article 104198\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2025-05-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325001396\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325001396","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
A Noise-Resistant Model for Graph-based Fraud Detection
Graph-based fraud detection is a critical task that identifies anomalous nodes that deviate from the majority of normal nodes within a graph. It can be applied in various practical situations, including but not limited to fake review detection, fraud transaction detection, and bot account detection. Current graph fraud detection models leverage popular Graph Neural Networks (GNNs) as their foundation, achieving significant success from the view of homogeneous and heterogeneous edges. However, these methods assume a sufficient proportion of completely accurate labeled nodes, overlooking the issue of noisy labels present in real-world scenarios. This can lead to significant performance degradation of current graph fraud detection methods. To address this challenge, we propose a Noise-Resistant Model for Graph Fraud Detection. First, we design a foundational graph fraud detection model from a spectral perspective to capture both homogeneous and heterogeneous information of nodes. Based on a conditional variational autoencoder(CVAE), we are able to obtain node features augmented from different perspectives. Next, nodes with noisy labels are trained alongside nodes with clean labels. Utilizing a self-supervised approach, noisy nodes with high prediction confidence that align with their labels are gradually incorporated to the training set. For nodes with lower confidence, we aim to learn better representations and gradually include more of them into the training set. With the augmented features generated by the CVAE, combined with a support set constructed from clean labels, we compute the consistency loss with adversarial strategies to ensure that features augmented from both normal and anomalous perspectives are brought closer to the relevant categories within the support set. Extensive experiments comparing our method with twelve state-of-the-art baselines on six real-world datasets – Amazon, Yelp, Elliptic, FDCompCN, T-Finance, and T-Social – showcase the superiority of our model.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.