Junwei Cheng,Ke Liang,Pengxing Feng,Weixiong Liu,Yong Tang,Chaobo He
{"title":"变分图自编码器的频率信号调制聚类扩散模型。","authors":"Junwei Cheng,Ke Liang,Pengxing Feng,Weixiong Liu,Yong Tang,Chaobo He","doi":"10.1109/tpami.2025.3614385","DOIUrl":null,"url":null,"abstract":"Variational autoencoders (VAEs) have been widely used for node clustering, with existing methods mainly focusing on enhancing the expressiveness of their latent space. Recently, the integration of diffusion models with VAEs has provided new opportunities to achieve this objective. However, the mechanism by which the diffusion model improves performance remains unclear. To bridge this gap, we conduct an empirical analysis from the perspective of graph spectral theory, revealing that the signal modulation induced by diffusion models closely aligns with the low-frequency spectral characteristics of VAEs, which in turn explains their effectiveness. Nevertheless, further experiments highlight that diffusion models exhibit limitations in modulating high-frequency signals, which diverge from the spectral characteristics of VAEs. Moreover, existing diffusion methods fail to enable the latent space to adequately capture and reflect cluster-specific characteristics. To address these challenges, we propose a novel plug-and-play method, FVD, to improve the performance of VAE-based methods in node clustering tasks. Specifically, we incorporate the graph wavelet transform as a secondary signal modulator, enabling independent adjustments of specific frequency bands to better align with the spectral characteristics of VAEs. Additionally, we introduce the Student's t-distribution as a conditional constraint in the reverse process of FVD, deriving a more compact variational lower bound. This enhancement preserves fine-grained node information while focusing on clustering details, effectively mitigating the cluster collapse phenomenon. Comprehensive experimental results demonstrate that integrating FVD with existing methods achieves competitive performance improvements in most cases.","PeriodicalId":13426,"journal":{"name":"IEEE Transactions on Pattern Analysis and Machine Intelligence","volume":"1 1","pages":""},"PeriodicalIF":18.6000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clustering Diffusion Model with Frequency-Signal Modulation for Variational Graph Autoencoders.\",\"authors\":\"Junwei Cheng,Ke Liang,Pengxing Feng,Weixiong Liu,Yong Tang,Chaobo He\",\"doi\":\"10.1109/tpami.2025.3614385\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Variational autoencoders (VAEs) have been widely used for node clustering, with existing methods mainly focusing on enhancing the expressiveness of their latent space. Recently, the integration of diffusion models with VAEs has provided new opportunities to achieve this objective. However, the mechanism by which the diffusion model improves performance remains unclear. To bridge this gap, we conduct an empirical analysis from the perspective of graph spectral theory, revealing that the signal modulation induced by diffusion models closely aligns with the low-frequency spectral characteristics of VAEs, which in turn explains their effectiveness. Nevertheless, further experiments highlight that diffusion models exhibit limitations in modulating high-frequency signals, which diverge from the spectral characteristics of VAEs. Moreover, existing diffusion methods fail to enable the latent space to adequately capture and reflect cluster-specific characteristics. To address these challenges, we propose a novel plug-and-play method, FVD, to improve the performance of VAE-based methods in node clustering tasks. Specifically, we incorporate the graph wavelet transform as a secondary signal modulator, enabling independent adjustments of specific frequency bands to better align with the spectral characteristics of VAEs. Additionally, we introduce the Student's t-distribution as a conditional constraint in the reverse process of FVD, deriving a more compact variational lower bound. This enhancement preserves fine-grained node information while focusing on clustering details, effectively mitigating the cluster collapse phenomenon. Comprehensive experimental results demonstrate that integrating FVD with existing methods achieves competitive performance improvements in most cases.\",\"PeriodicalId\":13426,\"journal\":{\"name\":\"IEEE Transactions on Pattern Analysis and Machine Intelligence\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":18.6000,\"publicationDate\":\"2025-09-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Pattern Analysis and Machine Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/tpami.2025.3614385\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Pattern Analysis and Machine Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/tpami.2025.3614385","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Clustering Diffusion Model with Frequency-Signal Modulation for Variational Graph Autoencoders.
Variational autoencoders (VAEs) have been widely used for node clustering, with existing methods mainly focusing on enhancing the expressiveness of their latent space. Recently, the integration of diffusion models with VAEs has provided new opportunities to achieve this objective. However, the mechanism by which the diffusion model improves performance remains unclear. To bridge this gap, we conduct an empirical analysis from the perspective of graph spectral theory, revealing that the signal modulation induced by diffusion models closely aligns with the low-frequency spectral characteristics of VAEs, which in turn explains their effectiveness. Nevertheless, further experiments highlight that diffusion models exhibit limitations in modulating high-frequency signals, which diverge from the spectral characteristics of VAEs. Moreover, existing diffusion methods fail to enable the latent space to adequately capture and reflect cluster-specific characteristics. To address these challenges, we propose a novel plug-and-play method, FVD, to improve the performance of VAE-based methods in node clustering tasks. Specifically, we incorporate the graph wavelet transform as a secondary signal modulator, enabling independent adjustments of specific frequency bands to better align with the spectral characteristics of VAEs. Additionally, we introduce the Student's t-distribution as a conditional constraint in the reverse process of FVD, deriving a more compact variational lower bound. This enhancement preserves fine-grained node information while focusing on clustering details, effectively mitigating the cluster collapse phenomenon. Comprehensive experimental results demonstrate that integrating FVD with existing methods achieves competitive performance improvements in most cases.
期刊介绍:
The IEEE Transactions on Pattern Analysis and Machine Intelligence publishes articles on all traditional areas of computer vision and image understanding, all traditional areas of pattern analysis and recognition, and selected areas of machine intelligence, with a particular emphasis on machine learning for pattern analysis. Areas such as techniques for visual search, document and handwriting analysis, medical image analysis, video and image sequence analysis, content-based retrieval of image and video, face and gesture recognition and relevant specialized hardware and/or software architectures are also covered.