CommunityGCN: community detection using node classification with graph convolution network

IF 1.7 4区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS

Data Technologies and Applications Pub Date : 2023-02-06 DOI:10.1108/dta-02-2022-0056

Riju Bhattacharya, N. K. Nagwani, Sarsij Tripathi

{"title":"CommunityGCN: community detection using node classification with graph convolution network","authors":"Riju Bhattacharya, N. K. Nagwani, Sarsij Tripathi","doi":"10.1108/dta-02-2022-0056","DOIUrl":null,"url":null,"abstract":"PurposeA community demonstrates the unique qualities and relationships between its members that distinguish it from other communities within a network. Network analysis relies heavily on community detection. Despite the traditional spectral clustering and statistical inference methods, deep learning techniques for community detection have grown in popularity due to their ease of processing high-dimensional network data. Graph convolutional neural networks (GCNNs) have received much attention recently and have developed into a potential and ubiquitous method for directly detecting communities on graphs. Inspired by the promising results of graph convolutional networks (GCNs) in analyzing graph structure data, a novel community graph convolutional network (CommunityGCN) as a semi-supervised node classification model has been proposed and compared with recent baseline methods graph attention network (GAT), GCN-based technique for unsupervised community detection and Markov random fields combined with graph convolutional network (MRFasGCN).Design/methodology/approachThis work presents the method for identifying communities that combines the notion of node classification via message passing with the architecture of a semi-supervised graph neural network. Six benchmark datasets, namely, Cora, CiteSeer, ACM, Karate, IMDB and Facebook, have been used in the experimentation.FindingsIn the first set of experiments, the scaled normalized average matrix of all neighbor's features including the node itself was obtained, followed by obtaining the weighted average matrix of low-dimensional nodes. In the second set of experiments, the average weighted matrix was forwarded to the GCN with two layers and the activation function for predicting the node class was applied. The results demonstrate that node classification with GCN can improve the performance of identifying communities on graph datasets.Originality/valueThe experiment reveals that the CommunityGCN approach has given better results with accuracy, normalized mutual information, F1 and modularity scores of 91.26, 79.9, 92.58 and 70.5 per cent, respectively, for detecting communities in the graph network, which is much greater than the range of 55.7–87.07 per cent reported in previous literature. Thus, it has been concluded that the GCN with node classification models has improved the accuracy.","PeriodicalId":56156,"journal":{"name":"Data Technologies and Applications","volume":" ","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data Technologies and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1108/dta-02-2022-0056","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

PurposeA community demonstrates the unique qualities and relationships between its members that distinguish it from other communities within a network. Network analysis relies heavily on community detection. Despite the traditional spectral clustering and statistical inference methods, deep learning techniques for community detection have grown in popularity due to their ease of processing high-dimensional network data. Graph convolutional neural networks (GCNNs) have received much attention recently and have developed into a potential and ubiquitous method for directly detecting communities on graphs. Inspired by the promising results of graph convolutional networks (GCNs) in analyzing graph structure data, a novel community graph convolutional network (CommunityGCN) as a semi-supervised node classification model has been proposed and compared with recent baseline methods graph attention network (GAT), GCN-based technique for unsupervised community detection and Markov random fields combined with graph convolutional network (MRFasGCN).Design/methodology/approachThis work presents the method for identifying communities that combines the notion of node classification via message passing with the architecture of a semi-supervised graph neural network. Six benchmark datasets, namely, Cora, CiteSeer, ACM, Karate, IMDB and Facebook, have been used in the experimentation.FindingsIn the first set of experiments, the scaled normalized average matrix of all neighbor's features including the node itself was obtained, followed by obtaining the weighted average matrix of low-dimensional nodes. In the second set of experiments, the average weighted matrix was forwarded to the GCN with two layers and the activation function for predicting the node class was applied. The results demonstrate that node classification with GCN can improve the performance of identifying communities on graph datasets.Originality/valueThe experiment reveals that the CommunityGCN approach has given better results with accuracy, normalized mutual information, F1 and modularity scores of 91.26, 79.9, 92.58 and 70.5 per cent, respectively, for detecting communities in the graph network, which is much greater than the range of 55.7–87.07 per cent reported in previous literature. Thus, it has been concluded that the GCN with node classification models has improved the accuracy.

查看原文本刊更多论文

CommunityGCN:基于图卷积网络的节点分类社区检测

一个社区展示了其成员之间的独特品质和关系，使其区别于网络中的其他社区。网络分析在很大程度上依赖于社区检测。与传统的光谱聚类和统计推断方法不同，深度学习技术由于易于处理高维网络数据而越来越受欢迎。图卷积神经网络(GCNNs)近年来受到广泛关注，已发展成为一种有潜力且普遍存在的直接检测图上社区的方法。受图卷积网络(GCNs)在图结构数据分析方面的良好结果的启发，提出了一种新的社区图卷积网络(CommunityGCN)作为半监督节点分类模型，并与现有的基线方法图注意网络(GAT)、基于gcn的无监督社区检测技术和马尔可夫随机场结合图卷积网络(MRFasGCN)进行了比较。设计/方法/方法本工作提出了一种识别社区的方法，该方法结合了通过消息传递的节点分类概念和半监督图神经网络的体系结构。实验中使用了6个基准数据集，分别是Cora、CiteSeer、ACM、Karate、IMDB和Facebook。在第一组实验中，首先得到包括节点本身在内的所有邻居特征的缩放归一化平均矩阵，然后得到低维节点的加权平均矩阵。在第二组实验中，将平均加权矩阵转发给两层GCN，并应用预测节点类别的激活函数。结果表明，使用GCN进行节点分类可以提高图数据集上社区识别的性能。实验表明，CommunityGCN方法在图网络社区检测方面取得了较好的结果，准确率、归一化互信息、F1和模块化得分分别为91.26%、79.9%、92.58和70.5%，远远大于以往文献报道的55.7% - 87.07%。由此可见，采用节点分类模型的GCN提高了分类准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Data Technologies and Applications Social Sciences-Library and Information Sciences

CiteScore

3.80

自引率

6.20%

发文量

期刊介绍： Previously published as: Program Online from: 2018 Subject Area: Information & Knowledge Management, Library Studies