{"title":"Understanding digitally enabled complex networks: a plural granulation based hybrid community detection approach","authors":"Samrat Gupta, Swanand J. Deodhar","doi":"10.1108/ITP-10-2020-0682","DOIUrl":null,"url":null,"abstract":"PurposeCommunities representing groups of agents with similar interests or functions are one of the essential features of complex networks. Finding communities in real-world networks is critical for analyzing complex systems in various areas ranging from collaborative information to political systems. Given the different characteristics of networks and the capability of community detection in handling a plethora of societal problems, community detection methods represent an emerging area of research. Contributing to this field, the authors propose a new community detection algorithm based on the hybridization of node and link granulation.Design/methodology/approachThe proposed algorithm utilizes a rough set-theoretic concept called closure on networks. Initial sets are constructed by using neighborhood topology around the nodes as well as links and represented as two different categories of granules. Subsequently, the authors iteratively obtain the constrained closure of these sets. The authors use node mutuality and link mutuality as merging criteria for node and link granules, respectively, during the iterations. Finally, the constrained closure subsets of nodes and links are combined and refined using the Jaccard similarity coefficient and a local density function to obtain communities in a binary network.FindingsExtensive experiments conducted on twelve real-world networks followed by a comparison with state-of-the-art methods demonstrate the viability and effectiveness of the proposed algorithm.Research limitations/implicationsThe study also contributes to the ongoing effort related to the application of soft computing techniques to model complex systems. The extant literature has integrated a rough set-theoretic approach with a fuzzy granular model (Kundu and Pal, 2015) and spectral clustering (Huang and Xiao, 2012) for node-centric community detection in complex networks. In contributing to this stream of work, the proposed algorithm leverages the unexplored synergy between rough set theory, node granulation and link granulation in the context of complex networks. Combined with experiments of network datasets from various domains, the results indicate that the proposed algorithm can effectively reveal co-occurring disjoint, overlapping and nested communities without necessarily assigning each node to a community.Practical implicationsThis study carries important practical implications for complex adaptive systems in business and management sciences, in which entities are increasingly getting organized into communities (Jacucci et al., 2006). The proposed community detection method can be used for network-based fraud detection by enabling experts to understand the formation and development of fraudulent setups with an active exchange of information and resources between the firms (Van Vlasselaer et al., 2017). Products and services are getting connected and mapped in every walk of life due to the emergence of a variety of interconnected devices, social networks and software applications.Social implicationsThe proposed algorithm could be extended for community detection on customer trajectory patterns and design recommendation systems for online products and services (Ghose et al., 2019; Liu and Wang, 2017). In line with prior research, the proposed algorithm can aid companies in investigating the characteristics of implicit communities of bloggers or social media users for their services and products so as to identify peer influencers and conduct targeted marketing (Chau and Xu, 2012; De Matos et al., 2014; Zhang et al., 2016). The proposed algorithm can be used to understand the behavior of each group and the appropriate communication strategy for that group. For instance, a group using a specific language or following a specific account might benefit more from a particular piece of content than another group. The proposed algorithm can thus help in exploring the factors defining communities and confronting many real-life challenges.Originality/valueThis work is based on a theoretical argument that communities in networks are not only based on compatibility among nodes but also on the compatibility among links. Building up on the aforementioned argument, the authors propose a community detection method that considers the relationship among both the entities in a network (nodes and links) as opposed to traditional methods, which are predominantly based on relationships among nodes only.","PeriodicalId":47740,"journal":{"name":"Information Technology & People","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2021-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Technology & People","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1108/ITP-10-2020-0682","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 6
Abstract
PurposeCommunities representing groups of agents with similar interests or functions are one of the essential features of complex networks. Finding communities in real-world networks is critical for analyzing complex systems in various areas ranging from collaborative information to political systems. Given the different characteristics of networks and the capability of community detection in handling a plethora of societal problems, community detection methods represent an emerging area of research. Contributing to this field, the authors propose a new community detection algorithm based on the hybridization of node and link granulation.Design/methodology/approachThe proposed algorithm utilizes a rough set-theoretic concept called closure on networks. Initial sets are constructed by using neighborhood topology around the nodes as well as links and represented as two different categories of granules. Subsequently, the authors iteratively obtain the constrained closure of these sets. The authors use node mutuality and link mutuality as merging criteria for node and link granules, respectively, during the iterations. Finally, the constrained closure subsets of nodes and links are combined and refined using the Jaccard similarity coefficient and a local density function to obtain communities in a binary network.FindingsExtensive experiments conducted on twelve real-world networks followed by a comparison with state-of-the-art methods demonstrate the viability and effectiveness of the proposed algorithm.Research limitations/implicationsThe study also contributes to the ongoing effort related to the application of soft computing techniques to model complex systems. The extant literature has integrated a rough set-theoretic approach with a fuzzy granular model (Kundu and Pal, 2015) and spectral clustering (Huang and Xiao, 2012) for node-centric community detection in complex networks. In contributing to this stream of work, the proposed algorithm leverages the unexplored synergy between rough set theory, node granulation and link granulation in the context of complex networks. Combined with experiments of network datasets from various domains, the results indicate that the proposed algorithm can effectively reveal co-occurring disjoint, overlapping and nested communities without necessarily assigning each node to a community.Practical implicationsThis study carries important practical implications for complex adaptive systems in business and management sciences, in which entities are increasingly getting organized into communities (Jacucci et al., 2006). The proposed community detection method can be used for network-based fraud detection by enabling experts to understand the formation and development of fraudulent setups with an active exchange of information and resources between the firms (Van Vlasselaer et al., 2017). Products and services are getting connected and mapped in every walk of life due to the emergence of a variety of interconnected devices, social networks and software applications.Social implicationsThe proposed algorithm could be extended for community detection on customer trajectory patterns and design recommendation systems for online products and services (Ghose et al., 2019; Liu and Wang, 2017). In line with prior research, the proposed algorithm can aid companies in investigating the characteristics of implicit communities of bloggers or social media users for their services and products so as to identify peer influencers and conduct targeted marketing (Chau and Xu, 2012; De Matos et al., 2014; Zhang et al., 2016). The proposed algorithm can be used to understand the behavior of each group and the appropriate communication strategy for that group. For instance, a group using a specific language or following a specific account might benefit more from a particular piece of content than another group. The proposed algorithm can thus help in exploring the factors defining communities and confronting many real-life challenges.Originality/valueThis work is based on a theoretical argument that communities in networks are not only based on compatibility among nodes but also on the compatibility among links. Building up on the aforementioned argument, the authors propose a community detection method that considers the relationship among both the entities in a network (nodes and links) as opposed to traditional methods, which are predominantly based on relationships among nodes only.
具有相似兴趣或功能的智能体组成的群体是复杂网络的基本特征之一。在现实世界的网络中寻找社区对于分析从协作信息到政治系统等各个领域的复杂系统至关重要。鉴于网络的不同特征和社区检测处理大量社会问题的能力,社区检测方法是一个新兴的研究领域。为此,作者提出了一种基于节点和链路粒化杂交的社区检测算法。设计/方法/方法所提出的算法利用了一个称为网络闭包的粗糙集理论概念。初始集通过节点和链路周围的邻域拓扑构造,并表示为两种不同类别的颗粒。随后,作者迭代地得到了这些集合的约束闭包。在迭代过程中,作者分别使用节点互性和链路互性作为节点颗粒和链路颗粒的合并准则。最后,利用Jaccard相似系数和局部密度函数对节点和链路的约束闭包子集进行组合和细化,得到二元网络中的社团。在12个真实世界的网络上进行了大量的实验,然后与最先进的方法进行了比较,证明了所提出算法的可行性和有效性。研究的局限性/意义本研究也有助于将软计算技术应用于复杂系统建模方面的持续努力。现有文献将粗糙集理论方法与模糊颗粒模型(Kundu and Pal, 2015)和光谱聚类(Huang and Xiao, 2012)相结合,用于复杂网络中以节点为中心的社区检测。为了促进这一工作流,所提出的算法利用了复杂网络背景下粗糙集理论、节点粒化和链接粒化之间未被探索的协同作用。结合不同领域网络数据集的实验结果表明,该算法可以有效地揭示共同出现的不相交、重叠和嵌套社区,而不必将每个节点分配给一个社区。实际意义这项研究对商业和管理科学中的复杂适应系统具有重要的实际意义,在这些系统中,实体越来越多地组织成社区(Jacucci et al., 2006)。所提出的社区检测方法可用于基于网络的欺诈检测,使专家能够了解欺诈设置的形成和发展,并在公司之间积极交换信息和资源(Van Vlasselaer等人,2017)。由于各种互联设备、社交网络和软件应用程序的出现,产品和服务正在被连接和映射到生活的各个方面。所提出的算法可以扩展到客户轨迹模式的社区检测和设计在线产品和服务的推荐系统(Ghose等人,2019;Liu and Wang, 2017)。与先前的研究一致,所提出的算法可以帮助公司调查博客或社交媒体用户对其服务和产品的隐性社区特征,从而识别同行影响者并进行有针对性的营销(Chau和Xu, 2012;De Matos et al., 2014;张等人,2016)。该算法可用于理解每个组的行为以及该组的适当通信策略。例如,使用特定语言或关注特定帐户的组可能比其他组从特定内容中获益更多。因此,提出的算法可以帮助探索定义社区的因素,并应对许多现实生活中的挑战。原创性/价值这项工作基于一个理论论点,即网络中的社区不仅基于节点之间的兼容性,还基于链路之间的兼容性。在上述论点的基础上,作者提出了一种社区检测方法,该方法考虑了网络(节点和链接)中实体之间的关系,而不是主要基于节点之间关系的传统方法。
期刊介绍:
Information Technology & People publishes work that is dedicated to understanding the implications of information technology as a tool, resource and format for people in their daily work in organizations. Impact on performance is part of this, since it is essential to the well being of employees and organizations alike. Contributions to the journal include case studies, comparative theory, and quantitative research, as well as inquiries into systems development methods and practice.