Enhancing Community Detection for Big Sensor Data Clustering via Hyperbolic Network Embedding

V. Karyotis, Konstantinos Tsitseklis, Konstantinos Sotiropoulos, S. Papavassiliou
{"title":"Enhancing Community Detection for Big Sensor Data Clustering via Hyperbolic Network Embedding","authors":"V. Karyotis, Konstantinos Tsitseklis, Konstantinos Sotiropoulos, S. Papavassiliou","doi":"10.1109/PERCOMW.2018.8480134","DOIUrl":null,"url":null,"abstract":"In this paper we present a novel big data clustering approach for measurements obtained from pervasive sensor networks. To address the potential very large scale of such datasets, we map the problem of data clustering to a community detection one. Datasets are cast in the form of graphs, representing the relations among individual observations and data clustering is mapped to node clustering (community detection) in the data graph. We propose a novel computational approach for enhancing the traditional Girvan-Newman (GN) community detection algorithm via hyperbolic network embedding. The data dependency graph is embedded in the hyperbolic space via Rigel embedding, making it possible to compute more efficiently the hyperbolic edge-betweenness centrality (HEBC) needed in the modified GN algorithm. This allows for more efficient clustering of the nodes of the data graph without significantly sacrificing accuracy. We demonstrate the efficacy of our approach with artificial network and data topologies, and real benchmark datasets. The proposed methodology can be used for efficient clustering of datasets obtained from massive pervasive smart city/building sensor networks, such as the FIESTA-IoT platform, and exploited in various applications such as lower-cost sensing.","PeriodicalId":190096,"journal":{"name":"2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PERCOMW.2018.8480134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper we present a novel big data clustering approach for measurements obtained from pervasive sensor networks. To address the potential very large scale of such datasets, we map the problem of data clustering to a community detection one. Datasets are cast in the form of graphs, representing the relations among individual observations and data clustering is mapped to node clustering (community detection) in the data graph. We propose a novel computational approach for enhancing the traditional Girvan-Newman (GN) community detection algorithm via hyperbolic network embedding. The data dependency graph is embedded in the hyperbolic space via Rigel embedding, making it possible to compute more efficiently the hyperbolic edge-betweenness centrality (HEBC) needed in the modified GN algorithm. This allows for more efficient clustering of the nodes of the data graph without significantly sacrificing accuracy. We demonstrate the efficacy of our approach with artificial network and data topologies, and real benchmark datasets. The proposed methodology can be used for efficient clustering of datasets obtained from massive pervasive smart city/building sensor networks, such as the FIESTA-IoT platform, and exploited in various applications such as lower-cost sensing.
利用双曲网络嵌入增强大传感器数据聚类中的社区检测
在本文中,我们提出了一种新的大数据聚类方法,用于从普适传感器网络中获得的测量结果。为了解决此类数据集潜在的超大规模问题,我们将数据聚类问题映射为社区检测问题。数据集以图的形式进行转换,表示单个观测值之间的关系,并将数据聚类映射到数据图中的节点聚类(社区检测)。我们提出了一种新的计算方法,通过双曲网络嵌入来增强传统的Girvan-Newman (GN)社区检测算法。通过Rigel嵌入将数据依赖图嵌入到双曲空间中,使得改进的GN算法可以更有效地计算双曲边间中心性(HEBC)。这允许在不显著牺牲准确性的情况下更有效地对数据图的节点进行聚类。我们用人工网络和数据拓扑以及真实的基准数据集证明了我们的方法的有效性。所提出的方法可用于从大规模无处不在的智能城市/建筑传感器网络(如FIESTA-IoT平台)获得的数据集的高效聚类,并可用于低成本传感等各种应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信