{"title":"The Node-Similarity Distribution of Complex Networks and Its Applications in Link Prediction (Extended Abstract)","authors":"Cunlai Pu, Jie Li, Jian Wang, Tony Q. S. Quek","doi":"10.1109/ICDE55515.2023.00376","DOIUrl":null,"url":null,"abstract":"Node-similarity distributions not only characterize different types of complex networks, but also offer insights in the structural predictability of complex networks, and even facilitate prediction tasks in complex networks. By means of the generating function, we propose a framework to calculate the common neighbor based similarity (CNS) distributions, offering theoretical results of similarity distributions of various complex networks. Furthermore, we apply node-similarity distributions to link prediction, a key task in network analysis. Specifically, by deriving analytical solutions for two metrics: i) precision and ii) area under the receiver operating characteristic curve (AUC), we give theoretical evaluation of link prediction. Also, by analyzing i) the expected prediction accuracy of similarity scores and ii) optimal prediction priority of unconnected node pairs, we optimize link prediction with similarity distributions. Simulation results confirm our findings and also validate the proposed methods for evaluating and optimizing link prediction.","PeriodicalId":434744,"journal":{"name":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 39th International Conference on Data Engineering (ICDE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE55515.2023.00376","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Node-similarity distributions not only characterize different types of complex networks, but also offer insights in the structural predictability of complex networks, and even facilitate prediction tasks in complex networks. By means of the generating function, we propose a framework to calculate the common neighbor based similarity (CNS) distributions, offering theoretical results of similarity distributions of various complex networks. Furthermore, we apply node-similarity distributions to link prediction, a key task in network analysis. Specifically, by deriving analytical solutions for two metrics: i) precision and ii) area under the receiver operating characteristic curve (AUC), we give theoretical evaluation of link prediction. Also, by analyzing i) the expected prediction accuracy of similarity scores and ii) optimal prediction priority of unconnected node pairs, we optimize link prediction with similarity distributions. Simulation results confirm our findings and also validate the proposed methods for evaluating and optimizing link prediction.