{"title":"Top-k Graph Similarity Search Based on Hierarchical Inverted Index","authors":"Zhong-qing Wang, Yan Yang, Y. Zhong","doi":"10.1109/ICIST52614.2021.9440632","DOIUrl":null,"url":null,"abstract":"Graph similarity search is an important research problem in many applications, such as finding result graphs that have a similar structure to a given entity in biochemistry, data mining, and pattern recognition. Top-k graph similarity search is one of graph similarity search tasks, which aims to find the top-k graphs that are most similar to the query graph in a given graph database. In this paper, the top-k similarity search problem based on the graph edit distance is studied according to the corollary of the partitioned similarity theorem. Firstly, in order to speed up the online search process and avoid scanning each graph in the database one by one, an offline hierarchical inverted index is constructed to satisfy top-k search. Secondly, the offline hierarchical inverted index is used to filter the candidate graphs online and verify them, which reduces the time of searching the graphs. Finally, the good performance of the algorithm in running time and scalability is verified by running the similarity algorithm on real dataset and synthetic dataset.","PeriodicalId":371599,"journal":{"name":"2021 11th International Conference on Information Science and Technology (ICIST)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 11th International Conference on Information Science and Technology (ICIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST52614.2021.9440632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Graph similarity search is an important research problem in many applications, such as finding result graphs that have a similar structure to a given entity in biochemistry, data mining, and pattern recognition. Top-k graph similarity search is one of graph similarity search tasks, which aims to find the top-k graphs that are most similar to the query graph in a given graph database. In this paper, the top-k similarity search problem based on the graph edit distance is studied according to the corollary of the partitioned similarity theorem. Firstly, in order to speed up the online search process and avoid scanning each graph in the database one by one, an offline hierarchical inverted index is constructed to satisfy top-k search. Secondly, the offline hierarchical inverted index is used to filter the candidate graphs online and verify them, which reduces the time of searching the graphs. Finally, the good performance of the algorithm in running time and scalability is verified by running the similarity algorithm on real dataset and synthetic dataset.