图相似度搜索的研究

Q4 Computer Science

Journal of Information Processing Pub Date : 2012-07-25 DOI:10.11185/IMT.7.1565

Haichuan Shang, M. Kitsuregawa

{"title":"图相似度搜索的研究","authors":"Haichuan Shang, M. Kitsuregawa","doi":"10.11185/IMT.7.1565","DOIUrl":null,"url":null,"abstract":"Graph similarity search is to retrieve graphs that approximately contain a given query graph. It has many applications, e.g., detecting similar functions among chemical compounds. The problem is challenging as even testing subgraph containment between two graphs is NP-complete. Hence, existing techniques adopt the filtering-and-verification framework with the focus on developing effective and efficient techniques to remove non-promising graphs. Nevertheless, existing filtering techniques may be still unable to effectively remove many ”low” quality candidates. To resolve this, in this paper we propose a novel indexing technique to index graphs according to their ”distances” to features. We then develop lower and upper bounding techniques that exploit the index to (1) prune non-promising graphs and (2) include graphs whose similarities are guaranteed to exceed the given similarity threshold. Considering that the verification phase is not well studied and plays the dominant role in the whole process, we devise efficient algorithms to verify candidates. A comprehensive experiment using real datasets demonstrates that our proposed methods significantly outperform existing methods.","PeriodicalId":16243,"journal":{"name":"Journal of Information Processing","volume":"7 1","pages":"1565-1570"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Study on Graph Similarity Search\",\"authors\":\"Haichuan Shang, M. Kitsuregawa\",\"doi\":\"10.11185/IMT.7.1565\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph similarity search is to retrieve graphs that approximately contain a given query graph. It has many applications, e.g., detecting similar functions among chemical compounds. The problem is challenging as even testing subgraph containment between two graphs is NP-complete. Hence, existing techniques adopt the filtering-and-verification framework with the focus on developing effective and efficient techniques to remove non-promising graphs. Nevertheless, existing filtering techniques may be still unable to effectively remove many ”low” quality candidates. To resolve this, in this paper we propose a novel indexing technique to index graphs according to their ”distances” to features. We then develop lower and upper bounding techniques that exploit the index to (1) prune non-promising graphs and (2) include graphs whose similarities are guaranteed to exceed the given similarity threshold. Considering that the verification phase is not well studied and plays the dominant role in the whole process, we devise efficient algorithms to verify candidates. A comprehensive experiment using real datasets demonstrates that our proposed methods significantly outperform existing methods.\",\"PeriodicalId\":16243,\"journal\":{\"name\":\"Journal of Information Processing\",\"volume\":\"7 1\",\"pages\":\"1565-1570\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Information Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.11185/IMT.7.1565\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Information Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11185/IMT.7.1565","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

图相似度搜索是检索近似包含给定查询图的图。它有许多应用，例如，检测化合物之间的相似功能。这个问题具有挑战性，因为即使测试两个图之间的子图包含也是np完全的。因此，现有的技术采用过滤和验证框架，重点是开发有效和高效的技术来去除无希望的图。然而，现有的过滤技术可能仍然无法有效地去除许多“低”质量的候选。为了解决这个问题，本文提出了一种新的索引技术，根据图与特征的“距离”对图进行索引。然后，我们开发了下限和上限技术，利用索引来(1)修剪无希望的图和(2)包括相似度保证超过给定相似度阈值的图。考虑到验证阶段在整个过程中占主导地位，我们设计了有效的算法来验证候选对象。使用真实数据集的综合实验表明，我们提出的方法显着优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Study on Graph Similarity Search

Graph similarity search is to retrieve graphs that approximately contain a given query graph. It has many applications, e.g., detecting similar functions among chemical compounds. The problem is challenging as even testing subgraph containment between two graphs is NP-complete. Hence, existing techniques adopt the filtering-and-verification framework with the focus on developing effective and efficient techniques to remove non-promising graphs. Nevertheless, existing filtering techniques may be still unable to effectively remove many ”low” quality candidates. To resolve this, in this paper we propose a novel indexing technique to index graphs according to their ”distances” to features. We then develop lower and upper bounding techniques that exploit the index to (1) prune non-promising graphs and (2) include graphs whose similarities are guaranteed to exceed the given similarity threshold. Considering that the verification phase is not well studied and plays the dominant role in the whole process, we devise efficient algorithms to verify candidates. A comprehensive experiment using real datasets demonstrates that our proposed methods significantly outperform existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Information Processing Computer Science-Computer Science (all)

CiteScore

1.20

自引率

0.00%

发文量