Graph similarity learning for cross-level interactions

IF 7.4, Zone 1 (Management), JCR Q1: COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Processing & Management Pub Date : 2024-10-23 DOI:10.1016/j.ipm.2024.103932

Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin

Volume 62, Issue 1, Article 103932. Full text: https://www.sciencedirect.com/science/article/pii/S0306457324002917

Citation count: 0

Abstract

Graph similarity computation is crucial in fields such as bioinformatics, where, for example, compounds with similar biological activities are identified by comparing their molecular structures. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limits their practical application. Existing deep learning methods struggle to extract graph features comprehensively, which reduces their accuracy. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and refining the graph similarity computation. Using an attention mechanism, CLSim first aligns the features of a graph pair in a shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is taken into account when extracting graph-level features, so that more complex data can be handled. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture fine-grained node-graph interactions. Experimental results on three datasets show that CLSim generalizes well and achieves lower error rates than the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, which provides a more efficient and accurate solution for graph similarity tasks.
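The three ideas named in the abstract (attention-based aggregation of node features into a graph embedding, direction-aware comparison of embeddings, and node-to-graph interaction) can be illustrated with a minimal Python sketch. This is not the CLSim implementation, which the listing does not describe in detail. The function names, the untrained sigmoid attention weights, the use of cosine similarity to stand in for "directionality", and the equal weighting of the two scores are all assumptions made for illustration.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    """Cosine similarity: compares the direction of two vectors, not their length."""
    nu, nv = math.sqrt(dot(u, u)), math.sqrt(dot(v, v))
    if nu == 0.0 or nv == 0.0:
        return 0.0
    return dot(u, v) / (nu * nv)


def attention_pool(nodes):
    """Aggregate node embeddings into one graph-level embedding.

    Assumption: each node is weighted by a sigmoid of its dot product with
    the mean node embedding. A trained model would learn these weights.
    """
    n, d = len(nodes), len(nodes[0])
    context = [sum(h[k] for h in nodes) / n for k in range(d)]
    weights = [1.0 / (1.0 + math.exp(-dot(h, context))) for h in nodes]
    return [sum(w * h[k] for w, h in zip(weights, nodes)) for k in range(d)]


def cross_level_scores(nodes, graph_emb):
    """Node-to-graph interaction: compare every node of one graph with the
    graph-level embedding of the other graph."""
    return [cosine(h, graph_emb) for h in nodes]


def similarity(nodes_a, nodes_b):
    """Combine a graph-level score with the averaged cross-level scores.

    Assumption: both parts are weighted equally; the result lies in [-1, 1].
    """
    g_a, g_b = attention_pool(nodes_a), attention_pool(nodes_b)
    graph_level = cosine(g_a, g_b)
    a_to_b = cross_level_scores(nodes_a, g_b)
    b_to_a = cross_level_scores(nodes_b, g_a)
    cross = (sum(a_to_b) / len(a_to_b) + sum(b_to_a) / len(b_to_a)) / 2
    return 0.5 * (graph_level + cross)


if __name__ == "__main__":
    # Hypothetical 2-dimensional node embeddings for two small graphs.
    graph_a = [[1.0, 0.0], [0.8, 0.2], [0.9, 0.1]]
    graph_b = [[0.0, 1.0], [0.1, 0.9]]
    print(similarity(graph_a, graph_a))
    print(similarity(graph_a, graph_b))
```

In this sketch the node embeddings are given as input; in a learned model they would come from a graph neural network encoder, and the combined score would be trained against a target such as normalized GED.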


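Source journal

Information Processing & Management (Engineering and Technology, Computer Science: Information Systems)

CiteScore: 17.00

Self-citation rate: 11.60%

Articles published: 276

Review time: 39 days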

Journal description: Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.