Graph similarity learning for cross-level interactions

IF 7.4 1区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin
{"title":"Graph similarity learning for cross-level interactions","authors":"Cuifang Zou,&nbsp;Guangquan Lu,&nbsp;Longqing Du,&nbsp;Xuxia Zeng,&nbsp;Shilong Lin","doi":"10.1016/j.ipm.2024.103932","DOIUrl":null,"url":null,"abstract":"<div><div>Graph similarity computation is crucial in fields such as bioinformatics, e.g., identifying compounds with similar biological activities by comparing molecular structural similarities. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limit their practical applications. Existing deep learning methods make it difficult to extract graph features, which affects computational accuracy comprehensively. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and improving graph similarity computation. Using the attention mechanism, CLSim first aligns graph pair features to the shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is considered when extracting graph-level features to handle more complex data. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture detailed node-graph interaction details. Experimental results on three datasets show that CLSim has excellent generalization capabilities and achieves lower error rates compared to the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, providing a more efficient and accurate solutions for graph similarity tasks.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103932"},"PeriodicalIF":7.4000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324002917","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Graph similarity computation is crucial in fields such as bioinformatics, e.g., identifying compounds with similar biological activities by comparing molecular structural similarities. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limit their practical applications. Existing deep learning methods make it difficult to extract graph features, which affects computational accuracy comprehensively. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and improving graph similarity computation. Using the attention mechanism, CLSim first aligns graph pair features to the shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is considered when extracting graph-level features to handle more complex data. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture detailed node-graph interaction details. Experimental results on three datasets show that CLSim has excellent generalization capabilities and achieves lower error rates compared to the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, providing a more efficient and accurate solutions for graph similarity tasks.
跨级别交互的图相似性学习
图相似性计算在生物信息学等领域至关重要,例如,通过比较分子结构相似性来识别具有相似生物活性的化合物。图编辑距离(GED)和最大公共子图等传统方法存在计算复杂度高、对噪声敏感等问题,限制了它们的实际应用。现有的深度学习方法难以提取图特征,全面影响了计算精度。针对这些问题,我们提出了一种新方法--CLSim,它通过加强特征提取和改进图相似性计算来提高性能。利用注意力机制,CLSim 首先将图对特征对齐到共享空间,并将节点特征聚合为全局嵌入。在提取图层特征时,会考虑嵌入向量的方向性,以处理更复杂的数据。此外,我们还开发了跨层特征提取技术,将节点级信息与图级嵌入相结合,以捕捉详细的节点-图交互细节。在三个数据集上的实验结果表明,与 GED 方法和图神经网络基线相比,CLSim 具有出色的泛化能力和更低的错误率。在最坏的情况下,其时间复杂度仍为二次方。示例查询结果进一步验证了该模型的有效性,为图形相似性任务提供了更高效、更准确的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Information Processing & Management
Information Processing & Management 工程技术-计算机:信息系统
CiteScore
17.00
自引率
11.60%
发文量
276
审稿时长
39 days
期刊介绍: Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing. We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信