Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin
{"title":"Graph similarity learning for cross-level interactions","authors":"Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin","doi":"10.1016/j.ipm.2024.103932","DOIUrl":null,"url":null,"abstract":"<div><div>Graph similarity computation is crucial in fields such as bioinformatics, e.g., identifying compounds with similar biological activities by comparing molecular structural similarities. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limit their practical applications. Existing deep learning methods make it difficult to extract graph features, which affects computational accuracy comprehensively. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and improving graph similarity computation. Using the attention mechanism, CLSim first aligns graph pair features to the shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is considered when extracting graph-level features to handle more complex data. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture detailed node-graph interaction details. Experimental results on three datasets show that CLSim has excellent generalization capabilities and achieves lower error rates compared to the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, providing a more efficient and accurate solutions for graph similarity tasks.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":null,"pages":null},"PeriodicalIF":7.4000,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457324002917","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Graph similarity computation is crucial in fields such as bioinformatics, e.g., identifying compounds with similar biological activities by comparing molecular structural similarities. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limit their practical applications. Existing deep learning methods make it difficult to extract graph features, which affects computational accuracy comprehensively. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and improving graph similarity computation. Using the attention mechanism, CLSim first aligns graph pair features to the shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is considered when extracting graph-level features to handle more complex data. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture detailed node-graph interaction details. Experimental results on three datasets show that CLSim has excellent generalization capabilities and achieves lower error rates compared to the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, providing a more efficient and accurate solutions for graph similarity tasks.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.