End-to-End Learning of Graph Similarity

Zhixin Chen, Mengxiang Lin, Deqing Wang
2019 International Conference on High Performance Computing & Simulation (HPCS), published 2019-07-01. DOI: 10.1109/HPCS48598.2019.9188094 (https://doi.org/10.1109/HPCS48598.2019.9188094)

Abstract

Computing graph comparison metrics exactly can be expensive: the time complexity is prohibitively high, and exponential in some cases. A learned model that approximates these metrics is therefore desirable. In this paper, we cast the computation of graph similarity/distance as a learning problem and propose an end-to-end model based on a GCN (Graph Convolutional Network) to calculate the GFD (Graphlet Frequency Distribution) distance between graphs. The trained model predicts the GFD distance directly, rather than constructing a GFD vector by counting graphlets as traditional methods do. An experimental evaluation validates the effectiveness of our model on real-world networks ranging from tens to thousands of nodes. On this dataset, our trained model takes $480\times$ less time on average than the count-based method. On the test data, the top-3 nearest accuracy reaches 74.6% and the top-5 nearest accuracy reaches 85.2%.
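To make the count-based baseline concrete, the following is a minimal sketch of computing a graphlet-frequency distance by explicit counting. It is not the authors' implementation. It assumes only the two connected 3-node graphlets (open path and triangle) and an L1 distance between normalized frequency vectors, whereas the paper's graphlet set and distance formula may differ. The function names `build_adjacency`, `graphlet_counts`, `gfd_vector`, and `gfd_distance` are hypothetical.

```python
from itertools import combinations


def build_adjacency(edges):
    """Return an undirected adjacency map {node: set(neighbors)}, skipping self-loops."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def graphlet_counts(edges):
    """Count induced 3-node graphlets: [open paths, triangles].

    Brute force over all node triples, O(n^3). This cost is what makes
    exact counting impractical on large graphs.
    """
    adj = build_adjacency(edges)
    paths = 0
    triangles = 0
    for a, b, c in combinations(list(adj), 3):
        # Number of edges present among the three nodes.
        k = (b in adj[a]) + (c in adj[a]) + (c in adj[b])
        if k == 2:
            paths += 1
        elif k == 3:
            triangles += 1
    return [paths, triangles]


def gfd_vector(edges):
    """Normalize graphlet counts into a frequency distribution (assumed form)."""
    counts = graphlet_counts(edges)
    total = sum(counts)
    if total == 0:
        return [0.0 for _ in counts]
    return [c / total for c in counts]


def gfd_distance(edges_g, edges_h):
    """L1 distance between two frequency vectors (an assumption, not the paper's formula)."""
    f_g = gfd_vector(edges_g)
    f_h = gfd_vector(edges_h)
    return sum(abs(x - y) for x, y in zip(f_g, f_h))


if __name__ == "__main__":
    triangle = [(0, 1), (1, 2), (0, 2)]
    path = [(0, 1), (1, 2)]
    print(gfd_distance(triangle, path))
```

A learned model of the kind the abstract describes would replace the `graphlet_counts` step entirely, taking the two graphs as input and predicting the distance directly.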