An Evaluation of Large-scale Information Network Embedding based on Latent Space Model Generating Links

Shotaro Kawasaki, Ryosuke Motegi, Shogo Matsuno, Yoichi Seki
Proceedings of the 2022 5th International Conference on Software Engineering and Information Management, 2022-01-21
DOI: 10.1145/3520084.3520111
Graph representation learning encodes vertices as low-dimensional vectors that summarize their position in the graph and the structure of their local neighborhood. These methods yield useful continuous-space representations of large relational datasets. However, because no ground-truth embedding exists for real graph data, such algorithms are usually evaluated indirectly, through the accuracy of downstream classification tasks that use the learned representations. This study therefore proposes a method for evaluating graph representation learning algorithms directly: objects are first distributed in a latent space, so that the correct learning result is known in advance, and relational graph data are then generated probabilistically from those latent-space distributions. Using this method, we evaluated LINE (Large-scale Information Network Embedding), one of the most popular graph representation learning algorithms. LINE comprises two algorithms that optimize two objective functions, defined by first-order proximity and second-order proximity, respectively. We prepared two link-generating models matched to these two objective functions and showed that the corresponding LINE algorithm performed well on the link data generated by each model.
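The abstract does not give the exact form of the link-generating models, so the sketch below assumes one common latent-space formulation: vertices receive Gaussian latent positions, and each undirected link appears independently with a logistic probability that decays with latent distance. The evaluation step uses LINE's first-order proximity objective, the negative log-likelihood -Σ log σ(u_i·u_j) over observed edges (negative sampling omitted for brevity). Function names and parameters here are illustrative, not the authors' actual code.

```python
import numpy as np


def generate_latent_space_graph(n=100, dim=2, alpha=1.0, seed=0):
    """Assumed link-generating model: Gaussian latent positions, and an
    edge (i, j) drawn with probability sigmoid(alpha - dist(z_i, z_j)),
    so that nearby objects are more likely to be linked."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=(n, dim))                                # latent positions
    d = np.linalg.norm(z[:, None, :] - z[None, :, :], axis=-1)   # pairwise distances
    p = 1.0 / (1.0 + np.exp(d - alpha))                          # link probabilities
    upper = np.triu(rng.random((n, n)) < p, k=1)                 # sample each pair once
    edges = list(zip(*np.nonzero(upper)))                        # (i, j) with i < j
    return z, edges


def line_first_order_loss(emb, edges):
    """LINE's first-order proximity objective (sketch): the negative
    log-likelihood of observed edges under p(i, j) = sigmoid(u_i . u_j)."""
    u = emb[[i for i, j in edges]]
    v = emb[[j for i, j in edges]]
    scores = (u * v).sum(axis=1)
    return float(-np.log(1.0 / (1.0 + np.exp(-scores))).sum())
```

With a generator like this, the "correct answer" (the latent positions `z`) is known before any embedding is learned, so a learned embedding can be scored against it rather than only through downstream classification accuracy.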