Discriminative Graph Autoencoder

Haifeng Jin, Qingquan Song, X. Hu
DOI: 10.1109/ICBK.2018.00033
Published in: 2018 IEEE International Conference on Big Knowledge (ICBK), November 2018
Citations: 3

Abstract

With the abundance of graph-structured data in various applications, graph representation learning has become an effective computational tool for seeking informative vector representations for graphs. Traditional graph kernel approaches are usually frequency-based: each dimension of a learned vector representation for a graph is the frequency of a certain type of substructure. They incur high computational cost for counting the occurrences of predefined substructures. The learned vector representations are very sparse, which prohibits the use of inner products. Moreover, the learned vector representations do not lie in a smooth space, since the values can only be integers. State-of-the-art approaches tackle these challenges by changing kernel functions instead of producing better vector representations. They can only produce kernel matrices for kernel-based methods and are not compatible with methods requiring vector representations. Effectively learning smooth vector representations for graphs of various structures and sizes remains a challenging task. Motivated by recent advances in deep autoencoders, in this paper we explore the capability of autoencoders in learning representations for graphs. Unlike videos or images, graphs usually vary in size and are not readily consumable by an autoencoder. Therefore, a novel framework, namely the discriminative graph autoencoder (DGA), is proposed to learn low-dimensional vector representations for graphs. The algorithm decomposes large graphs into small subgraphs, from which the structural information is sampled. DGA efficiently produces smooth and informative vector representations of graphs while preserving the discriminative information according to their labels. Extensive experiments have been conducted to evaluate DGA. The experimental results demonstrate the efficiency and effectiveness of DGA compared with traditional and state-of-the-art approaches on various real-world datasets and applications, e.g., classification and visualization.
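The abstract's key idea, that variable-size graphs can be decomposed into small subgraphs whose structural information is sampled into fixed-length vectors suitable for a standard autoencoder, can be illustrated with a minimal sketch. The ego-subgraph extraction, padding scheme, and the tiny linear autoencoder below are hypothetical choices for illustration only; the abstract does not specify the authors' exact sampling or network architecture.

```python
# Hypothetical sketch: turn a graph of arbitrary size into fixed-length
# subgraph vectors, then compress them with a tiny linear autoencoder.
# All names and design details are illustrative, not the paper's method.
import numpy as np

def ego_subgraph_vectors(adj, k=3):
    """For each node, take up to k of its neighbors and return the
    flattened (k+1)x(k+1) adjacency sub-block as one sample."""
    samples = []
    for v in range(adj.shape[0]):
        nbrs = np.flatnonzero(adj[v])[:k]
        idx = np.concatenate(([v], nbrs))
        # pad with the center node so every sample has the same length
        idx = np.pad(idx, (0, k + 1 - len(idx)), constant_values=v)
        samples.append(adj[np.ix_(idx, idx)].ravel().astype(float))
    return np.stack(samples)

# toy graph: a 4-cycle
adj = np.array([[0, 1, 0, 1],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [1, 0, 1, 0]])
X = ego_subgraph_vectors(adj, k=3)  # shape (4, 16): fixed-length inputs

# tiny linear autoencoder trained by plain gradient descent
rng = np.random.default_rng(0)
d, h = X.shape[1], 4
W_enc = rng.normal(scale=0.1, size=(d, h))
W_dec = rng.normal(scale=0.1, size=(h, d))

def loss(We, Wd):
    return float(np.mean((X @ We @ Wd - X) ** 2))

loss_before = loss(W_enc, W_dec)
for _ in range(200):
    Z = X @ W_enc                     # low-dimensional codes
    G = (Z @ W_dec - X) / len(X)      # reconstruction-error gradient
    g_enc = X.T @ (G @ W_dec.T)
    g_dec = Z.T @ G
    W_enc -= 0.1 * g_enc
    W_dec -= 0.1 * g_dec
loss_after = loss(W_enc, W_dec)
```

The point of the sketch is only the pipeline shape: once every subgraph is a fixed-length vector, any off-the-shelf autoencoder can produce smooth, low-dimensional codes; the paper's DGA additionally uses label information to keep those codes discriminative, which this unsupervised sketch omits.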