P. Hajibabaee, Masoud Malekzadeh, Maryam Heidari, Samira Zad, Ozlem Uzuner, James H. Jones
{"title":"An Empirical Study of the GraphSAGE and Word2vec Algorithms for Graph Multiclass Classification","authors":"P. Hajibabaee, Masoud Malekzadeh, Maryam Heidari, Samira Zad, Ozlem Uzuner, James H. Jones","doi":"10.1109/iemcon53756.2021.9623238","DOIUrl":null,"url":null,"abstract":"A fundamental task in machine learning involves node classification. However, when considering the context of large graph data, this problem becomes much more challenging. In this paper, we use the Wikipedia hyperlink dataset to evaluate our semi-supervised node classification model. Given a small set of labeled nodes, we develop a multiclass classifier that utilizes the network structure as well as textual descriptions of nodes to predict the most probable category(label) for each test node in a semi-supervised setting. Our experiment shows promising results for graph multiclass classification using directed graphSAGE and word2vec algorithms together. We also visualize the node embeddings in 2D using the t-SNE method.","PeriodicalId":272590,"journal":{"name":"2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 12th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iemcon53756.2021.9623238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
A fundamental task in machine learning involves node classification. However, when considering the context of large graph data, this problem becomes much more challenging. In this paper, we use the Wikipedia hyperlink dataset to evaluate our semi-supervised node classification model. Given a small set of labeled nodes, we develop a multiclass classifier that utilizes the network structure as well as textual descriptions of nodes to predict the most probable category(label) for each test node in a semi-supervised setting. Our experiment shows promising results for graph multiclass classification using directed graphSAGE and word2vec algorithms together. We also visualize the node embeddings in 2D using the t-SNE method.