{"title":"Text classification problems via BERT embedding method and graph convolutional neural network","authors":"L. Tran, Tuan-Kiet Tran, An Mai","doi":"10.1109/atc52653.2021.9598337","DOIUrl":null,"url":null,"abstract":"This paper presents a hybrid technique of combining the BERT embedding method and the graph convolutional neural network. This combination is then employed to solve the text classification problem. Initially, we apply the BERT embedding method to the whole corpus in order to transform all the texts into numerical vectors. Then, the graph convolutional neural network will be applied to these numerical vectors to classify these texts into their appropriate classes. Especially, in our approach, we need only a few labeled texts for the model training. For the illustration, in this paper, we use the BBC news and the IMDB movie reviews datasets to perform our experiments, showing that the performance of the graph convolutional neural network model is better than the performances of the combination of the BERT embedding method with other classical machine learning models.","PeriodicalId":196900,"journal":{"name":"2021 International Conference on Advanced Technologies for Communications (ATC)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Advanced Technologies for Communications (ATC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/atc52653.2021.9598337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a hybrid technique of combining the BERT embedding method and the graph convolutional neural network. This combination is then employed to solve the text classification problem. Initially, we apply the BERT embedding method to the whole corpus in order to transform all the texts into numerical vectors. Then, the graph convolutional neural network will be applied to these numerical vectors to classify these texts into their appropriate classes. Especially, in our approach, we need only a few labeled texts for the model training. For the illustration, in this paper, we use the BBC news and the IMDB movie reviews datasets to perform our experiments, showing that the performance of the graph convolutional neural network model is better than the performances of the combination of the BERT embedding method with other classical machine learning models.