Quoc Thai Nguyen, Thoaī Nguyen, N. H. Luong, Quoc Hung Ngo
{"title":"Fine-Tuning BERT for Sentiment Analysis of Vietnamese Reviews","authors":"Quoc Thai Nguyen, Thoaī Nguyen, N. H. Luong, Quoc Hung Ngo","doi":"10.1109/NICS51282.2020.9335899","DOIUrl":null,"url":null,"abstract":"Sentiment analysis is an important task in the field of Nature Language Processing (NLP), in which users' feedback data on a specific issue are evaluated and analyzed. Many deep learning models have been proposed to tackle this task, including the recently-introduced Bidirectional Encoder Representations from Transformers (BERT) model. In this paper, we experiment with two BERT fine-tuning methods for the sentiment analysis task on datasets of Vietnamese reviews: 1) a method that uses only the [CLS] token as the input for an attached feed-forward neural network, and 2) another method in which all BERT output vectors are used as the input for classification. Experimental results on two datasets show that models using BERT slightly outperform other models using GloVe and FastText. Also, regarding the datasets employed in this study, our proposed BERT fine-tuning method produces a model with better performance than the original BERT fine-tuning method.","PeriodicalId":308944,"journal":{"name":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS51282.2020.9335899","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 27
Abstract
Sentiment analysis is an important task in the field of Nature Language Processing (NLP), in which users' feedback data on a specific issue are evaluated and analyzed. Many deep learning models have been proposed to tackle this task, including the recently-introduced Bidirectional Encoder Representations from Transformers (BERT) model. In this paper, we experiment with two BERT fine-tuning methods for the sentiment analysis task on datasets of Vietnamese reviews: 1) a method that uses only the [CLS] token as the input for an attached feed-forward neural network, and 2) another method in which all BERT output vectors are used as the input for classification. Experimental results on two datasets show that models using BERT slightly outperform other models using GloVe and FastText. Also, regarding the datasets employed in this study, our proposed BERT fine-tuning method produces a model with better performance than the original BERT fine-tuning method.