{"title":"基于ERNIE-BiLSTM的中文文本情感分类方法","authors":"Haiyuan Guo, Chengying Chi, Xuegang Zhan","doi":"10.1109/ICCEA53728.2021.00024","DOIUrl":null,"url":null,"abstract":"For the Chinese text sentiment classification task, the preprocessing based on deep learning models cannot retain the information and polysemy of the word in the sentence well. So this paper adopts the newly developed ERNIE [1–2] (Knowledge Enhanced Semantic Representation) pre-training model from Baidu, which is based on word feature input modeling, not only enhances the semantic representation of the word, but also preserves the contextual information of the word and the polysemy of the word. After pre-training by ERNIE model, the output word vector is used as the input of BiLSTM (bidirectional long and short-term memory network) model for training and obtaining sentiment classification results. The accuracy rate of Ernie bilstm model is 92.35% after verification on nlpcc2014 microblog sentiment analysis sample data set, which proves that the model has good performance in Chinese text sentiment classification task.","PeriodicalId":325790,"journal":{"name":"2021 International Conference on Computer Engineering and Application (ICCEA)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"ERNIE-BiLSTM Based Chinese Text Sentiment Classification Method\",\"authors\":\"Haiyuan Guo, Chengying Chi, Xuegang Zhan\",\"doi\":\"10.1109/ICCEA53728.2021.00024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For the Chinese text sentiment classification task, the preprocessing based on deep learning models cannot retain the information and polysemy of the word in the sentence well. So this paper adopts the newly developed ERNIE [1–2] (Knowledge Enhanced Semantic Representation) pre-training model from Baidu, which is based on word feature input modeling, not only enhances the semantic representation of the word, but also preserves the contextual information of the word and the polysemy of the word. After pre-training by ERNIE model, the output word vector is used as the input of BiLSTM (bidirectional long and short-term memory network) model for training and obtaining sentiment classification results. The accuracy rate of Ernie bilstm model is 92.35% after verification on nlpcc2014 microblog sentiment analysis sample data set, which proves that the model has good performance in Chinese text sentiment classification task.\",\"PeriodicalId\":325790,\"journal\":{\"name\":\"2021 International Conference on Computer Engineering and Application (ICCEA)\",\"volume\":\"60 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computer Engineering and Application (ICCEA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCEA53728.2021.00024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Engineering and Application (ICCEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCEA53728.2021.00024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
ERNIE-BiLSTM Based Chinese Text Sentiment Classification Method
For the Chinese text sentiment classification task, the preprocessing based on deep learning models cannot retain the information and polysemy of the word in the sentence well. So this paper adopts the newly developed ERNIE [1–2] (Knowledge Enhanced Semantic Representation) pre-training model from Baidu, which is based on word feature input modeling, not only enhances the semantic representation of the word, but also preserves the contextual information of the word and the polysemy of the word. After pre-training by ERNIE model, the output word vector is used as the input of BiLSTM (bidirectional long and short-term memory network) model for training and obtaining sentiment classification results. The accuracy rate of Ernie bilstm model is 92.35% after verification on nlpcc2014 microblog sentiment analysis sample data set, which proves that the model has good performance in Chinese text sentiment classification task.