{"title":"基于词特征和Bi-LSTM的在线评论情感分类模型","authors":"Jingxuan Hu","doi":"10.1109/ICDSCA56264.2022.9988320","DOIUrl":null,"url":null,"abstract":"With the rapid development of e-commerce, many purchase and comment records are produced. Sentiment classification of commodity reviews is of great value for automatically monitoring bad reviews and assisting merchants in analyzing consumer feedback. At present, the Bi-LSTM model is representative of Chinese text sentiment classification, which can understand the semantic information in time sequence. However, due to the lack of processing lexical information, there is a problem that word vectors cannot highlight the information of sentiment words. Therefore, this paper proposes a sentiment classification model of Chinese product reviews based on word features and Bi-LSTM. The new model firstly uses Word2vec's CBOW model to train the word vectors, secondly uses an improved information gain algorithm with the word distribution and sentiment weights to calculate the amount of information, and finally uses the Naive Bayes model to classify the network classification results twice, which solves the problem that the basic Bi-LSTM model lacks understanding of lexical information. The experimental results show that the new model achieves better results relative to the basic Bi-LSTM model and can capture the sentiment information of the comments more accurately. In the test set, the accuracy rate reached 89.03%.","PeriodicalId":416983,"journal":{"name":"2022 IEEE 2nd International Conference on Data Science and Computer Application (ICDSCA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Sentiment Classification Model of Online Reviews Based on Word Features and Bi-LSTM\",\"authors\":\"Jingxuan Hu\",\"doi\":\"10.1109/ICDSCA56264.2022.9988320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of e-commerce, many purchase and comment records are produced. Sentiment classification of commodity reviews is of great value for automatically monitoring bad reviews and assisting merchants in analyzing consumer feedback. At present, the Bi-LSTM model is representative of Chinese text sentiment classification, which can understand the semantic information in time sequence. However, due to the lack of processing lexical information, there is a problem that word vectors cannot highlight the information of sentiment words. Therefore, this paper proposes a sentiment classification model of Chinese product reviews based on word features and Bi-LSTM. The new model firstly uses Word2vec's CBOW model to train the word vectors, secondly uses an improved information gain algorithm with the word distribution and sentiment weights to calculate the amount of information, and finally uses the Naive Bayes model to classify the network classification results twice, which solves the problem that the basic Bi-LSTM model lacks understanding of lexical information. The experimental results show that the new model achieves better results relative to the basic Bi-LSTM model and can capture the sentiment information of the comments more accurately. In the test set, the accuracy rate reached 89.03%.\",\"PeriodicalId\":416983,\"journal\":{\"name\":\"2022 IEEE 2nd International Conference on Data Science and Computer Application (ICDSCA)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 2nd International Conference on Data Science and Computer Application (ICDSCA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSCA56264.2022.9988320\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Data Science and Computer Application (ICDSCA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSCA56264.2022.9988320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Sentiment Classification Model of Online Reviews Based on Word Features and Bi-LSTM
With the rapid development of e-commerce, many purchase and comment records are produced. Sentiment classification of commodity reviews is of great value for automatically monitoring bad reviews and assisting merchants in analyzing consumer feedback. At present, the Bi-LSTM model is representative of Chinese text sentiment classification, which can understand the semantic information in time sequence. However, due to the lack of processing lexical information, there is a problem that word vectors cannot highlight the information of sentiment words. Therefore, this paper proposes a sentiment classification model of Chinese product reviews based on word features and Bi-LSTM. The new model firstly uses Word2vec's CBOW model to train the word vectors, secondly uses an improved information gain algorithm with the word distribution and sentiment weights to calculate the amount of information, and finally uses the Naive Bayes model to classify the network classification results twice, which solves the problem that the basic Bi-LSTM model lacks understanding of lexical information. The experimental results show that the new model achieves better results relative to the basic Bi-LSTM model and can capture the sentiment information of the comments more accurately. In the test set, the accuracy rate reached 89.03%.