Jinjun Ruan, Jonathan M. Caballero, Ronaldo Juanatas
{"title":"Chinese News Text Classification Method Based On Attention Mechanism","authors":"Jinjun Ruan, Jonathan M. Caballero, Ronaldo Juanatas","doi":"10.1109/ICBIR54589.2022.9786458","DOIUrl":null,"url":null,"abstract":"Combining the convolution neural network (CNN) model and bidirectional long short-term memory (BiLSTM) model, an ATT-CN-BILSTM Chinese news classification model is proposed based on the attention mechanism. The model uses the attention mechanism to improve the feature extraction process of CNN and BiLSTM. After cancelling the CNN pooling layer, it pays attention to the critical local features obtained by CNN convolution according to the timing features output by BiLSTM, giving full play to the respective advantages of CNN and BiLSTM models. The experimental results on Thucnews dataset show that the accuracy of the model for Chinese news text classification is 97.87%, and the recall rate and F1 score are better than the comparison model.","PeriodicalId":216904,"journal":{"name":"2022 7th International Conference on Business and Industrial Research (ICBIR)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Business and Industrial Research (ICBIR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBIR54589.2022.9786458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Combining the convolution neural network (CNN) model and bidirectional long short-term memory (BiLSTM) model, an ATT-CN-BILSTM Chinese news classification model is proposed based on the attention mechanism. The model uses the attention mechanism to improve the feature extraction process of CNN and BiLSTM. After cancelling the CNN pooling layer, it pays attention to the critical local features obtained by CNN convolution according to the timing features output by BiLSTM, giving full play to the respective advantages of CNN and BiLSTM models. The experimental results on Thucnews dataset show that the accuracy of the model for Chinese news text classification is 97.87%, and the recall rate and F1 score are better than the comparison model.