{"title":"一种基于记忆卷积神经网络的改进文本分类模型","authors":"Yiyao Wang, Lihua Tian, Chen Li","doi":"10.1145/3404555.3404595","DOIUrl":null,"url":null,"abstract":"This paper proposes a text classification model, called improved memory neural network model, which is used to process large-scale training data. In this model, the optimized transformer feature extractor is used to replace the memory neural network which can not be parallelized. At the same time, the multi-level void convolution matrix is designed to replace the convolution neural network, so as to extract more accurate semantic unit features. Finally, in order to reduce the model parameters, each level of the convolution network pooling layer and the full connection layer are eliminated, but the global average pooling layer is instead used. The experimental results on THUCNews dataset and Twitter dataset show that the proposed method achieves competitive results in the accuracy, model parameters and convergence rate.","PeriodicalId":220526,"journal":{"name":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"An Improved Text Classification Model Based on Memory Convolution Neural Network\",\"authors\":\"Yiyao Wang, Lihua Tian, Chen Li\",\"doi\":\"10.1145/3404555.3404595\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a text classification model, called improved memory neural network model, which is used to process large-scale training data. In this model, the optimized transformer feature extractor is used to replace the memory neural network which can not be parallelized. At the same time, the multi-level void convolution matrix is designed to replace the convolution neural network, so as to extract more accurate semantic unit features. Finally, in order to reduce the model parameters, each level of the convolution network pooling layer and the full connection layer are eliminated, but the global average pooling layer is instead used. The experimental results on THUCNews dataset and Twitter dataset show that the proposed method achieves competitive results in the accuracy, model parameters and convergence rate.\",\"PeriodicalId\":220526,\"journal\":{\"name\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3404555.3404595\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 6th International Conference on Computing and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3404555.3404595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Improved Text Classification Model Based on Memory Convolution Neural Network
This paper proposes a text classification model, called improved memory neural network model, which is used to process large-scale training data. In this model, the optimized transformer feature extractor is used to replace the memory neural network which can not be parallelized. At the same time, the multi-level void convolution matrix is designed to replace the convolution neural network, so as to extract more accurate semantic unit features. Finally, in order to reduce the model parameters, each level of the convolution network pooling layer and the full connection layer are eliminated, but the global average pooling layer is instead used. The experimental results on THUCNews dataset and Twitter dataset show that the proposed method achieves competitive results in the accuracy, model parameters and convergence rate.