{"title":"基于交互注意机制的双通道文本分类模型","authors":"Wei Han, Cheng Peng","doi":"10.1109/DSA56465.2022.00096","DOIUrl":null,"url":null,"abstract":"Aiming at the problem that convolutional neural network(CNN) focuses on local features and lacks the ability of text context feature extraction, In this paper, we propose a dual-channel text classification model based on Interactive Attention Mechanism(IAM). The model uses skip-gram to embed words into dense low latitude vectors and obtains the text embedding matrix, which is input into the Gate Convolution Neural Network(GCNN) and Multi-Head Attention(MHA) at the same time, and then after Pointwise Convolution(PC), the features obtained from the feature extraction layer in the two channels are calculated by an IAM, and finally, the features are fused. Compared with CNN, LSTM, and other improved models, the classification effect of this hybrid model is improved.","PeriodicalId":208148,"journal":{"name":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Dual-channel Text Classification Model based on an Interactive Attention Mechanism\",\"authors\":\"Wei Han, Cheng Peng\",\"doi\":\"10.1109/DSA56465.2022.00096\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aiming at the problem that convolutional neural network(CNN) focuses on local features and lacks the ability of text context feature extraction, In this paper, we propose a dual-channel text classification model based on Interactive Attention Mechanism(IAM). The model uses skip-gram to embed words into dense low latitude vectors and obtains the text embedding matrix, which is input into the Gate Convolution Neural Network(GCNN) and Multi-Head Attention(MHA) at the same time, and then after Pointwise Convolution(PC), the features obtained from the feature extraction layer in the two channels are calculated by an IAM, and finally, the features are fused. Compared with CNN, LSTM, and other improved models, the classification effect of this hybrid model is improved.\",\"PeriodicalId\":208148,\"journal\":{\"name\":\"2022 9th International Conference on Dependable Systems and Their Applications (DSA)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 9th International Conference on Dependable Systems and Their Applications (DSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DSA56465.2022.00096\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSA56465.2022.00096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Dual-channel Text Classification Model based on an Interactive Attention Mechanism
Aiming at the problem that convolutional neural network(CNN) focuses on local features and lacks the ability of text context feature extraction, In this paper, we propose a dual-channel text classification model based on Interactive Attention Mechanism(IAM). The model uses skip-gram to embed words into dense low latitude vectors and obtains the text embedding matrix, which is input into the Gate Convolution Neural Network(GCNN) and Multi-Head Attention(MHA) at the same time, and then after Pointwise Convolution(PC), the features obtained from the feature extraction layer in the two channels are calculated by an IAM, and finally, the features are fused. Compared with CNN, LSTM, and other improved models, the classification effect of this hybrid model is improved.