具有方向性自关注的BiGRU文本分类

2022 11th International Conference of Information and Communication Technology (ICTech)) Pub Date : 2022-02-01 DOI:10.1109/ICTech55460.2022.00085

Tiantian Jiang, Zhanguo Wang

{"title":"具有方向性自关注的BiGRU文本分类","authors":"Tiantian Jiang, Zhanguo Wang","doi":"10.1109/ICTech55460.2022.00085","DOIUrl":null,"url":null,"abstract":"In the field of natural language processing, text classification is a key daily task. The main goal is to obtain effective features from text information, find the correspondence between feature representations and category labels, so as to classify the text. From the perspective of data flow, it is mainly divided into five stages: text preprocessing, vector representation of text, feature extraction, classifier classification and model training to complete text classification tasks. Among them, feature extraction is a very important stage, and it is also the focus of this article. GRU can learn long-term dependencies from learned local features, and bidirectional GRU can learn hidden features in sentences. The self-attention mechanism exhibits superior performance in many fields in natural language processing. It can mine the autocorrelation of data and highlight key information by adjusting the weight of keywords. Therefore, in view of the shortcomings of existing models in text global information modeling, this paper combines bidirectional GRU and self-attention mechanism, and proposes a hybrid model BiGRU-MA for text classification, which can extract deep semantic features and solve the problem of classification performance degradation due to the lack of semantic information. This article uses text classification related technology to model, describes the modeling ideas, and introduces the technology used, and finally compares experiments with existing models to verify the effectiveness of the model.","PeriodicalId":290836,"journal":{"name":"2022 11th International Conference of Information and Communication Technology (ICTech))","volume":"180 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text Classification Using BiGRU with Directional Self-Attention\",\"authors\":\"Tiantian Jiang, Zhanguo Wang\",\"doi\":\"10.1109/ICTech55460.2022.00085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the field of natural language processing, text classification is a key daily task. The main goal is to obtain effective features from text information, find the correspondence between feature representations and category labels, so as to classify the text. From the perspective of data flow, it is mainly divided into five stages: text preprocessing, vector representation of text, feature extraction, classifier classification and model training to complete text classification tasks. Among them, feature extraction is a very important stage, and it is also the focus of this article. GRU can learn long-term dependencies from learned local features, and bidirectional GRU can learn hidden features in sentences. The self-attention mechanism exhibits superior performance in many fields in natural language processing. It can mine the autocorrelation of data and highlight key information by adjusting the weight of keywords. Therefore, in view of the shortcomings of existing models in text global information modeling, this paper combines bidirectional GRU and self-attention mechanism, and proposes a hybrid model BiGRU-MA for text classification, which can extract deep semantic features and solve the problem of classification performance degradation due to the lack of semantic information. This article uses text classification related technology to model, describes the modeling ideas, and introduces the technology used, and finally compares experiments with existing models to verify the effectiveness of the model.\",\"PeriodicalId\":290836,\"journal\":{\"name\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"volume\":\"180 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference of Information and Communication Technology (ICTech))\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTech55460.2022.00085\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference of Information and Communication Technology (ICTech))","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTech55460.2022.00085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在自然语言处理领域，文本分类是一项重要的日常任务。主要目标是从文本信息中获取有效特征，找到特征表示与类别标签之间的对应关系，从而对文本进行分类。从数据流的角度来看，主要分为文本预处理、文本向量表示、特征提取、分类器分类和模型训练五个阶段来完成文本分类任务。其中特征提取是非常重要的一个阶段，也是本文研究的重点。GRU可以从学习到的局部特征中学习到长期依赖关系，双向GRU可以学习到句子中的隐藏特征。自注意机制在自然语言处理的许多领域都表现出优越的性能。通过调整关键词的权重，挖掘数据的自相关性，突出显示关键信息。因此，针对现有模型在文本全局信息建模方面存在的不足，本文将双向GRU与自关注机制相结合，提出了一种用于文本分类的混合模型BiGRU-MA，该模型能够提取深层语义特征，解决了由于语义信息缺乏而导致分类性能下降的问题。本文利用文本分类相关技术进行建模，描述了建模思路，并介绍了所采用的技术，最后与现有模型进行了实验对比，验证了模型的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Text Classification Using BiGRU with Directional Self-Attention

In the field of natural language processing, text classification is a key daily task. The main goal is to obtain effective features from text information, find the correspondence between feature representations and category labels, so as to classify the text. From the perspective of data flow, it is mainly divided into five stages: text preprocessing, vector representation of text, feature extraction, classifier classification and model training to complete text classification tasks. Among them, feature extraction is a very important stage, and it is also the focus of this article. GRU can learn long-term dependencies from learned local features, and bidirectional GRU can learn hidden features in sentences. The self-attention mechanism exhibits superior performance in many fields in natural language processing. It can mine the autocorrelation of data and highlight key information by adjusting the weight of keywords. Therefore, in view of the shortcomings of existing models in text global information modeling, this paper combines bidirectional GRU and self-attention mechanism, and proposes a hybrid model BiGRU-MA for text classification, which can extract deep semantic features and solve the problem of classification performance degradation due to the lack of semantic information. This article uses text classification related technology to model, describes the modeling ideas, and introduces the technology used, and finally compares experiments with existing models to verify the effectiveness of the model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 11th International Conference of Information and Communication Technology (ICTech))

自引率

0.00%

发文量