基于深度标签辅助分类的分层标签文本分类方法

Cao Yu-kun, Wei Zi-yue, Tang Yi-jia, Jin Cheng-kun
{"title":"基于深度标签辅助分类的分层标签文本分类方法","authors":"Cao Yu-kun, Wei Zi-yue, Tang Yi-jia, Jin Cheng-kun","doi":"10.1109/DDCLS58216.2023.10166293","DOIUrl":null,"url":null,"abstract":"Hierarchical label text classification is a challenging task in the field of natural language processing, where each document needs to be correctly classified into multiple labels with hierarchical structure. However, in the label set, due to the insufficient semantic information contained in the labels and the small number of documents classified under deep-level labels, the training of deep-level labels is insufficient, leading to a significant imbalance in label training. To address this, a hierarchical label text classification method with deep-level label-assisted classification (DLAC) is proposed. The method proposes a deep-level label-assisted classifier, which effectively utilizes text features and rich features of shallow label nodes corresponding to deep label nodes (i.e., shallow label's rich features) on the basis of enhanced label semantics to enhance the classification performance of deep labels. The comparison experiment results with eleven algorithms on three datasets show that the model can effectively improve the classification performance of deep-level labels and achieve good results.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical Label Text Classification Method with Deep-Level Label-Assisted Classification\",\"authors\":\"Cao Yu-kun, Wei Zi-yue, Tang Yi-jia, Jin Cheng-kun\",\"doi\":\"10.1109/DDCLS58216.2023.10166293\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hierarchical label text classification is a challenging task in the field of natural language processing, where each document needs to be correctly classified into multiple labels with hierarchical structure. However, in the label set, due to the insufficient semantic information contained in the labels and the small number of documents classified under deep-level labels, the training of deep-level labels is insufficient, leading to a significant imbalance in label training. To address this, a hierarchical label text classification method with deep-level label-assisted classification (DLAC) is proposed. The method proposes a deep-level label-assisted classifier, which effectively utilizes text features and rich features of shallow label nodes corresponding to deep label nodes (i.e., shallow label's rich features) on the basis of enhanced label semantics to enhance the classification performance of deep labels. The comparison experiment results with eleven algorithms on three datasets show that the model can effectively improve the classification performance of deep-level labels and achieve good results.\",\"PeriodicalId\":415532,\"journal\":{\"name\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS58216.2023.10166293\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10166293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

分层标签文本分类是自然语言处理领域的一项具有挑战性的任务,需要将每个文档正确分类为多个具有分层结构的标签。然而,在标签集中,由于标签所包含的语义信息不足,且深层标签下分类的文档数量较少,导致深层标签的训练不足,导致标签训练的不平衡现象明显。为了解决这一问题,提出了一种基于深度标签辅助分类(DLAC)的分层标签文本分类方法。该方法提出了一种深度标签辅助分类器,在增强标签语义的基础上,有效利用深度标签节点对应的浅标签节点的文本特征和丰富特征(即浅标签的丰富特征)来增强深度标签的分类性能。在3个数据集上与11种算法的对比实验结果表明,该模型能有效提高深层标签的分类性能,取得了较好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Hierarchical Label Text Classification Method with Deep-Level Label-Assisted Classification
Hierarchical label text classification is a challenging task in the field of natural language processing, where each document needs to be correctly classified into multiple labels with hierarchical structure. However, in the label set, due to the insufficient semantic information contained in the labels and the small number of documents classified under deep-level labels, the training of deep-level labels is insufficient, leading to a significant imbalance in label training. To address this, a hierarchical label text classification method with deep-level label-assisted classification (DLAC) is proposed. The method proposes a deep-level label-assisted classifier, which effectively utilizes text features and rich features of shallow label nodes corresponding to deep label nodes (i.e., shallow label's rich features) on the basis of enhanced label semantics to enhance the classification performance of deep labels. The comparison experiment results with eleven algorithms on three datasets show that the model can effectively improve the classification performance of deep-level labels and achieve good results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信