Improved Multi Label Classification in Hierarchical Taxonomies

Kunal Punera, Suju Rajan
{"title":"Improved Multi Label Classification in Hierarchical Taxonomies","authors":"Kunal Punera, Suju Rajan","doi":"10.1109/ICDMW.2009.110","DOIUrl":null,"url":null,"abstract":"Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of these domains data also tends to be multi-label in nature. In this paper, we consider the problem of automated text classification in these scenarios. We present a post-processing based approach that performs smoothing on the output of an underlying one-vs-all ensemble. In order to do this we formulate a Regularized Unimodal Regression problem and give an exact algorithm to solve it. We evaluate the performance of our approach on several real-world large-scale multi-label hierarchical taxonomies and demonstrate that our proposed method provides significant gains over other related approaches.","PeriodicalId":351078,"journal":{"name":"2009 IEEE International Conference on Data Mining Workshops","volume":"143 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2009.110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of these domains data also tends to be multi-label in nature. In this paper, we consider the problem of automated text classification in these scenarios. We present a post-processing based approach that performs smoothing on the output of an underlying one-vs-all ensemble. In order to do this we formulate a Regularized Unimodal Regression problem and give an exact algorithm to solve it. We evaluate the performance of our approach on several real-world large-scale multi-label hierarchical taxonomies and demonstrate that our proposed method provides significant gains over other related approaches.
层次分类法中改进的多标签分类
层次分类法用于组织和检索许多领域中的信息,特别是那些处理大量且快速增长的信息的领域。在许多这些领域中,数据本质上也倾向于多标签。在本文中,我们考虑了这些场景下的自动文本分类问题。我们提出了一种基于后处理的方法,该方法对底层一对一集成的输出执行平滑。为了做到这一点,我们提出了一个正则化单峰回归问题,并给出了一个精确的算法来解决它。我们在几个真实世界的大规模多标签分层分类法上评估了我们的方法的性能,并证明我们提出的方法比其他相关方法提供了显着的增益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信