{"title":"Improved Multi Label Classification in Hierarchical Taxonomies","authors":"Kunal Punera, Suju Rajan","doi":"10.1109/ICDMW.2009.110","DOIUrl":null,"url":null,"abstract":"Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of these domains data also tends to be multi-label in nature. In this paper, we consider the problem of automated text classification in these scenarios. We present a post-processing based approach that performs smoothing on the output of an underlying one-vs-all ensemble. In order to do this we formulate a Regularized Unimodal Regression problem and give an exact algorithm to solve it. We evaluate the performance of our approach on several real-world large-scale multi-label hierarchical taxonomies and demonstrate that our proposed method provides significant gains over other related approaches.","PeriodicalId":351078,"journal":{"name":"2009 IEEE International Conference on Data Mining Workshops","volume":"143 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Data Mining Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2009.110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of these domains data also tends to be multi-label in nature. In this paper, we consider the problem of automated text classification in these scenarios. We present a post-processing based approach that performs smoothing on the output of an underlying one-vs-all ensemble. In order to do this we formulate a Regularized Unimodal Regression problem and give an exact algorithm to solve it. We evaluate the performance of our approach on several real-world large-scale multi-label hierarchical taxonomies and demonstrate that our proposed method provides significant gains over other related approaches.