多标签多类文本分类的弱学习算法

Proceedings. International Conference on Machine Learning and Cybernetics Pub Date : 2002-11-04 DOI:10.1109/ICMLC.2002.1174511

Yan-Yong Xu, Xian-zhong Zhou, Zhongyang Guo

{"title":"多标签多类文本分类的弱学习算法","authors":"Yan-Yong Xu, Xian-zhong Zhou, Zhongyang Guo","doi":"10.1109/ICMLC.2002.1174511","DOIUrl":null,"url":null,"abstract":"To handle the multi-label multiclass text categorization, a weak learning algorithm (WLA) is presented. The main idea of WLA is to find a highly accurate classification rule by combining many weak hypotheses, each of which may be only moderately accurate. We used a separate procedure, called the weak learner, to compute the weak hypotheses, and found a set of weak hypotheses by calling the weak learner repeatedly in a series of rounds. These weak hypotheses were then combined into a single rule called the final hypothesis, and the final hypothesis ranked the possible labels for a given document with the hope that the appropriate labels would appear at the top of the ranking. Using the three designed evaluation measures - ordinary-error, average-coverage and average-precision - our experiments show that the performance of WLA is generally better than the other algorithms on the same dataset.","PeriodicalId":90702,"journal":{"name":"Proceedings. International Conference on Machine Learning and Cybernetics","volume":"168 1","pages":"890-894 vol.2"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Weak learning algorithm for multi-label multiclass text categorization\",\"authors\":\"Yan-Yong Xu, Xian-zhong Zhou, Zhongyang Guo\",\"doi\":\"10.1109/ICMLC.2002.1174511\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To handle the multi-label multiclass text categorization, a weak learning algorithm (WLA) is presented. The main idea of WLA is to find a highly accurate classification rule by combining many weak hypotheses, each of which may be only moderately accurate. We used a separate procedure, called the weak learner, to compute the weak hypotheses, and found a set of weak hypotheses by calling the weak learner repeatedly in a series of rounds. These weak hypotheses were then combined into a single rule called the final hypothesis, and the final hypothesis ranked the possible labels for a given document with the hope that the appropriate labels would appear at the top of the ranking. Using the three designed evaluation measures - ordinary-error, average-coverage and average-precision - our experiments show that the performance of WLA is generally better than the other algorithms on the same dataset.\",\"PeriodicalId\":90702,\"journal\":{\"name\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"volume\":\"168 1\",\"pages\":\"890-894 vol.2\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. International Conference on Machine Learning and Cybernetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLC.2002.1174511\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2002.1174511","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

针对多标签多类文本分类问题，提出了一种弱学习算法。WLA的主要思想是通过组合许多弱假设来找到一个高度准确的分类规则，每个弱假设可能只是中等准确。我们使用了一个单独的程序，称为弱学习器，来计算弱假设，并通过在一系列回合中反复调用弱学习器来找到一组弱假设。然后将这些弱假设组合成一个称为最终假设的规则，最终假设对给定文档的可能标签进行排序，希望合适的标签会出现在排名的顶部。使用设计的三种评价指标——平均误差、平均覆盖率和平均精度——我们的实验表明，在相同的数据集上，WLA的性能总体上优于其他算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Weak learning algorithm for multi-label multiclass text categorization

To handle the multi-label multiclass text categorization, a weak learning algorithm (WLA) is presented. The main idea of WLA is to find a highly accurate classification rule by combining many weak hypotheses, each of which may be only moderately accurate. We used a separate procedure, called the weak learner, to compute the weak hypotheses, and found a set of weak hypotheses by calling the weak learner repeatedly in a series of rounds. These weak hypotheses were then combined into a single rule called the final hypothesis, and the final hypothesis ranked the possible labels for a given document with the hope that the appropriate labels would appear at the top of the ranking. Using the three designed evaluation measures - ordinary-error, average-coverage and average-precision - our experiments show that the performance of WLA is generally better than the other algorithms on the same dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings. International Conference on Machine Learning and Cybernetics

自引率

0.00%

发文量