视图不足情况下协同训练算法的贝叶斯分析

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA) Pub Date : 2012-07-02 DOI:10.1109/ISSPA.2012.6310456

Luca Didaci, F. Roli

{"title":"视图不足情况下协同训练算法的贝叶斯分析","authors":"Luca Didaci, F. Roli","doi":"10.1109/ISSPA.2012.6310456","DOIUrl":null,"url":null,"abstract":"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Bayesian analysis of co-training algorithm with insufficient views\",\"authors\":\"Luca Didaci, F. Roli\",\"doi\":\"10.1109/ISSPA.2012.6310456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.\",\"PeriodicalId\":248763,\"journal\":{\"name\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSPA.2012.6310456\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

如果一个数据集允许两个不同的特征集(两个视图)表示，则可以应用协同训练算法。然而，它的最优性仅在a)每个视图的充分性和b)给定类的条件独立性的条件下被证明。我们处理条件a)不成立的情况，这在具体应用中经常发生。在这种情况下，协同训练无法收敛到最优贝叶斯分类器，因为添加到训练集中的样本不按照类条件分布分布，即使它们分配的标签是正确的。这些结果有助于更好地理解当类仅在“统计”上可分离时，协同训练算法的行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Bayesian analysis of co-training algorithm with insufficient views

The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)

自引率

0.00%

发文量