{"title":"视图不足情况下协同训练算法的贝叶斯分析","authors":"Luca Didaci, F. Roli","doi":"10.1109/ISSPA.2012.6310456","DOIUrl":null,"url":null,"abstract":"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Bayesian analysis of co-training algorithm with insufficient views\",\"authors\":\"Luca Didaci, F. Roli\",\"doi\":\"10.1109/ISSPA.2012.6310456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.\",\"PeriodicalId\":248763,\"journal\":{\"name\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSPA.2012.6310456\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Bayesian analysis of co-training algorithm with insufficient views
The co-training algorithm can be applied if a dataset admits a representation into two different feature sets (two views). However, its optimality is proved only under the conditions a) sufficiency of each view, and b) conditional independence given the class. We address the case where condition a) doesn't hold, as often happens in concrete applications. In such cases the co-training is unable to converge to the optimal Bayesian classifier, because samples added in the training set are not distributed according to the class-conditional distributions, even if their assigned label is correct. These results help to better understand the behavior of the co-training algorithm when the classes are only `statistically' separable.