{"title":"两种基于纯度的算法在半监督流数据分类中的比较","authors":"J. R. Bertini, Liang Zhao","doi":"10.1109/BRICS-CCI-CBIC.2013.15","DOIUrl":null,"url":null,"abstract":"Semi-supervised learning algorithms address the problem of learning from partially labeled data. However, most of the semi-supervised classification methods proposed in the literature considers a stationary distribution of data. Which means that future data patterns tend to conform to the data distribution presented in data set throughout the application lifetime. However, for plenty of new variety of applications, this expected scenario is not compatible to reality. Therefore, the research of semi-supervised methods which comprises nonstationary data classification is of a major concern nowadays. In this paper, the KAOGINCSSL algorithm, which copes with non-stationary semi-supervised learning, is analysed when using two different strategies to spread the labels to train the classifiers. The first consist of employing the inductive algorithm KAOGSS to build the classifier and the second relies on using the transductive algorithm PMTLA to spread the labels prior to build the classifier. Results regarding accuracy and processing time involving both algorithms when applied to non-stationary problems are presented.","PeriodicalId":306195,"journal":{"name":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2013-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Comparison of Two Purity-Based Algorithms When Applied to Semi-supervised Streaming Data Classification\",\"authors\":\"J. R. Bertini, Liang Zhao\",\"doi\":\"10.1109/BRICS-CCI-CBIC.2013.15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semi-supervised learning algorithms address the problem of learning from partially labeled data. However, most of the semi-supervised classification methods proposed in the literature considers a stationary distribution of data. Which means that future data patterns tend to conform to the data distribution presented in data set throughout the application lifetime. However, for plenty of new variety of applications, this expected scenario is not compatible to reality. Therefore, the research of semi-supervised methods which comprises nonstationary data classification is of a major concern nowadays. In this paper, the KAOGINCSSL algorithm, which copes with non-stationary semi-supervised learning, is analysed when using two different strategies to spread the labels to train the classifiers. The first consist of employing the inductive algorithm KAOGSS to build the classifier and the second relies on using the transductive algorithm PMTLA to spread the labels prior to build the classifier. Results regarding accuracy and processing time involving both algorithms when applied to non-stationary problems are presented.\",\"PeriodicalId\":306195,\"journal\":{\"name\":\"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRICS-CCI-CBIC.2013.15\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 BRICS Congress on Computational Intelligence and 11th Brazilian Congress on Computational Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRICS-CCI-CBIC.2013.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparison of Two Purity-Based Algorithms When Applied to Semi-supervised Streaming Data Classification
Semi-supervised learning algorithms address the problem of learning from partially labeled data. However, most of the semi-supervised classification methods proposed in the literature considers a stationary distribution of data. Which means that future data patterns tend to conform to the data distribution presented in data set throughout the application lifetime. However, for plenty of new variety of applications, this expected scenario is not compatible to reality. Therefore, the research of semi-supervised methods which comprises nonstationary data classification is of a major concern nowadays. In this paper, the KAOGINCSSL algorithm, which copes with non-stationary semi-supervised learning, is analysed when using two different strategies to spread the labels to train the classifiers. The first consist of employing the inductive algorithm KAOGSS to build the classifier and the second relies on using the transductive algorithm PMTLA to spread the labels prior to build the classifier. Results regarding accuracy and processing time involving both algorithms when applied to non-stationary problems are presented.