{"title":"使用最小方差原理的数据流的细化","authors":"Virendrakumar A. Dhotre, K. Karande","doi":"10.1109/IC3I.2014.7019638","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a refined scheme on active learning from data streams where data volumes grow continuously. The objective is to label a small portion of stream data for which a model is derived to predict future instances as accurately as possible. We propose a classifier-ensemble based active learning framework which selectively labels instances from data streams to build an ensemble classifier. Classifier ensemble's variance directly corresponds to its error rates and the efforts of reducing the variance is equivalent to improving its prediction accuracy. We introduce a Minimum-Variance principle to guide instance labeling process for data streams. The MV principle and the optimal weighting module are proposed to be combined to build an active learning framework for data streams. Results and implementation demonstrate that the percentage of accuracy of the Minimum variance margin method is good as compared to other methods.","PeriodicalId":430848,"journal":{"name":"2014 International Conference on Contemporary Computing and Informatics (IC3I)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Refinement of data streams using Minimum Variance principle\",\"authors\":\"Virendrakumar A. Dhotre, K. Karande\",\"doi\":\"10.1109/IC3I.2014.7019638\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a refined scheme on active learning from data streams where data volumes grow continuously. The objective is to label a small portion of stream data for which a model is derived to predict future instances as accurately as possible. We propose a classifier-ensemble based active learning framework which selectively labels instances from data streams to build an ensemble classifier. Classifier ensemble's variance directly corresponds to its error rates and the efforts of reducing the variance is equivalent to improving its prediction accuracy. We introduce a Minimum-Variance principle to guide instance labeling process for data streams. The MV principle and the optimal weighting module are proposed to be combined to build an active learning framework for data streams. Results and implementation demonstrate that the percentage of accuracy of the Minimum variance margin method is good as compared to other methods.\",\"PeriodicalId\":430848,\"journal\":{\"name\":\"2014 International Conference on Contemporary Computing and Informatics (IC3I)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Contemporary Computing and Informatics (IC3I)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC3I.2014.7019638\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I.2014.7019638","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Refinement of data streams using Minimum Variance principle
In this paper, we propose a refined scheme on active learning from data streams where data volumes grow continuously. The objective is to label a small portion of stream data for which a model is derived to predict future instances as accurately as possible. We propose a classifier-ensemble based active learning framework which selectively labels instances from data streams to build an ensemble classifier. Classifier ensemble's variance directly corresponds to its error rates and the efforts of reducing the variance is equivalent to improving its prediction accuracy. We introduce a Minimum-Variance principle to guide instance labeling process for data streams. The MV principle and the optimal weighting module are proposed to be combined to build an active learning framework for data streams. Results and implementation demonstrate that the percentage of accuracy of the Minimum variance margin method is good as compared to other methods.