{"title":"利用增强和聚类技术对数据进行压缩和噪声检测","authors":"Yuan-Cheng Xie, Jing-yu Yang","doi":"10.1109/CCPR.2009.5344126","DOIUrl":null,"url":null,"abstract":"AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.","PeriodicalId":354468,"journal":{"name":"2009 Chinese Conference on Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using Boosting and Clustering to Prune Bagging and Detect Noisy Data\",\"authors\":\"Yuan-Cheng Xie, Jing-yu Yang\",\"doi\":\"10.1109/CCPR.2009.5344126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.\",\"PeriodicalId\":354468,\"journal\":{\"name\":\"2009 Chinese Conference on Pattern Recognition\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Chinese Conference on Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCPR.2009.5344126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Chinese Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCPR.2009.5344126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Boosting and Clustering to Prune Bagging and Detect Noisy Data
AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.