利用增强和聚类技术对数据进行压缩和噪声检测

Yuan-Cheng Xie, Jing-yu Yang
{"title":"利用增强和聚类技术对数据进行压缩和噪声检测","authors":"Yuan-Cheng Xie, Jing-yu Yang","doi":"10.1109/CCPR.2009.5344126","DOIUrl":null,"url":null,"abstract":"AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.","PeriodicalId":354468,"journal":{"name":"2009 Chinese Conference on Pattern Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using Boosting and Clustering to Prune Bagging and Detect Noisy Data\",\"authors\":\"Yuan-Cheng Xie, Jing-yu Yang\",\"doi\":\"10.1109/CCPR.2009.5344126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.\",\"PeriodicalId\":354468,\"journal\":{\"name\":\"2009 Chinese Conference on Pattern Recognition\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Chinese Conference on Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCPR.2009.5344126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Chinese Conference on Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCPR.2009.5344126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

AdaBoost以其优异的性能成为集成学习算法的代表。但是由于长期的培训,AdaBoost受到了人们的抱怨,这一缺陷限制了AdaBoost的实际应用。Bagging是一种快速训练和支持并行计算的方法。影响集成学习性能的重要因素之一是组件学习器的多样性。在此基础上,提出了一种利用聚类和Boosting对Bagging集合进行剪枝的新算法。其学习效率接近Bagging,性能接近AdaBoost。此外,该算法基于级联技术对原始样本中的噪声数据进行检测,获得了较好的噪声检测结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Using Boosting and Clustering to Prune Bagging and Detect Noisy Data
AdaBoost has been the representation of ensemble learning algorithm because of its excellent performance. However, due to its longtime training, AdaBoost was complained about by people and this defect limits the practical application. Bagging is a rapid method of training and supports for parallel computing. One of important factors that can affect the performance of ensemble learning is the diversity of component learners. Based on this view, a new algorithm using clustering and Boosting to prune Bagging ensembles is proposed in this paper. Its learning efficiency is close to Bagging and its performance is close to AdaBoost. Furthermore, this new algorithm can detect noisy data from original samples based on cascade technique, and a better result of noise detection can be acquired.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信