缓解属性约简中的过拟合:一种早期停止策略

Keyu Liu, Jingjing Song, Wendong Zhang, Xibei Yang
{"title":"缓解属性约简中的过拟合:一种早期停止策略","authors":"Keyu Liu, Jingjing Song, Wendong Zhang, Xibei Yang","doi":"10.1109/ICWAPR.2018.8521316","DOIUrl":null,"url":null,"abstract":"In rough set theory, forward heuristic algorithm selects the most important attribute in the process of attribute reduction until the given constraint is satisfied. However, the attributes selected by such strategy may bring us over-fitting. To solve such problem, a new heuristic algorithm is designed: the importance of the attribute is obtained by cross validation and then the Early Stopping is employed to terminate the algorithm if over-fitting occurs. Based on the neighborhood rough set, the heuristic algorithm is compared with the new method over several UCI data sets. The experimental results show that: 1) the proposed algorithm can effectively alleviate over-fitting; 2) the reduct obtained by the new algorithm may offer us better classification performances.","PeriodicalId":385478,"journal":{"name":"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Alleviating Over-Fitting in Attribute Reduction: An Early Stopping Strategy\",\"authors\":\"Keyu Liu, Jingjing Song, Wendong Zhang, Xibei Yang\",\"doi\":\"10.1109/ICWAPR.2018.8521316\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In rough set theory, forward heuristic algorithm selects the most important attribute in the process of attribute reduction until the given constraint is satisfied. However, the attributes selected by such strategy may bring us over-fitting. To solve such problem, a new heuristic algorithm is designed: the importance of the attribute is obtained by cross validation and then the Early Stopping is employed to terminate the algorithm if over-fitting occurs. Based on the neighborhood rough set, the heuristic algorithm is compared with the new method over several UCI data sets. The experimental results show that: 1) the proposed algorithm can effectively alleviate over-fitting; 2) the reduct obtained by the new algorithm may offer us better classification performances.\",\"PeriodicalId\":385478,\"journal\":{\"name\":\"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWAPR.2018.8521316\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWAPR.2018.8521316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

在粗糙集理论中,前向启发式算法在属性约简过程中选择最重要的属性,直到满足给定约束。然而,这种策略选择的属性可能会给我们带来过度拟合。为了解决这一问题,设计了一种新的启发式算法:通过交叉验证获得属性的重要度,如果出现过拟合,则采用早期停止算法终止算法。基于邻域粗糙集的启发式算法与新方法在多个UCI数据集上进行了比较。实验结果表明:1)所提算法能有效缓解过拟合;2)新算法得到的约简可以提供更好的分类性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Alleviating Over-Fitting in Attribute Reduction: An Early Stopping Strategy
In rough set theory, forward heuristic algorithm selects the most important attribute in the process of attribute reduction until the given constraint is satisfied. However, the attributes selected by such strategy may bring us over-fitting. To solve such problem, a new heuristic algorithm is designed: the importance of the attribute is obtained by cross validation and then the Early Stopping is employed to terminate the algorithm if over-fitting occurs. Based on the neighborhood rough set, the heuristic algorithm is compared with the new method over several UCI data sets. The experimental results show that: 1) the proposed algorithm can effectively alleviate over-fitting; 2) the reduct obtained by the new algorithm may offer us better classification performances.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信