A practical feature selection based on an optimal feature subset and its application for detecting lung nodules in chest radiographs

Haoyan Guo, Yuanzhi Cheng, Dazheng Wang, Li Guo
{"title":"A practical feature selection based on an optimal feature subset and its application for detecting lung nodules in chest radiographs","authors":"Haoyan Guo, Yuanzhi Cheng, Dazheng Wang, Li Guo","doi":"10.1109/BMEI.2013.6746994","DOIUrl":null,"url":null,"abstract":"The traditional motivation behind feature selection algorithms such as a genetic algorithm, a forward stepwise and a backward stepwise selections [1], is to find the best feature subset for a task using one particular learning algorithm. The idea is to select a optimal subset of attributes which are as representative as possible of the original data. However, it has been often found that no single classifier is entirely satisfactory for a particular task. Therefore, how to further improve the performance of these single systems on the basis of the previous optimal feature subset is a very important issue. Ensemble systems, also known as committees of classifiers, are composed of individual classifiers, organized in a parallel way and their outputs are combined in a combination method, which provides the final output of the system. Given the success of ensembles, ensembles allow us to get higher accuracy and sensitivity, which are often not achievable with single models. Based on the above, we propose a practical feature selection approach that is based on an optimal feature subset of a single CAD system, which is referred to as a multilevel optimal feature selection method (MOFS) in this paper. Through MOFS, we select the different optimal feature subsets in order to eliminate features that are redundant or irrelevant and obtain optimal features, and then a bagging ensemble with a MOFS method is proposed. Experimental results indicates that the accuracy of the bagging ensemble using a MOFS method is superior to that of a single CAD system and is also superior to that of the ensemble using an attribute selection algorithm based on ReliefF.","PeriodicalId":163211,"journal":{"name":"2013 6th International Conference on Biomedical Engineering and Informatics","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th International Conference on Biomedical Engineering and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BMEI.2013.6746994","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The traditional motivation behind feature selection algorithms such as a genetic algorithm, a forward stepwise and a backward stepwise selections [1], is to find the best feature subset for a task using one particular learning algorithm. The idea is to select a optimal subset of attributes which are as representative as possible of the original data. However, it has been often found that no single classifier is entirely satisfactory for a particular task. Therefore, how to further improve the performance of these single systems on the basis of the previous optimal feature subset is a very important issue. Ensemble systems, also known as committees of classifiers, are composed of individual classifiers, organized in a parallel way and their outputs are combined in a combination method, which provides the final output of the system. Given the success of ensembles, ensembles allow us to get higher accuracy and sensitivity, which are often not achievable with single models. Based on the above, we propose a practical feature selection approach that is based on an optimal feature subset of a single CAD system, which is referred to as a multilevel optimal feature selection method (MOFS) in this paper. Through MOFS, we select the different optimal feature subsets in order to eliminate features that are redundant or irrelevant and obtain optimal features, and then a bagging ensemble with a MOFS method is proposed. Experimental results indicates that the accuracy of the bagging ensemble using a MOFS method is superior to that of a single CAD system and is also superior to that of the ensemble using an attribute selection algorithm based on ReliefF.
基于最优特征子集的实用特征选择及其在胸片肺结节检测中的应用
特征选择算法(如遗传算法、前向逐步选择和后向逐步选择[1])背后的传统动机是使用特定的学习算法为任务找到最佳特征子集。其思想是选择一个最优的属性子集,这些属性尽可能具有原始数据的代表性。然而,人们经常发现,没有一个分类器对特定的任务是完全满意的。因此,如何在之前最优特征子集的基础上进一步提高这些单个系统的性能是一个非常重要的问题。集成系统,也称为分类器委员会,由单个分类器组成,以并行方式组织,其输出以组合方法组合,从而提供系统的最终输出。考虑到集成的成功,集成使我们能够获得更高的精度和灵敏度,这通常是单一模型无法实现的。在此基础上,我们提出了一种实用的基于单个CAD系统的最优特征子集的特征选择方法,本文将其称为多级最优特征选择方法(MOFS)。通过MOFS选择不同的最优特征子集,剔除冗余或不相关的特征,得到最优特征,并提出了基于MOFS的套袋集成方法。实验结果表明,基于MOFS方法的套袋集成精度优于单一CAD系统,也优于基于ReliefF的属性选择算法的套袋集成精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信