A Wrapper Feature Selection Technique for Improving Diagnosis of Breast Cancer

Amal F. Goweda, Mohammed M Elmogy, S. Barakat
{"title":"A Wrapper Feature Selection Technique for Improving Diagnosis of Breast Cancer","authors":"Amal F. Goweda, Mohammed M Elmogy, S. Barakat","doi":"10.21608/mjcis.2017.311961","DOIUrl":null,"url":null,"abstract":"Nowadays, cancer is considered as a fairly common disease. Regarding the number of newly detected cases, breast cancer is ranked as one of the most leading cancer types to death in women. It can be cured, if it is identified and treated in its early stages. Therefore, this study explores a proposed integrated wrapper feature selection method called wrapper naïve-greedy search (WNGS) to improve the accuracy of the breast cancer diagnosis. WNGS is based on a wrapper method, which is blended with a greedy forward search to select optimal feature subset. WNGS method integrates a wrapper method based on Naïve Bayes (NB) classifier as a learning scheme with a forward greedy search method. Then, the selected feature subset is fed to a classifier to determine breast cancer. In addition, K-nearest neighbor-greedy search (KNN-GS) is used for comparison. In KNN-GS method, k-nearest neighbor (KNN) classifier is used as a learning scheme while a forward greedy search method is used to search through features. NB is used as the classifier for classification process for both methods. By applying these two methods, data features are reduced, and the classification rate is improved. Both methods are tested on two different benchmark breast cancer datasets. Accuracy results showed that WNGS method outperformed KNN-GS method. Also, WNGS method overcame KNN-GS regarding precision, recall, F-measure, and sensitivity.","PeriodicalId":253950,"journal":{"name":"Mansoura Journal for Computer and Information Sciences","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mansoura Journal for Computer and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/mjcis.2017.311961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Nowadays, cancer is considered as a fairly common disease. Regarding the number of newly detected cases, breast cancer is ranked as one of the most leading cancer types to death in women. It can be cured, if it is identified and treated in its early stages. Therefore, this study explores a proposed integrated wrapper feature selection method called wrapper naïve-greedy search (WNGS) to improve the accuracy of the breast cancer diagnosis. WNGS is based on a wrapper method, which is blended with a greedy forward search to select optimal feature subset. WNGS method integrates a wrapper method based on Naïve Bayes (NB) classifier as a learning scheme with a forward greedy search method. Then, the selected feature subset is fed to a classifier to determine breast cancer. In addition, K-nearest neighbor-greedy search (KNN-GS) is used for comparison. In KNN-GS method, k-nearest neighbor (KNN) classifier is used as a learning scheme while a forward greedy search method is used to search through features. NB is used as the classifier for classification process for both methods. By applying these two methods, data features are reduced, and the classification rate is improved. Both methods are tested on two different benchmark breast cancer datasets. Accuracy results showed that WNGS method outperformed KNN-GS method. Also, WNGS method overcame KNN-GS regarding precision, recall, F-measure, and sensitivity.
一种提高乳腺癌诊断的包膜特征选择技术
如今,癌症被认为是一种相当常见的疾病。关于新发现病例的数量,乳腺癌被列为导致妇女死亡的最主要癌症类型之一。如果在早期阶段得到识别和治疗,它是可以治愈的。因此,本研究提出了一种集成包装器特征选择方法wrapper naïve-greedy search (WNGS),以提高乳腺癌诊断的准确性。WNGS基于一种包装方法,该方法与贪婪前向搜索相结合,以选择最优特征子集。WNGS方法将基于Naïve贝叶斯(NB)分类器的包装器方法作为学习方案与前向贪婪搜索方法相结合。然后,将选择的特征子集馈送到分类器以确定乳腺癌。此外,还使用k近邻贪婪搜索(KNN-GS)进行比较。在KNN- gs方法中,使用k近邻分类器作为学习方案,使用前向贪婪搜索方法搜索特征。两种方法的分类过程都使用NB作为分类器。通过这两种方法的应用,减少了数据特征,提高了分类率。这两种方法都在两个不同的基准乳腺癌数据集上进行了测试。精度结果表明,WNGS方法优于KNN-GS方法。此外,WNGS方法在精密度、召回率、f值、灵敏度等方面也优于KNN-GS方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信