{"title":"Feature Selection Using a Hybrid Approach Depends on Filter and Wrapper Methods for Accurate Breast Cancer Diagnosis","authors":"Mohammed S. Hashim, Ali A. Yassin","doi":"10.56714/bjrs.49.1.5","DOIUrl":null,"url":null,"abstract":"Breast cancer is the biggest cause of mortality in women, outscoring all other malignancies. Diagnosing breast cancer is hard because the disease is complicated, treatment methods change, and there are many different kinds of patients. Information technology and artificial intelligence contribute to improve diagnostic procedures, which are critical for care and treatment as well as reducing and controlling cancer recurrence. The primary part of this research is to develop a new feature selection strategy based on a hybrid approach that combines two methods for selecting features: the filter and the wrapper. In two stages, this method reduces the number of features from 30 to 15 to increase and improve classification accuracy. The suggested method was tested using the Wisconsin Breast Cancer Dataset dataset (WDBC). To enhance the classification of breast cancer tumors, a soft voting classifier was used in this study. The proposed methodology outperforms previous research, achieving 1 for the F1 score, 1 for AUC, 1 for recall, 1 for precision, and 100% for accuracy. Furthermore, 10-fold cross-validation has a 98.2% accuracy rate.","PeriodicalId":127734,"journal":{"name":"49","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"49","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.56714/bjrs.49.1.5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Breast cancer is the biggest cause of mortality in women, outscoring all other malignancies. Diagnosing breast cancer is hard because the disease is complicated, treatment methods change, and there are many different kinds of patients. Information technology and artificial intelligence contribute to improve diagnostic procedures, which are critical for care and treatment as well as reducing and controlling cancer recurrence. The primary part of this research is to develop a new feature selection strategy based on a hybrid approach that combines two methods for selecting features: the filter and the wrapper. In two stages, this method reduces the number of features from 30 to 15 to increase and improve classification accuracy. The suggested method was tested using the Wisconsin Breast Cancer Dataset dataset (WDBC). To enhance the classification of breast cancer tumors, a soft voting classifier was used in this study. The proposed methodology outperforms previous research, achieving 1 for the F1 score, 1 for AUC, 1 for recall, 1 for precision, and 100% for accuracy. Furthermore, 10-fold cross-validation has a 98.2% accuracy rate.