Bibhuprasad Sahu, Chakradharamahanthi Madhavi, Sasmita Pani, J. Ravindra
{"title":"Binary Optimization Using Hybrid Owl Optimization For Biomarker Selection From Cancer Datasets","authors":"Bibhuprasad Sahu, Chakradharamahanthi Madhavi, Sasmita Pani, J. Ravindra","doi":"10.1109/TEMSMET56707.2023.10150104","DOIUrl":null,"url":null,"abstract":"The enhancement of medical technology generates a massive amount of disease data. For accurate disease detection, feature subset selection plays an important role in solving different classification problems. The selection of a good feature subset enhances the accuracy of the machine learning model and reduces the training time. However, it becomes tricky and challenging to identify the optimal feature subset from the high-dimensional datasets. Different hybrid models are proposed by various researchers to deal with these types of issues. The Owl optimization algorithm proves its efficiency in selecting the optimal features. Keeping an eye on the performance of this stochastic optimization, a binary version of the hybrid Owl optimization algorithm with simulating annealing is proposed to detect the biomarker features from the cancer datasets. K nearest neighbors (KNN) are used as a wrapper to find the best biomarkers. Eight benchmark datasets were employed for the proposed model’s performance evaluation. The results show that the proposed model is significantly better than binary Owl optimization, and other counterparts using various performance matrices such as accuracy, selection of best biomarkers, and computational time. The proposed model Owl-SA performs with an accuracy of 100% in the case of CNS and lymphoma datasets. And it retains the accuracy of 99.14%, 97.16%, 98.41%, 97.94%, 98.93%,98.32% for breast, colon, ALL-AML, ovarian, SRBCT, and MLL respectively.","PeriodicalId":179553,"journal":{"name":"2023 IEEE 3rd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","volume":"100 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 3rd International Conference on Technology, Engineering, Management for Societal impact using Marketing, Entrepreneurship and Talent (TEMSMET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TEMSMET56707.2023.10150104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The enhancement of medical technology generates a massive amount of disease data. For accurate disease detection, feature subset selection plays an important role in solving different classification problems. The selection of a good feature subset enhances the accuracy of the machine learning model and reduces the training time. However, it becomes tricky and challenging to identify the optimal feature subset from the high-dimensional datasets. Different hybrid models are proposed by various researchers to deal with these types of issues. The Owl optimization algorithm proves its efficiency in selecting the optimal features. Keeping an eye on the performance of this stochastic optimization, a binary version of the hybrid Owl optimization algorithm with simulating annealing is proposed to detect the biomarker features from the cancer datasets. K nearest neighbors (KNN) are used as a wrapper to find the best biomarkers. Eight benchmark datasets were employed for the proposed model’s performance evaluation. The results show that the proposed model is significantly better than binary Owl optimization, and other counterparts using various performance matrices such as accuracy, selection of best biomarkers, and computational time. The proposed model Owl-SA performs with an accuracy of 100% in the case of CNS and lymphoma datasets. And it retains the accuracy of 99.14%, 97.16%, 98.41%, 97.94%, 98.93%,98.32% for breast, colon, ALL-AML, ovarian, SRBCT, and MLL respectively.