Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithm for Chest X-Ray Images Classification

IF 0.7 Q3 ENGINEERING, MULTIDISCIPLINARY
A. Yasar
{"title":"Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithm for Chest X-Ray Images Classification","authors":"A. Yasar","doi":"10.31803/tg-20220828220446","DOIUrl":null,"url":null,"abstract":"Covid-19 virus has led to a tremendous pandemic in more than 200 countries across the globe, leading to severe impacts on the lives and health of a large number of people globally. The emergence of Omicron (SARS-CoV-2), which is a coronavirus 2 variant, an acute respiratory syndrome which is highly mutated, has again caused social limitations around the world because of infectious and vaccine escape mutations. One of the most significant steps in the fight against covid-19 is to identify those who were infected with the virus as early as possible, to start their treatment and to minimize the risk of transmission. Detection of this disease from radiographic and radiological images is perhaps one of the quickest and most accessible methods of diagnosing patients. In this study, a computer aided system based on deep learning is proposed for rapid diagnosis of COVID-19 from chest x-ray images. First, a dataset of 5380 Chest x-ray images was collected from publicly available datasets. In the first step, the deep features of the images in the dataset are extracted by using the dataset pre-trained convolutional neural network (CNN) model. In the second step, Differential Evolution (DE), Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) algorithms were used for feature selection in order to find the features that are effective for classification of these deep features. Finally, the features obtained in two stages, Decision Tree (DT), Naive Bayes (NB), support vector machine (SVM), k-Nearest Neighbours (k-NN) and Neural Network (NN) classifiers are used for binary, triple and quadruple classification. In order to measure the success of the models objectively, 10 folds cross validation was used. As a result, 1000 features were extracted with the SqueezeNet CNN model. In the binary, triple and quadruple classification process using these features, the SVM method was found to be the best classifier. The classification successes of the SVM model are 96.02%, 86.84% and 79.87%, respectively. The results obtained from the classification process with deep feature extraction were achieved by selecting the features in the proposed method in less time and with less features. While the performance achieved is very good, further analysis is required on a larger set of COVID-19 images to obtain higher estimates of accuracy.","PeriodicalId":43419,"journal":{"name":"TEHNICKI GLASNIK-TECHNICAL JOURNAL","volume":"1 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"TEHNICKI GLASNIK-TECHNICAL JOURNAL","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31803/tg-20220828220446","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 2

Abstract

Covid-19 virus has led to a tremendous pandemic in more than 200 countries across the globe, leading to severe impacts on the lives and health of a large number of people globally. The emergence of Omicron (SARS-CoV-2), which is a coronavirus 2 variant, an acute respiratory syndrome which is highly mutated, has again caused social limitations around the world because of infectious and vaccine escape mutations. One of the most significant steps in the fight against covid-19 is to identify those who were infected with the virus as early as possible, to start their treatment and to minimize the risk of transmission. Detection of this disease from radiographic and radiological images is perhaps one of the quickest and most accessible methods of diagnosing patients. In this study, a computer aided system based on deep learning is proposed for rapid diagnosis of COVID-19 from chest x-ray images. First, a dataset of 5380 Chest x-ray images was collected from publicly available datasets. In the first step, the deep features of the images in the dataset are extracted by using the dataset pre-trained convolutional neural network (CNN) model. In the second step, Differential Evolution (DE), Ant Colony Optimization (ACO) and Particle Swarm Optimization (PSO) algorithms were used for feature selection in order to find the features that are effective for classification of these deep features. Finally, the features obtained in two stages, Decision Tree (DT), Naive Bayes (NB), support vector machine (SVM), k-Nearest Neighbours (k-NN) and Neural Network (NN) classifiers are used for binary, triple and quadruple classification. In order to measure the success of the models objectively, 10 folds cross validation was used. As a result, 1000 features were extracted with the SqueezeNet CNN model. In the binary, triple and quadruple classification process using these features, the SVM method was found to be the best classifier. The classification successes of the SVM model are 96.02%, 86.84% and 79.87%, respectively. The results obtained from the classification process with deep feature extraction were achieved by selecting the features in the proposed method in less time and with less features. While the performance achieved is very good, further analysis is required on a larger set of COVID-19 images to obtain higher estimates of accuracy.
基于元启发式算法的包装和混合特征选择方法用于胸部x射线图像分类
新冠肺炎疫情已在全球200多个国家引发大规模疫情,给全球大批民众的生命健康带来严重影响。欧米克隆(SARS-CoV-2)是冠状病毒2型的变种,是一种高度变异的急性呼吸系统综合症。由于传染病和疫苗逃逸突变,它的出现再次在世界范围内引起了社会限制。抗击covid-19的最重要步骤之一是尽早确定病毒感染者,开始治疗并尽量减少传播风险。从x线摄影和放射图像中发现这种疾病可能是诊断患者最快和最容易的方法之一。本研究提出一种基于深度学习的计算机辅助系统,用于从胸部x线图像中快速诊断COVID-19。首先,从公开数据集中收集了5380张胸部x射线图像的数据集。第一步,使用数据集预训练的卷积神经网络(CNN)模型提取数据集中图像的深度特征。第二步,利用差分进化(DE)、蚁群优化(ACO)和粒子群优化(PSO)算法进行特征选择,寻找对深度特征分类有效的特征。最后,利用决策树(DT)、朴素贝叶斯(NB)、支持向量机(SVM)、k近邻(k-NN)和神经网络(NN)分类器这两个阶段得到的特征进行二值分类、三重分类和四重分类。为了客观地衡量模型的成功与否,采用10倍交叉验证。结果,使用SqueezeNet CNN模型提取了1000个特征。在利用这些特征进行二值分类、三重分类和四重分类的过程中,发现支持向量机方法是最好的分类器。SVM模型的分类成功率分别为96.02%、86.84%和79.87%。采用深度特征提取的分类过程可以在更短的时间内以更少的特征选择出所提出方法中的特征,从而获得分类结果。虽然所取得的性能非常好,但需要对更大的COVID-19图像集进行进一步分析,以获得更高的准确性估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
TEHNICKI GLASNIK-TECHNICAL JOURNAL
TEHNICKI GLASNIK-TECHNICAL JOURNAL ENGINEERING, MULTIDISCIPLINARY-
CiteScore
1.50
自引率
8.30%
发文量
85
审稿时长
15 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信