A Comparison of Machine Learning Methods for best Accuracy COVID-19 Diagnosis Using Chest X-Ray Images

Hussein Ahmed Ali, Walid Hariri, Nadia Smaoui Zghal, Dalenda Ben Aissa
{"title":"A Comparison of Machine Learning Methods for best Accuracy COVID-19 Diagnosis Using Chest X-Ray Images","authors":"Hussein Ahmed Ali, Walid Hariri, Nadia Smaoui Zghal, Dalenda Ben Aissa","doi":"10.1109/SETIT54465.2022.9875477","DOIUrl":null,"url":null,"abstract":"Coronavirus (COVID-19) changed the view of people towards life in all the countries of the world in December 2019. The virus has made chaos that cannot be predicted. This problem requires using a variety of technologies to aid in the identification of COVID-19 patients and to control the disease spread. For suspected instances of COVID-19 disease, chest X-ray (CXR) imaging is a standard with fewer costs, but it does not need a COVID-19 examination approach without using technology to help for a suitable diagnosis. In response to this issue, a big dataset of CXR images was divided into four classes found on the website Kaggle. Dealing with large data of the images needs dataset reprocessing through choosing the optimal method for getting speed and best accuracy. Dataset reprocessing converts into gray level then adjust image intensity, resize and extract the best features then apply Machine Learning ML models. The use of different prediction models, ML algorithms, and their performances are calculated with evaluation on the dataset after reprocessing. Decision Tree (DT), Random Forest (RF), Stochastic Gradient Descent (SGD), Logistic Regression (LR), Gaussian Naive Bayes (GNB), and K-Nearest Neighbors (KNN) are models used to foretell the specialized who would be diagnosed with COVID-19 quickly by using CXR images classification. The KNN has revealed the best accuracy compared with the others such as GNB, DT, SGD, LR, and RF. Also, KNN has the best-weighted average for all parameters, which are precision, sensitivity, and F1-score compared with the other models.","PeriodicalId":126155,"journal":{"name":"2022 IEEE 9th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SETIT54465.2022.9875477","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Coronavirus (COVID-19) changed the view of people towards life in all the countries of the world in December 2019. The virus has made chaos that cannot be predicted. This problem requires using a variety of technologies to aid in the identification of COVID-19 patients and to control the disease spread. For suspected instances of COVID-19 disease, chest X-ray (CXR) imaging is a standard with fewer costs, but it does not need a COVID-19 examination approach without using technology to help for a suitable diagnosis. In response to this issue, a big dataset of CXR images was divided into four classes found on the website Kaggle. Dealing with large data of the images needs dataset reprocessing through choosing the optimal method for getting speed and best accuracy. Dataset reprocessing converts into gray level then adjust image intensity, resize and extract the best features then apply Machine Learning ML models. The use of different prediction models, ML algorithms, and their performances are calculated with evaluation on the dataset after reprocessing. Decision Tree (DT), Random Forest (RF), Stochastic Gradient Descent (SGD), Logistic Regression (LR), Gaussian Naive Bayes (GNB), and K-Nearest Neighbors (KNN) are models used to foretell the specialized who would be diagnosed with COVID-19 quickly by using CXR images classification. The KNN has revealed the best accuracy compared with the others such as GNB, DT, SGD, LR, and RF. Also, KNN has the best-weighted average for all parameters, which are precision, sensitivity, and F1-score compared with the other models.
胸部x线图像诊断COVID-19最佳准确性的机器学习方法比较
2019年12月,冠状病毒(COVID-19)改变了世界各国人民对生活的看法。病毒造成了无法预测的混乱。这个问题需要使用各种技术来帮助识别COVID-19患者并控制疾病传播。对于疑似COVID-19病例,胸部x射线(CXR)成像是一种成本较低的标准,但在不使用技术帮助进行适当诊断的情况下,不需要采用COVID-19检查方法。针对这个问题,Kaggle网站将CXR图像的大数据集分为四类。处理大数据图像需要对数据集进行再处理,通过选择最优方法来获得速度和最佳精度。数据集再处理转换为灰度,然后调整图像强度,调整大小,提取最佳特征,然后应用机器学习ML模型。使用不同的预测模型、机器学习算法及其性能进行计算,并对数据集进行再处理后的评估。决策树(DT)、随机森林(RF)、随机梯度下降(SGD)、逻辑回归(LR)、高斯朴素贝叶斯(GNB)和k近邻(KNN)是通过使用CXR图像分类来预测谁将被快速诊断为COVID-19的模型。与GNB、DT、SGD、LR和RF等其他方法相比,KNN显示出最好的精度。此外,与其他模型相比,KNN在所有参数(精度、灵敏度和f1分数)上都具有最佳加权平均值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信