On Data Bias and the Usability of Deep Learning Algorithms in Classifying COVID-19 based on Chest X-ray

2021 IEEE 3rd International Multidisciplinary Conference on Engineering Technology (IMCET). Pub Date: 2021-12-08. DOI: 10.1109/imcet53404.2021.9665574

Hassan Ezzeddine, M. Awad, Alain S. Abi Ghanem, Bassem Mourani


Citations: 1

Abstract

SARS-CoV-2 is a new strain of virus that was first detected in China. It quickly spread across the world, affecting millions of people. For this reason, early detection is essential to limit the spread of the virus. Real-time reverse transcription polymerase chain reaction (RT-PCR) and the antibody test are the main tests used to detect the virus. Chest X-rays (CXRs) and computerized tomography (CT) scans are also used to detect the virus, although the American College of Radiology does not recommend using medical imaging as a diagnostic tool. As with other medical imaging tasks, convolutional neural networks are used to classify these images. We believe that developing a model to detect COVID-19 from CXRs has no clinical value, regardless of the accuracy achieved, since 58% of CXRs seem to be normal. During our literature review, we found several papers reporting suspiciously high accuracies of 90% and above. We believe that the dataset used to train and validate these networks is biased and not appropriate for deep learning, as every model we trained on the same dataset achieved high accuracy. Our experiments on Cohen's COVID dataset, augmented with the Wang dataset, show that any model trained on the Cohen dataset can easily achieve high accuracy. This was further validated by two experienced radiologists who participated in this study and were able to classify only 60% of cases as COVID-19. Our study highlights the importance of addressing bias in data and of developing trustworthy and explainable ML models based on well-curated data.
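The bias concern raised in the abstract can be illustrated with a small synthetic sketch. This is not the authors' experiment and uses no real CXR data. It assumes a hypothetical setup in which every positive record comes from one source ("A") and every negative from another ("B"), and in which the sources differ in a disease-unrelated property (here, a made-up `brightness` value). Under that assumption, a trivial threshold rule scores highly by recognizing the source, and falls to roughly chance once source and label are decoupled. All names and numbers below are illustrative.

```python
import random


def make_dataset(n, confounded, seed=0):
    """Build synthetic (brightness, label) records.

    Hypothetical setup: 'brightness' depends only on the acquisition
    source, never on the disease label. When `confounded` is True, the
    label fully determines the source (all positives from source A,
    all negatives from source B).
    """
    rng = random.Random(seed)
    records = []
    for _ in range(n):
        label = 1 if rng.random() < 0.5 else 0
        if confounded:
            source = "A" if label == 1 else "B"
        else:
            # Source is drawn independently of the label.
            source = "A" if rng.random() < 0.5 else "B"
        mean = 0.6 if source == "A" else 0.4  # source-specific artifact
        brightness = rng.gauss(mean, 0.05)
        records.append((brightness, label))
    return records


def shortcut_classifier(brightness, threshold=0.5):
    """Predict 'positive' purely from the source artifact."""
    return 1 if brightness > threshold else 0


def accuracy(records):
    """Fraction of records the shortcut rule labels correctly."""
    correct = sum(shortcut_classifier(b) == y for b, y in records)
    return correct / len(records)


if __name__ == "__main__":
    print("confounded:", accuracy(make_dataset(2000, confounded=True)))
    print("decoupled: ", accuracy(make_dataset(2000, confounded=False)))
```

The gap between the two printed accuracies is the point of the sketch: a high score on a confounded dataset says little about whether a model has learned anything about the disease itself.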


