Improving Tuberculosis Detection in Chest X-ray Images through Transfer Learning and Deep Learning: A Comparative Study of CNN Architectures

Alex Mirugwe, Lillian Tamale, Juwa Nyirenda
{"title":"Improving Tuberculosis Detection in Chest X-ray Images through Transfer Learning and Deep Learning: A Comparative Study of CNN Architectures","authors":"Alex Mirugwe, Lillian Tamale, Juwa Nyirenda","doi":"10.1101/2024.08.02.24311396","DOIUrl":null,"url":null,"abstract":"Introduction: Tuberculosis remains a significant global health challenge, necessitating more efficient and accurate diagnostic methods.\nMethods: This study evaluates the performance of various convolutional neural network (CNN) architectures VGG16, VGG19, ResNet50, ResNet101, ResNet152, and Inception-ResNet-V2 in classifying chest X-ray (CXR) images as either normal or TB positive. The dataset comprised 4,200 CXR images, with 700 labeled as TB-positive and 3,500 as normal. We also examined the impact of data augmentation on model performance and analyzed the training times and the number of parameters for each architecture.\nResults: Our results showed that VGG16 outperformed the other models across all evaluation metrics, achieving an accuracy of 99.4%, precision of 97.9%, recall of 98.6%, F1-score of 98.3%, and AUC-ROC of 98.25%. Surprisingly, data augmentation did not improve performance, suggesting that the original dataset's diversity was sufficient. Furthermore, models with large numbers of parameters, such as ResNet152 and Inception-ResNet-V2, required longer training times without yielding proportionally better performance.\nDiscussion: These findings highlight the importance of selecting the appropriate model architecture based on task-specific requirements. While more complex models with larger parameter counts may seem advantageous, they do not necessarily offer superior performance and often come with increased computational costs.\nConclusion: The study demonstrates the potential of simpler models such as VGG16 to effectively diagnose TB from CXR images, providing a balance between performance and computational efficiency. This insight can guide future research and practical implementations in medical image classification.","PeriodicalId":501358,"journal":{"name":"medRxiv - Radiology and Imaging","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Radiology and Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.02.24311396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Tuberculosis remains a significant global health challenge, necessitating more efficient and accurate diagnostic methods. Methods: This study evaluates the performance of various convolutional neural network (CNN) architectures VGG16, VGG19, ResNet50, ResNet101, ResNet152, and Inception-ResNet-V2 in classifying chest X-ray (CXR) images as either normal or TB positive. The dataset comprised 4,200 CXR images, with 700 labeled as TB-positive and 3,500 as normal. We also examined the impact of data augmentation on model performance and analyzed the training times and the number of parameters for each architecture. Results: Our results showed that VGG16 outperformed the other models across all evaluation metrics, achieving an accuracy of 99.4%, precision of 97.9%, recall of 98.6%, F1-score of 98.3%, and AUC-ROC of 98.25%. Surprisingly, data augmentation did not improve performance, suggesting that the original dataset's diversity was sufficient. Furthermore, models with large numbers of parameters, such as ResNet152 and Inception-ResNet-V2, required longer training times without yielding proportionally better performance. Discussion: These findings highlight the importance of selecting the appropriate model architecture based on task-specific requirements. While more complex models with larger parameter counts may seem advantageous, they do not necessarily offer superior performance and often come with increased computational costs. Conclusion: The study demonstrates the potential of simpler models such as VGG16 to effectively diagnose TB from CXR images, providing a balance between performance and computational efficiency. This insight can guide future research and practical implementations in medical image classification.
通过迁移学习和深度学习改进胸部 X 光图像中的结核病检测:CNN 架构比较研究
简介:结核病仍然是全球健康的重大挑战:结核病仍然是全球健康面临的重大挑战,需要更高效、更准确的诊断方法:本研究评估了各种卷积神经网络(CNN)架构 VGG16、VGG19、ResNet50、ResNet101、ResNet152 和 Inception-ResNet-V2 在将胸部 X 光(CXR)图像分类为正常或肺结核阳性方面的性能。数据集包括 4,200 张 CXR 图像,其中 700 张标记为肺结核阳性,3,500 张标记为正常。我们还研究了数据增强对模型性能的影响,并分析了每种架构的训练时间和参数数量:结果显示,VGG16 在所有评估指标上都优于其他模型,准确率达到 99.4%,精确率达到 97.9%,召回率达到 98.6%,F1 分数达到 98.3%,AUC-ROC 达到 98.25%。令人惊讶的是,数据扩充并没有提高性能,这表明原始数据集的多样性已经足够。此外,具有大量参数的模型,如 ResNet152 和 Inception-ResNet-V2 需要更长的训练时间,但性能却没有相应提高:讨论:这些发现强调了根据特定任务要求选择适当模型架构的重要性。虽然参数数越多的复杂模型似乎越有利,但它们并不一定能提供更优越的性能,而且往往会增加计算成本:这项研究表明,VGG16 等较简单的模型具有通过 CXR 图像有效诊断肺结核的潜力,可在性能和计算效率之间取得平衡。这一见解可以指导未来医学图像分类的研究和实际应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信