A Two-Stage Deep Learning Strategy for Pneumothorax Classification

2021 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME) Pub Date : 2021-10-07 DOI:10.1109/ICECCME52200.2021.9590988

Yuchi Tian, Xiaodong Yang

{"title":"A Two-Stage Deep Learning Strategy for Pneumothorax Classification","authors":"Yuchi Tian, Xiaodong Yang","doi":"10.1109/ICECCME52200.2021.9590988","DOIUrl":null,"url":null,"abstract":"Due to pneumothorax lesions in chest X-ray images show wide and complicated variation in size, shape, and location within lung regions that overlap with many other anatomic structures such as ribs and vessels, it is a challenge to develop a reliable computer-aided diagnosis systems (CADs) for automatic pneumothorax screening. To address this challenge, we propose a new two-stage deep learning strategy: local feature learning (LFL) followed by global multiple instance learning (GMIL). The GMIL stage intends to train a model that regards the given image as a set of patches and determines whether or not the image contains pneumothorax based on the patches. More specifically, the GMIL model first extracts the hierarchical feature map of a given image by using convolution layer, and takes the feature map of the last layer as a set of depth instances. Each instance is then provided to additional layers to produce its contribution to the final image level prediction. However, the GMIL model trained directly using the original image may still fail to learn highly discriminative features when large areas of non-lesion regions are contained in the image space and thus adversely affect performance. To resolve this problem, prior to the GMIL stage, another model with identical convolutional layers is first trained in the LFL stage using normal patches and pneumothorax-infected patches so that it can better learn the key distinguishing features by reducing most of the non-lesion regions in the X-ray image. The pre-trained convolutional weights are then utilized via transfer learning to enhance training of the GMIL model. Experiments carried on the benchmark ChestX-ray14 data set demonstrate that the proposed learning strategy can achieve the most advanced performance on accuracy, area under receiver operating characteristic curve (AUC), recall, specificity, and F1 scores of 94.4±0. 7%, 97.3±0.5%, 94.6±1.5%, 94.2±0.4% and 94.4±0.7%, separately. We demonstrate the importance and effectiveness of reducing most of the non-lesion regions in the images for learning more discriminative features. The results show that our proposed CAD system is an effective auxiliary tool for screening pneumothorax.","PeriodicalId":102785,"journal":{"name":"2021 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCME52200.2021.9590988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Due to pneumothorax lesions in chest X-ray images show wide and complicated variation in size, shape, and location within lung regions that overlap with many other anatomic structures such as ribs and vessels, it is a challenge to develop a reliable computer-aided diagnosis systems (CADs) for automatic pneumothorax screening. To address this challenge, we propose a new two-stage deep learning strategy: local feature learning (LFL) followed by global multiple instance learning (GMIL). The GMIL stage intends to train a model that regards the given image as a set of patches and determines whether or not the image contains pneumothorax based on the patches. More specifically, the GMIL model first extracts the hierarchical feature map of a given image by using convolution layer, and takes the feature map of the last layer as a set of depth instances. Each instance is then provided to additional layers to produce its contribution to the final image level prediction. However, the GMIL model trained directly using the original image may still fail to learn highly discriminative features when large areas of non-lesion regions are contained in the image space and thus adversely affect performance. To resolve this problem, prior to the GMIL stage, another model with identical convolutional layers is first trained in the LFL stage using normal patches and pneumothorax-infected patches so that it can better learn the key distinguishing features by reducing most of the non-lesion regions in the X-ray image. The pre-trained convolutional weights are then utilized via transfer learning to enhance training of the GMIL model. Experiments carried on the benchmark ChestX-ray14 data set demonstrate that the proposed learning strategy can achieve the most advanced performance on accuracy, area under receiver operating characteristic curve (AUC), recall, specificity, and F1 scores of 94.4±0. 7%, 97.3±0.5%, 94.6±1.5%, 94.2±0.4% and 94.4±0.7%, separately. We demonstrate the importance and effectiveness of reducing most of the non-lesion regions in the images for learning more discriminative features. The results show that our proposed CAD system is an effective auxiliary tool for screening pneumothorax.

查看原文本刊更多论文

气胸分类的两阶段深度学习策略

由于胸片上的气胸病变在大小、形状和位置上表现出广泛而复杂的变化，并与许多其他解剖结构(如肋骨和血管)重叠，因此开发可靠的计算机辅助诊断系统(cad)用于自动气胸筛查是一项挑战。为了应对这一挑战，我们提出了一种新的两阶段深度学习策略:局部特征学习(LFL)，然后是全局多实例学习(GMIL)。GMIL阶段旨在训练一个模型，该模型将给定图像视为一组补丁，并根据补丁确定图像是否包含气胸。更具体地说，GMIL模型首先利用卷积层提取给定图像的分层特征图，并将最后一层的特征图作为深度实例集。然后将每个实例提供给其他层，以产生其对最终图像级别预测的贡献。然而，当图像空间中包含大面积的非损伤区域时，直接使用原始图像训练的GMIL模型仍然可能无法学习到高度判别性的特征，从而对性能产生不利影响。为了解决这个问题，在GMIL阶段之前，首先在LFL阶段使用正常斑块和气胸感染斑块训练另一个具有相同卷积层的模型，以便通过减少x射线图像中的大部分非病变区域来更好地学习关键的区分特征。然后通过迁移学习利用预训练的卷积权值来增强GMIL模型的训练。在基准的ChestX-ray14数据集上进行的实验表明，所提出的学习策略在准确率、接收者工作特征曲线下面积(AUC)、召回率、特异性和F1分数(94.4±0)方面达到了最高的性能。7%, 97.3±0.5%,94.6±1.5%,94.2±0.4%和94.4±0.7%,分别。我们证明了减少图像中大多数非损伤区域对于学习更多判别特征的重要性和有效性。结果表明，本文提出的CAD系统是气胸筛查的有效辅助工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)

自引率

0.00%

发文量