Classifying chronic obstructive pulmonary disease status using computed tomography imaging and convolutional neural networks: comparison of model input image types and training data severity.

IF 1.7 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Journal of Medical Imaging Pub Date : 2025-05-01 Epub Date: 2025-05-22 DOI:10.1117/1.JMI.12.3.034502

Sara Rezvanjou, Amir Moslemi, Samuel Peterson, Wan-Cheng Tan, James C Hogg, Jean Bourbeau, Joseph M Reinhardt, Miranda Kirby

{"title":"Classifying chronic obstructive pulmonary disease status using computed tomography imaging and convolutional neural networks: comparison of model input image types and training data severity.","authors":"Sara Rezvanjou, Amir Moslemi, Samuel Peterson, Wan-Cheng Tan, James C Hogg, Jean Bourbeau, Joseph M Reinhardt, Miranda Kirby","doi":"10.1117/1.JMI.12.3.034502","DOIUrl":null,"url":null,"abstract":"Purpose: Convolutional neural network (CNN)-based models using computed tomography images can classify chronic obstructive pulmonary disease (COPD) with high performance, but various input image types have been investigated, and it is unclear what image types are optimal. We propose a 2D airway-optimized topological multiplanar reformat (tMPR) input image and compare its performance with established 2D/3D input image types for COPD classification. As a secondary aim, we examined the impact of training on a dataset with predominantly mild COPD cases and testing on a more severe dataset to assess whether it improves generalizability.Approach: CanCOLD study participants were used for training/internal testing; SPIROMICS participants were used for external testing. Several 2D/3D input image types were adapted from the literature. In the proposed models, 2D airway-optimized tMPR images (to convey shape and interior/contextual information) and 3D output fusion of axial/sagittal/coronal images were investigated. The area-under-the-receiver-operator-curve (AUC) was used to evaluate model performance and Brier scores were used to evaluate model calibration. To further examine how training dataset severity impacts generalization, we compared model performance when trained on the milder CanCOLD dataset versus the more severe SPIROMICS dataset, and vice versa.Results: A total of <math><mrow><mi>n</mi> <mo>=</mo> <mn>742</mn></mrow> </math> CanCOLD participants were used for training/validation and <math><mrow><mi>n</mi> <mo>=</mo> <mn>309</mn></mrow> </math> for testing; <math><mrow><mi>n</mi> <mo>=</mo> <mn>448</mn></mrow> </math> SPIROMICS participants were used for external testing. For the CanCOLD and SPIROMICS test set, the proposed 2D tMPR on its own (CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.79</mn></mrow> </math> ; SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.94</mn></mrow> </math> ) and combined with the 3D axial/coronal/sagittal lung view (CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.82</mn></mrow> </math> ; SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.93</mn></mrow> </math> ) had the highest performance. The combined 2D tMPR and 3D axial/coronal/sagittal lung view had the lowest Brier score (CanCOLD: score = 0.16; SPIROMICS: score = 0.24). Conversely, using SPIROMICS for training/testing and CanCOLD for external testing resulted in lower performance when tested on CanCOLD for 2D tMPR on its own (SPIROMICS: AUC = 0.92; CanCOLD: AUC = 0.74) and when combined with the 3D axial/coronal/sagittal lung view (SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.92</mn></mrow> </math> ; CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.75</mn></mrow> </math> ).Conclusions: The CNN-based model with the combined 2D tMPR images and 3D lung view as input image types had the highest performance for COPD classification, highlighting the importance of airway information and that the fusion of different types of information as input image can improve CNN-based model performance. In addition, models trained on CanCOLD demonstrated strong generalization to the more severe SPIROMICS cohort, whereas training on SPIROMICS resulted in lower performance when tested on CanCOLD. These findings suggest that training on milder COPD cases may improve classification performance across the disease spectrum.","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 3","pages":"034502"},"PeriodicalIF":1.7000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12097752/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.3.034502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/22 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Convolutional neural network (CNN)-based models using computed tomography images can classify chronic obstructive pulmonary disease (COPD) with high performance, but various input image types have been investigated, and it is unclear what image types are optimal. We propose a 2D airway-optimized topological multiplanar reformat (tMPR) input image and compare its performance with established 2D/3D input image types for COPD classification. As a secondary aim, we examined the impact of training on a dataset with predominantly mild COPD cases and testing on a more severe dataset to assess whether it improves generalizability.

Approach: CanCOLD study participants were used for training/internal testing; SPIROMICS participants were used for external testing. Several 2D/3D input image types were adapted from the literature. In the proposed models, 2D airway-optimized tMPR images (to convey shape and interior/contextual information) and 3D output fusion of axial/sagittal/coronal images were investigated. The area-under-the-receiver-operator-curve (AUC) was used to evaluate model performance and Brier scores were used to evaluate model calibration. To further examine how training dataset severity impacts generalization, we compared model performance when trained on the milder CanCOLD dataset versus the more severe SPIROMICS dataset, and vice versa.

Results: A total of $n = 742$ CanCOLD participants were used for training/validation and $n = 309$ for testing; $n = 448$ SPIROMICS participants were used for external testing. For the CanCOLD and SPIROMICS test set, the proposed 2D tMPR on its own (CanCOLD: $AUC = 0.79$ ; SPIROMICS: $AUC = 0.94$ ) and combined with the 3D axial/coronal/sagittal lung view (CanCOLD: $AUC = 0.82$ ; SPIROMICS: $AUC = 0.93$ ) had the highest performance. The combined 2D tMPR and 3D axial/coronal/sagittal lung view had the lowest Brier score (CanCOLD: score = 0.16; SPIROMICS: score = 0.24). Conversely, using SPIROMICS for training/testing and CanCOLD for external testing resulted in lower performance when tested on CanCOLD for 2D tMPR on its own (SPIROMICS: AUC = 0.92; CanCOLD: AUC = 0.74) and when combined with the 3D axial/coronal/sagittal lung view (SPIROMICS: $AUC = 0.92$ ; CanCOLD: $AUC = 0.75$ ).

Conclusions: The CNN-based model with the combined 2D tMPR images and 3D lung view as input image types had the highest performance for COPD classification, highlighting the importance of airway information and that the fusion of different types of information as input image can improve CNN-based model performance. In addition, models trained on CanCOLD demonstrated strong generalization to the more severe SPIROMICS cohort, whereas training on SPIROMICS resulted in lower performance when tested on CanCOLD. These findings suggest that training on milder COPD cases may improve classification performance across the disease spectrum.

查看原文本刊更多论文

使用计算机断层成像和卷积神经网络对慢性阻塞性肺疾病状态进行分类：模型输入图像类型和训练数据严重程度的比较

目的：基于卷积神经网络（CNN）的模型使用计算机断层扫描图像对慢性阻塞性肺疾病（COPD）进行了高效分类，但各种输入图像类型已经被研究过，目前尚不清楚哪种图像类型是最优的。我们提出了一种二维气道优化的拓扑多平面重构（tMPR）输入图像，并将其与已建立的二维/三维输入图像类型进行COPD分类比较。作为次要目标，我们研究了在以轻度COPD病例为主的数据集上进行训练的影响，并在更严重的数据集上进行测试，以评估其是否提高了通用性。方法：使用CanCOLD研究参与者进行培训/内部测试；SPIROMICS参与者用于外部测试。几种2D/3D输入图像类型改编自文献。在所提出的模型中，研究了二维气道优化的tMPR图像（以传达形状和内部/上下文信息）和轴/矢状/冠状图像的三维输出融合。采用受者-操作者曲线下面积（AUC）评价模型性能，采用Brier评分评价模型校准。为了进一步研究训练数据严重程度如何影响泛化，我们比较了在较温和的canold数据集和较严重的SPIROMICS数据集上训练时的模型性能，反之亦然。结果：共有n = 742名canold参与者用于培训/验证，n = 309名参与者用于测试；n = 448名SPIROMICS参与者用于外部测试。对于CanCOLD和SPIROMICS测试集，建议的2D tMPR单独使用(CanCOLD: AUC = 0.79；肺活学：AUC = 0.94)，并结合三维轴位/冠状位/矢状位肺视图(CanCOLD: AUC = 0.82；SPIROMICS: AUC = 0.93)表现最佳。二维tMPR和三维轴位/冠状位/矢状位肺视图的Brier评分最低(CanCOLD：评分= 0.16；SPIROMICS：得分= 0.24)。相反，使用SPIROMICS进行训练/测试，使用CanCOLD进行外部测试，当使用CanCOLD单独进行2D tMPR测试时，结果会导致较低的性能(SPIROMICS: AUC = 0.92；CanCOLD: AUC = 0.74)，并结合三维轴位/冠状位/矢状位肺视图(SPIROMICS: AUC = 0.92；CanCOLD: AUC = 0.75)。结论：以二维tMPR图像和三维肺视图联合作为输入图像类型的cnn模型对COPD的分类效果最好，突出了气道信息的重要性，融合不同类型的信息作为输入图像可以提高cnn模型的分类效果。此外，在canold上训练的模型显示出对更严重的SPIROMICS队列的强泛化，而在canold上测试时，SPIROMICS训练导致较低的性能。这些发现表明，对较轻的COPD病例进行培训可能会提高整个疾病谱系的分类表现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.10

自引率

4.20%

发文量

期刊介绍： JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.