Classifying chronic obstructive pulmonary disease status using computed tomography imaging and convolutional neural networks: comparison of model input image types and training data severity.
IF 1.9 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Sara Rezvanjou, Amir Moslemi, Samuel Peterson, Wan-Cheng Tan, James C Hogg, Jean Bourbeau, Joseph M Reinhardt, Miranda Kirby
{"title":"Classifying chronic obstructive pulmonary disease status using computed tomography imaging and convolutional neural networks: comparison of model input image types and training data severity.","authors":"Sara Rezvanjou, Amir Moslemi, Samuel Peterson, Wan-Cheng Tan, James C Hogg, Jean Bourbeau, Joseph M Reinhardt, Miranda Kirby","doi":"10.1117/1.JMI.12.3.034502","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Convolutional neural network (CNN)-based models using computed tomography images can classify chronic obstructive pulmonary disease (COPD) with high performance, but various input image types have been investigated, and it is unclear what image types are optimal. We propose a 2D airway-optimized topological multiplanar reformat (tMPR) input image and compare its performance with established 2D/3D input image types for COPD classification. As a secondary aim, we examined the impact of training on a dataset with predominantly mild COPD cases and testing on a more severe dataset to assess whether it improves generalizability.</p><p><strong>Approach: </strong>CanCOLD study participants were used for training/internal testing; SPIROMICS participants were used for external testing. Several 2D/3D input image types were adapted from the literature. In the proposed models, 2D airway-optimized tMPR images (to convey shape and interior/contextual information) and 3D output fusion of axial/sagittal/coronal images were investigated. The area-under-the-receiver-operator-curve (AUC) was used to evaluate model performance and Brier scores were used to evaluate model calibration. To further examine how training dataset severity impacts generalization, we compared model performance when trained on the milder CanCOLD dataset versus the more severe SPIROMICS dataset, and vice versa.</p><p><strong>Results: </strong>A total of <math><mrow><mi>n</mi> <mo>=</mo> <mn>742</mn></mrow> </math> CanCOLD participants were used for training/validation and <math><mrow><mi>n</mi> <mo>=</mo> <mn>309</mn></mrow> </math> for testing; <math><mrow><mi>n</mi> <mo>=</mo> <mn>448</mn></mrow> </math> SPIROMICS participants were used for external testing. For the CanCOLD and SPIROMICS test set, the proposed 2D tMPR on its own (CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.79</mn></mrow> </math> ; SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.94</mn></mrow> </math> ) and combined with the 3D axial/coronal/sagittal lung view (CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.82</mn></mrow> </math> ; SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.93</mn></mrow> </math> ) had the highest performance. The combined 2D tMPR and 3D axial/coronal/sagittal lung view had the lowest Brier score (CanCOLD: score = 0.16; SPIROMICS: score = 0.24). Conversely, using SPIROMICS for training/testing and CanCOLD for external testing resulted in lower performance when tested on CanCOLD for 2D tMPR on its own (SPIROMICS: AUC = 0.92; CanCOLD: AUC = 0.74) and when combined with the 3D axial/coronal/sagittal lung view (SPIROMICS: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.92</mn></mrow> </math> ; CanCOLD: <math><mrow><mi>AUC</mi> <mo>=</mo> <mn>0.75</mn></mrow> </math> ).</p><p><strong>Conclusions: </strong>The CNN-based model with the combined 2D tMPR images and 3D lung view as input image types had the highest performance for COPD classification, highlighting the importance of airway information and that the fusion of different types of information as input image can improve CNN-based model performance. In addition, models trained on CanCOLD demonstrated strong generalization to the more severe SPIROMICS cohort, whereas training on SPIROMICS resulted in lower performance when tested on CanCOLD. These findings suggest that training on milder COPD cases may improve classification performance across the disease spectrum.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 3","pages":"034502"},"PeriodicalIF":1.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12097752/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.12.3.034502","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/22 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Convolutional neural network (CNN)-based models using computed tomography images can classify chronic obstructive pulmonary disease (COPD) with high performance, but various input image types have been investigated, and it is unclear what image types are optimal. We propose a 2D airway-optimized topological multiplanar reformat (tMPR) input image and compare its performance with established 2D/3D input image types for COPD classification. As a secondary aim, we examined the impact of training on a dataset with predominantly mild COPD cases and testing on a more severe dataset to assess whether it improves generalizability.
Approach: CanCOLD study participants were used for training/internal testing; SPIROMICS participants were used for external testing. Several 2D/3D input image types were adapted from the literature. In the proposed models, 2D airway-optimized tMPR images (to convey shape and interior/contextual information) and 3D output fusion of axial/sagittal/coronal images were investigated. The area-under-the-receiver-operator-curve (AUC) was used to evaluate model performance and Brier scores were used to evaluate model calibration. To further examine how training dataset severity impacts generalization, we compared model performance when trained on the milder CanCOLD dataset versus the more severe SPIROMICS dataset, and vice versa.
Results: A total of CanCOLD participants were used for training/validation and for testing; SPIROMICS participants were used for external testing. For the CanCOLD and SPIROMICS test set, the proposed 2D tMPR on its own (CanCOLD: ; SPIROMICS: ) and combined with the 3D axial/coronal/sagittal lung view (CanCOLD: ; SPIROMICS: ) had the highest performance. The combined 2D tMPR and 3D axial/coronal/sagittal lung view had the lowest Brier score (CanCOLD: score = 0.16; SPIROMICS: score = 0.24). Conversely, using SPIROMICS for training/testing and CanCOLD for external testing resulted in lower performance when tested on CanCOLD for 2D tMPR on its own (SPIROMICS: AUC = 0.92; CanCOLD: AUC = 0.74) and when combined with the 3D axial/coronal/sagittal lung view (SPIROMICS: ; CanCOLD: ).
Conclusions: The CNN-based model with the combined 2D tMPR images and 3D lung view as input image types had the highest performance for COPD classification, highlighting the importance of airway information and that the fusion of different types of information as input image can improve CNN-based model performance. In addition, models trained on CanCOLD demonstrated strong generalization to the more severe SPIROMICS cohort, whereas training on SPIROMICS resulted in lower performance when tested on CanCOLD. These findings suggest that training on milder COPD cases may improve classification performance across the disease spectrum.
期刊介绍:
JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.