{"title":"Deep learning and machine learning in CT-based COPD diagnosis: Systematic review and meta-analysis","authors":"Qian Wu, Hui Guo, Ruihan Li, Jinhuan Han","doi":"10.1016/j.ijmedinf.2025.105812","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>With advancements in medical technology and science, chronic obstructive pulmonary disease (COPD), one of the world’s three major chronic diseases, has seen numerous remarkable outcomes when combined with artificial intelligence, particularly in disease diagnosis. However, the diagnostic performance of these AI models still lacks comprehensive evidence. Therefore, this study quantitatively analyzed the diagnostic performance of AI models in CT images of COPD patients, aiming to promote the development of related research in the future.</div></div><div><h3>Methods</h3><div>PubMed, Cochrane Library, Web of Science, and Embase were retrieved up to September 1, 2024. The QUADAS-2 evaluation tool was used to assess the quality of the included studies. Meta-analysis of the included researches was performed using Stata18, RevMan 5.4, and Meta-Disc 1.4 software to merge sensitivity, specificity and plot a summary receiver operating characteristic curve (SROC). Heterogeneity was assessed using the Q statistic, and sources of inter-study heterogeneity were explored through <em>meta</em>-regression analysis.</div></div><div><h3>Results</h3><div>Twenty-two of 3280 identified studies were eligible. Meta-analysis was performed on 15 of these studies, encompassing a total of 22,817 patients for which statistical metrics were reported or could be calculated. Seven studies were based on deep learning (DL) model, three on machine learning (ML) model, and five on DL model with multiple-instance learning (MIL) mechanisms. One study evaluated both DL and ML models. The <em>meta</em>-analysis results showed that the pooled sensitivity of all DL and ML models was 86 % (95 %CI 78–91 %), specificity was 87 % (95 %CI 83–91 %), and area under the curve was 93 % (95 %CI 90–95 %). Subgroup analyses revealed no significant difference in diagnostic sensitivity and specificity between DL and ML models (sensitivity 82 % (95 %CI 76–87 %), 93 % (95 %CI 85–97 %); specificity 87 % (95 %CI 79–91 %), 84 % (95 %CI 79–88 %), and the DL model with MIL (sensitivity 87 % (95 %CI 61–96 %); specificity 89 % (95 %CI 78–95 %) improved the performance of DL model, but this improvement was not statistically significant (p > 0.05).</div></div><div><h3>Conclusion</h3><div>Both DL and ML models for diagnosing COPD using CT images exhibited high accuracy. There was no significant difference in diagnostic efficacy between the two types of AI models, and the addition of the MIL mechanism may enhance the performance of the DL model.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"196 ","pages":"Article 105812"},"PeriodicalIF":3.7000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625000292","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
With advancements in medical technology and science, chronic obstructive pulmonary disease (COPD), one of the world’s three major chronic diseases, has seen numerous remarkable outcomes when combined with artificial intelligence, particularly in disease diagnosis. However, the diagnostic performance of these AI models still lacks comprehensive evidence. Therefore, this study quantitatively analyzed the diagnostic performance of AI models in CT images of COPD patients, aiming to promote the development of related research in the future.
Methods
PubMed, Cochrane Library, Web of Science, and Embase were retrieved up to September 1, 2024. The QUADAS-2 evaluation tool was used to assess the quality of the included studies. Meta-analysis of the included researches was performed using Stata18, RevMan 5.4, and Meta-Disc 1.4 software to merge sensitivity, specificity and plot a summary receiver operating characteristic curve (SROC). Heterogeneity was assessed using the Q statistic, and sources of inter-study heterogeneity were explored through meta-regression analysis.
Results
Twenty-two of 3280 identified studies were eligible. Meta-analysis was performed on 15 of these studies, encompassing a total of 22,817 patients for which statistical metrics were reported or could be calculated. Seven studies were based on deep learning (DL) model, three on machine learning (ML) model, and five on DL model with multiple-instance learning (MIL) mechanisms. One study evaluated both DL and ML models. The meta-analysis results showed that the pooled sensitivity of all DL and ML models was 86 % (95 %CI 78–91 %), specificity was 87 % (95 %CI 83–91 %), and area under the curve was 93 % (95 %CI 90–95 %). Subgroup analyses revealed no significant difference in diagnostic sensitivity and specificity between DL and ML models (sensitivity 82 % (95 %CI 76–87 %), 93 % (95 %CI 85–97 %); specificity 87 % (95 %CI 79–91 %), 84 % (95 %CI 79–88 %), and the DL model with MIL (sensitivity 87 % (95 %CI 61–96 %); specificity 89 % (95 %CI 78–95 %) improved the performance of DL model, but this improvement was not statistically significant (p > 0.05).
Conclusion
Both DL and ML models for diagnosing COPD using CT images exhibited high accuracy. There was no significant difference in diagnostic efficacy between the two types of AI models, and the addition of the MIL mechanism may enhance the performance of the DL model.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.