Gerlig Widmann, Anna Katharina Luger, Thomas Sonnweber, Christoph Schwabl, Katharina Cima, Anna Katharina Gerstner, Alex Pizzini, Sabina Sahanic, Anna Boehm, Maxmilian Coen, Ewald Wöll, Günter Weiss, Rudolf Kirchmair, Leonhard Gruber, Gudrun M Feuchtner, Ivan Tancevski, Judith Löffler-Ragg, Piotr Tymoszuk
{"title":"Machine Learning Based Multi-Parameter Modeling for Prediction of Post-Inflammatory Lung Changes.","authors":"Gerlig Widmann, Anna Katharina Luger, Thomas Sonnweber, Christoph Schwabl, Katharina Cima, Anna Katharina Gerstner, Alex Pizzini, Sabina Sahanic, Anna Boehm, Maxmilian Coen, Ewald Wöll, Günter Weiss, Rudolf Kirchmair, Leonhard Gruber, Gudrun M Feuchtner, Ivan Tancevski, Judith Löffler-Ragg, Piotr Tymoszuk","doi":"10.3390/diagnostics15060783","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objectives</b>: Prediction of lung function deficits following pulmonary infection is challenging and suffers from inaccuracy. We sought to develop machine-learning models for prediction of post-inflammatory lung changes based on COVID-19 recovery data. <b>Methods</b>: In the prospective CovILD study (<i>n</i> = 420 longitudinal observations from <i>n</i> = 140 COVID-19 survivors), data on lung function testing (LFT), chest CT including severity scoring by a human radiologist and density measurement by artificial intelligence, demography, and persistent symptoms were collected. This information was used to develop models of numeric readouts and abnormalities of LFT with four machine learning algorithms (Random Forest, gradient boosted machines, neural network, and support vector machines). <b>Results</b>: Reduced DLCO (diffusion capacity for carbon monoxide <80% of reference) was found in 94 (22%) observations. Those observations were modeled with a cross-validated accuracy of 82-85%, AUC of 0.87-0.9, and Cohen's κ of 0.45-0.5. No reliable models could be established for FEV1 or FVC. For DLCO as a continuous variable, three machine learning algorithms yielded meaningful models with cross-validated mean absolute errors of 11.6-12.5% and R<sup>2</sup> of 0.26-0.34. CT-derived features such as opacity, high opacity, and CT severity score were among the most influential predictors of DLCO impairment. <b>Conclusions</b>: Multi-parameter machine learning trained with demographic, clinical, and artificial intelligence chest CT data reliably and reproducibly predicts LFT deficits and outperforms single markers of lung pathology and human radiologist's assessment. It may improve diagnostic and foster personalized treatment.</p>","PeriodicalId":11225,"journal":{"name":"Diagnostics","volume":"15 6","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11941013/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3390/diagnostics15060783","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Prediction of lung function deficits following pulmonary infection is challenging and suffers from inaccuracy. We sought to develop machine-learning models for prediction of post-inflammatory lung changes based on COVID-19 recovery data. Methods: In the prospective CovILD study (n = 420 longitudinal observations from n = 140 COVID-19 survivors), data on lung function testing (LFT), chest CT including severity scoring by a human radiologist and density measurement by artificial intelligence, demography, and persistent symptoms were collected. This information was used to develop models of numeric readouts and abnormalities of LFT with four machine learning algorithms (Random Forest, gradient boosted machines, neural network, and support vector machines). Results: Reduced DLCO (diffusion capacity for carbon monoxide <80% of reference) was found in 94 (22%) observations. Those observations were modeled with a cross-validated accuracy of 82-85%, AUC of 0.87-0.9, and Cohen's κ of 0.45-0.5. No reliable models could be established for FEV1 or FVC. For DLCO as a continuous variable, three machine learning algorithms yielded meaningful models with cross-validated mean absolute errors of 11.6-12.5% and R2 of 0.26-0.34. CT-derived features such as opacity, high opacity, and CT severity score were among the most influential predictors of DLCO impairment. Conclusions: Multi-parameter machine learning trained with demographic, clinical, and artificial intelligence chest CT data reliably and reproducibly predicts LFT deficits and outperforms single markers of lung pathology and human radiologist's assessment. It may improve diagnostic and foster personalized treatment.
DiagnosticsBiochemistry, Genetics and Molecular Biology-Clinical Biochemistry
CiteScore
4.70
自引率
8.30%
发文量
2699
审稿时长
19.64 days
期刊介绍:
Diagnostics (ISSN 2075-4418) is an international scholarly open access journal on medical diagnostics. It publishes original research articles, reviews, communications and short notes on the research and development of medical diagnostics. There is no restriction on the length of the papers. Our aim is to encourage scientists to publish their experimental and theoretical research in as much detail as possible. Full experimental and/or methodological details must be provided for research articles.