Identification of Idiopathic Pulmonary Fibrosis and Prediction of Disease Severity via Machine Learning Analysis of Comprehensive Metabolic Panel and Complete Blood Count Data.
Alex N Mueller, Hunter A Miller, Matthew J Taylor, Sally A Suliman, Hermann B Frieboes
{"title":"Identification of Idiopathic Pulmonary Fibrosis and Prediction of Disease Severity via Machine Learning Analysis of Comprehensive Metabolic Panel and Complete Blood Count Data.","authors":"Alex N Mueller, Hunter A Miller, Matthew J Taylor, Sally A Suliman, Hermann B Frieboes","doi":"10.1007/s00408-024-00673-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Diagnosis of idiopathic pulmonary fibrosis (IPF) typically relies on high-resolution computed tomography imaging (HRCT) or histopathology, while monitoring disease severity is done via frequent pulmonary function testing (PFT). More reliable and convenient methods of diagnosing fibrotic interstitial lung disease (ILD) type and monitoring severity would allow for early identification and enhance current therapeutic interventions. This study tested the hypothesis that a machine learning (ML) ensemble analysis of comprehensive metabolic panel (CMP) and complete blood count (CBC) data can accurately distinguish IPF from connective tissue disease ILD (CTD-ILD) and predict disease severity as seen with PFT.</p><p><strong>Methods: </strong>Outpatient data with diagnosis of IPF or CTD-ILD (n = 103 visits by 53 patients) were analyzed via ML methodology to evaluate (1) IPF vs CTD-ILD diagnosis; (2) %predicted Diffusing Capacity of Lung for Carbon Monoxide (DLCO) moderate or mild vs severe; (3) %predicted Forced Vital Capacity (FVC) moderate or mild vs severe; and (4) %predicted FVC mild vs moderate or severe.</p><p><strong>Results: </strong>ML methodology identified IPF from CTD-ILD with AUC<sub>TEST</sub> = 0.893, while PFT was classified as DLCO moderate or mild vs severe with AUC<sub>TEST</sub> = 0.749, FVC moderate or mild vs severe with AUC<sub>TEST</sub> = 0.741, and FVC mild vs moderate or severe with AUC<sub>TEST</sub> = 0.739. Key features included albumin, alanine transaminase, %lymphocytes, hemoglobin, %eosinophils, white blood cell count, %monocytes, and %neutrophils.</p><p><strong>Conclusion: </strong>Analysis of CMP and CBC data via proposed ML methodology offers the potential to distinguish IPF from CTD-ILD and predict severity on associated PFT with accuracy that meets or exceeds current clinical practice.</p>","PeriodicalId":18163,"journal":{"name":"Lung","volume":" ","pages":"139-150"},"PeriodicalIF":4.6000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lung","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00408-024-00673-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Diagnosis of idiopathic pulmonary fibrosis (IPF) typically relies on high-resolution computed tomography imaging (HRCT) or histopathology, while monitoring disease severity is done via frequent pulmonary function testing (PFT). More reliable and convenient methods of diagnosing fibrotic interstitial lung disease (ILD) type and monitoring severity would allow for early identification and enhance current therapeutic interventions. This study tested the hypothesis that a machine learning (ML) ensemble analysis of comprehensive metabolic panel (CMP) and complete blood count (CBC) data can accurately distinguish IPF from connective tissue disease ILD (CTD-ILD) and predict disease severity as seen with PFT.
Methods: Outpatient data with diagnosis of IPF or CTD-ILD (n = 103 visits by 53 patients) were analyzed via ML methodology to evaluate (1) IPF vs CTD-ILD diagnosis; (2) %predicted Diffusing Capacity of Lung for Carbon Monoxide (DLCO) moderate or mild vs severe; (3) %predicted Forced Vital Capacity (FVC) moderate or mild vs severe; and (4) %predicted FVC mild vs moderate or severe.
Results: ML methodology identified IPF from CTD-ILD with AUCTEST = 0.893, while PFT was classified as DLCO moderate or mild vs severe with AUCTEST = 0.749, FVC moderate or mild vs severe with AUCTEST = 0.741, and FVC mild vs moderate or severe with AUCTEST = 0.739. Key features included albumin, alanine transaminase, %lymphocytes, hemoglobin, %eosinophils, white blood cell count, %monocytes, and %neutrophils.
Conclusion: Analysis of CMP and CBC data via proposed ML methodology offers the potential to distinguish IPF from CTD-ILD and predict severity on associated PFT with accuracy that meets or exceeds current clinical practice.
期刊介绍:
Lung publishes original articles, reviews and editorials on all aspects of the healthy and diseased lungs, of the airways, and of breathing. Epidemiological, clinical, pathophysiological, biochemical, and pharmacological studies fall within the scope of the journal. Case reports, short communications and technical notes can be accepted if they are of particular interest.