Daniel R Balcarcel, Mark V Mai, Sanjiv D Mehta, Kathleen Chiotos, L Nelson Sanchez-Pinto, Blanca E Himes, Nadir Yehya
{"title":"Development and Validation of an Electronic Health Record-Based, Pediatric Acute Respiratory Distress Syndrome Subphenotype Classifier Model.","authors":"Daniel R Balcarcel, Mark V Mai, Sanjiv D Mehta, Kathleen Chiotos, L Nelson Sanchez-Pinto, Blanca E Himes, Nadir Yehya","doi":"10.1097/PCC.0000000000003709","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To determine if hyperinflammatory and hypoinflammatory pediatric acute respiratory distress syndrome (PARDS) subphenotypes defined using serum biomarkers can be determined solely from electronic health record (EHR) data using machine learning.</p><p><strong>Design: </strong>Retrospective, exploratory analysis using data from 2014 to 2022.</p><p><strong>Setting: </strong>Single-center quaternary care PICU.</p><p><strong>Patients: </strong>Two temporally distinct cohorts of PARDS patients, 2014-2019 and 2019-2022.</p><p><strong>Interventions: </strong>None.</p><p><strong>Measurements and main results: </strong>Patients in the derivation cohort (n = 333) were assigned to hyperinflammatory or hypoinflammatory subphenotypes using biomarkers and latent class analysis. A machine learning model was trained on 165 EHR-derived variables to identify subphenotypes. The most important variables were selected for inclusion in a parsimonious model. The model was validated in a separate cohort (n = 114). The EHR-based classifier achieved an area under the receiver operating characteristic curve (AUC) of 0.93 (95% CI, 0.87-0.98), with a sensitivity of 88% and specificity of 83% for determining hyperinflammatory PARDS. The parsimonious model, using only five laboratory values, achieved an AUC of 0.92 (95% CI, 0.86-0.98) with a sensitivity of 76% and specificity of 87% in the validation cohort.</p><p><strong>Conclusions: </strong>This proof-of-concept study demonstrates that biomarker-based PARDS subphenotypes can be identified using EHR data at 24 hours of PARDS diagnosis. Further validation in larger, multicenter cohorts is needed to confirm the clinical utility of this approach.</p>","PeriodicalId":19760,"journal":{"name":"Pediatric Critical Care Medicine","volume":" ","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pediatric Critical Care Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/PCC.0000000000003709","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CRITICAL CARE MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To determine if hyperinflammatory and hypoinflammatory pediatric acute respiratory distress syndrome (PARDS) subphenotypes defined using serum biomarkers can be determined solely from electronic health record (EHR) data using machine learning.
Design: Retrospective, exploratory analysis using data from 2014 to 2022.
Setting: Single-center quaternary care PICU.
Patients: Two temporally distinct cohorts of PARDS patients, 2014-2019 and 2019-2022.
Interventions: None.
Measurements and main results: Patients in the derivation cohort (n = 333) were assigned to hyperinflammatory or hypoinflammatory subphenotypes using biomarkers and latent class analysis. A machine learning model was trained on 165 EHR-derived variables to identify subphenotypes. The most important variables were selected for inclusion in a parsimonious model. The model was validated in a separate cohort (n = 114). The EHR-based classifier achieved an area under the receiver operating characteristic curve (AUC) of 0.93 (95% CI, 0.87-0.98), with a sensitivity of 88% and specificity of 83% for determining hyperinflammatory PARDS. The parsimonious model, using only five laboratory values, achieved an AUC of 0.92 (95% CI, 0.86-0.98) with a sensitivity of 76% and specificity of 87% in the validation cohort.
Conclusions: This proof-of-concept study demonstrates that biomarker-based PARDS subphenotypes can be identified using EHR data at 24 hours of PARDS diagnosis. Further validation in larger, multicenter cohorts is needed to confirm the clinical utility of this approach.
期刊介绍:
Pediatric Critical Care Medicine is written for the entire critical care team: pediatricians, neonatologists, respiratory therapists, nurses, and others who deal with pediatric patients who are critically ill or injured. International in scope, with editorial board members and contributors from around the world, the Journal includes a full range of scientific content, including clinical articles, scientific investigations, solicited reviews, and abstracts from pediatric critical care meetings. Additionally, the Journal includes abstracts of selected articles published in Chinese, French, Italian, Japanese, Portuguese, and Spanish translations - making news of advances in the field available to pediatric and neonatal intensive care practitioners worldwide.