{"title":"Development and validation of an explainable model of brain injury in premature infants: A prospective cohort study.","authors":"Zhijie He, Ruiqi Zhang, Pengfei Qu, Yuxuan Meng, Jinrui Jia, Zhibo Wang, Peng Wang, Yu Ni, Li Shan, Mingzhi Liao, Yajun Li","doi":"10.1016/j.cmpb.2024.108559","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Preterm brain injury (PBI) is a prevalent complication in preterm infants, leading to the destruction of critical structural and functional brain connections and placing a significant burden on families. The timely detection of PBI is of paramount importance for the prevention and treatment of the condition. However, the absence of specific clinical manifestations in the early stages of PBI renders it susceptible to misdiagnosis and missed diagnoses. Moreover, once it occurs, there is no specific treatment available. The aim of this study was to develop and validate a machine learning (ML) based interpretable model for the early detection of PBI, as well as the assessment of patient-wide and individual risk factors for this disease.</p><p><strong>Methods: </strong>This study utilized a cohort of premature infants provided by Northwest Women's and Children's Hospital in China, comprising medical records of 650 premature infants, spanning from 2019 to 2021. PBI were identified based on cranial magnetic resonance imaging (MRI). Fourteen machine learning models were employed with stratified 10-fold cross-validation method used to evaluate model performance. The Shapley Additive Explanations (SHAP) method was applied for model interpretation. Feature selection methods were used to determine the final model which was validated on the independent test set. Subsequently, risk factors for the entire cohort and individual patients were assessed.</p><p><strong>Results: </strong>Among the fourteen machine learning models, the CatBoost model demonstrated the best discriminative ability. Following feature selection, the final model was constructed using seven features, designated as PBIPred (Preterm Brain Injury Predictor). PBIPred exhibited strong performance in both 10-fold cross-validation and independent test set (AUC = 0.8229) for accurately predicting PBI. The screening for risk factors in the cohort and individuals identified the following variables as positive risk factors for PBI: Mechanical ventilation (MV), Weight, Anemia of prematurity (AOP), Respiratory distress syndrome (RDS), Albumin (ALB), and White blood cell (WBC).</p><p><strong>Availability and implementation: </strong>The PBIPred webserver and PBIPred tool were developed for clinical diagnosis and large-scale local medical record data prediction. They can be accessed freely at http://pbipred.liaolab.net and https://github.com/chikit2077/PBIPred, respectively.</p>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"108559"},"PeriodicalIF":4.9000,"publicationDate":"2024-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.cmpb.2024.108559","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Preterm brain injury (PBI) is a prevalent complication in preterm infants, leading to the destruction of critical structural and functional brain connections and placing a significant burden on families. The timely detection of PBI is of paramount importance for the prevention and treatment of the condition. However, the absence of specific clinical manifestations in the early stages of PBI renders it susceptible to misdiagnosis and missed diagnoses. Moreover, once it occurs, there is no specific treatment available. The aim of this study was to develop and validate a machine learning (ML) based interpretable model for the early detection of PBI, as well as the assessment of patient-wide and individual risk factors for this disease.
Methods: This study utilized a cohort of premature infants provided by Northwest Women's and Children's Hospital in China, comprising medical records of 650 premature infants, spanning from 2019 to 2021. PBI were identified based on cranial magnetic resonance imaging (MRI). Fourteen machine learning models were employed with stratified 10-fold cross-validation method used to evaluate model performance. The Shapley Additive Explanations (SHAP) method was applied for model interpretation. Feature selection methods were used to determine the final model which was validated on the independent test set. Subsequently, risk factors for the entire cohort and individual patients were assessed.
Results: Among the fourteen machine learning models, the CatBoost model demonstrated the best discriminative ability. Following feature selection, the final model was constructed using seven features, designated as PBIPred (Preterm Brain Injury Predictor). PBIPred exhibited strong performance in both 10-fold cross-validation and independent test set (AUC = 0.8229) for accurately predicting PBI. The screening for risk factors in the cohort and individuals identified the following variables as positive risk factors for PBI: Mechanical ventilation (MV), Weight, Anemia of prematurity (AOP), Respiratory distress syndrome (RDS), Albumin (ALB), and White blood cell (WBC).
Availability and implementation: The PBIPred webserver and PBIPred tool were developed for clinical diagnosis and large-scale local medical record data prediction. They can be accessed freely at http://pbipred.liaolab.net and https://github.com/chikit2077/PBIPred, respectively.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.