{"title":"Development and validation of a nomogram-based prognostic model to predict coronary artery lesions in Kawasaki disease from 6847 children in China","authors":"Changjian Li , Huayong Zhang , Wei Yin , Yong Zhang","doi":"10.1016/j.cmpb.2025.108588","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Objective</h3><div>Predicting potential risk factors for the occurrence of coronary artery lesions (CAL) in children with Kawasaki disease (KD) is critical for subsequent treatment. The aim of our study was to establish and validate a nomograph-based model for identifying children with KD at risk for CAL.</div></div><div><h3>Methods</h3><div>Hospitalized children with KD attending Wuhan Children's Hospital from Jan 2011 to Dec 2023 were included in the study and were grouped into a training set (4793 cases) and a validation set (2054 cases) using a simple random sampling method in a 7:3 ratio. The analysis was performed using RStudio software, which first used LASSO regression analysis to screen for the best predictors, and then analyzed the screened predictors using logistic regression analysis to derive independent predictors and construct a nomogram model to predict CAL risk. The receiver operating characteristic (ROC) and calibration curves were employed to evaluate the discrimination and calibration of the model. Finally, decision curve analysis (DCA) was utilized to validate the clinical applicability of the models assessed in the data.</div></div><div><h3>Results</h3><div>Of the 6847 eligible children with KD included, 845 (12 %) were ultimately diagnosed with CAL, of whom 619 were boys (73 %) with a median age of 1.81 (0.74, 3.51) years. Six significant independent predictors were identified, including sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin. Our model has acceptable discriminative power, with areas under the curve at 0.671 and 0.703 in the training and validation sets, respectively. DCA analysis showed that the prediction model had great clinical utility when the threshold probability interval was between 0.1 and 0.5.</div></div><div><h3>Conclusions</h3><div>We constructed and internally validated a nomograph-based predictive model based on six variables consisting of sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin, which may be useful for earlier identification of children with KD who may have CAL.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"260 ","pages":"Article 108588"},"PeriodicalIF":4.9000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725000057","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Objective
Predicting potential risk factors for the occurrence of coronary artery lesions (CAL) in children with Kawasaki disease (KD) is critical for subsequent treatment. The aim of our study was to establish and validate a nomograph-based model for identifying children with KD at risk for CAL.
Methods
Hospitalized children with KD attending Wuhan Children's Hospital from Jan 2011 to Dec 2023 were included in the study and were grouped into a training set (4793 cases) and a validation set (2054 cases) using a simple random sampling method in a 7:3 ratio. The analysis was performed using RStudio software, which first used LASSO regression analysis to screen for the best predictors, and then analyzed the screened predictors using logistic regression analysis to derive independent predictors and construct a nomogram model to predict CAL risk. The receiver operating characteristic (ROC) and calibration curves were employed to evaluate the discrimination and calibration of the model. Finally, decision curve analysis (DCA) was utilized to validate the clinical applicability of the models assessed in the data.
Results
Of the 6847 eligible children with KD included, 845 (12 %) were ultimately diagnosed with CAL, of whom 619 were boys (73 %) with a median age of 1.81 (0.74, 3.51) years. Six significant independent predictors were identified, including sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin. Our model has acceptable discriminative power, with areas under the curve at 0.671 and 0.703 in the training and validation sets, respectively. DCA analysis showed that the prediction model had great clinical utility when the threshold probability interval was between 0.1 and 0.5.
Conclusions
We constructed and internally validated a nomograph-based predictive model based on six variables consisting of sex, intravenous immunoglobulin nonresponse, peripheral blood hemoglobin, platelet distribution width, platelet count, and serum albumin, which may be useful for earlier identification of children with KD who may have CAL.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.