{"title":"基于机器学习的原发性川崎病合并冠状动脉瘤预测模型的开发和验证。","authors":"Zixia Song, Hongjun Ming, Bin Liu, Dong Liu","doi":"10.21037/tp-24-359","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Kawasaki disease (KD) can lead to coronary artery aneurysms (CAA) in approximately 1 in 5 untreated children despite intravenous immunoglobulin (IVIG) therapy in the acute phase. The aim of this study is to develop and validate an explainable machine learning (ML)-based prediction model for CAA in KD.</p><p><strong>Methods: </strong>This study retrospectively analyzed the clinical data of children diagnosed with primary KD at Nanchong Central Hospital, Sichuan Province between 2015 and 2023. Six models, including support vector machine (SVM), K-nearest neighbors (KNN), least absolute shrinkage and selection operator (Lasso), extreme gradient boosting (XGBoost), random forest (RF), and multilayer perceptron (MLP), based on ML algorithms were developed. The model with optimal performance was validated and the explainable SHapley Additive exPlanations (SHAP) analysis was used.</p><p><strong>Results: </strong>A total of 327 children diagnosed with KD were included in the training set and validation set. Receiver operator characteristic curve analysis showed that XGBoost based model exhibited an optimal performance among the six models. Moreover, for a given CAA positive sample, the sum of the SHAP values of all variables of XGBoost represented the individual deviation from the mean predicted from the entire dataset.</p><p><strong>Conclusions: </strong>The XGBoost algorithm-based explainable model might be used to predict the occurrence of CAA in children with KD.</p>","PeriodicalId":23294,"journal":{"name":"Translational pediatrics","volume":"14 2","pages":"208-221"},"PeriodicalIF":1.5000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11921264/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of an explainable machine learning-based prediction model for primary Kawasaki disease complicated with coronary artery aneurysms.\",\"authors\":\"Zixia Song, Hongjun Ming, Bin Liu, Dong Liu\",\"doi\":\"10.21037/tp-24-359\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Kawasaki disease (KD) can lead to coronary artery aneurysms (CAA) in approximately 1 in 5 untreated children despite intravenous immunoglobulin (IVIG) therapy in the acute phase. The aim of this study is to develop and validate an explainable machine learning (ML)-based prediction model for CAA in KD.</p><p><strong>Methods: </strong>This study retrospectively analyzed the clinical data of children diagnosed with primary KD at Nanchong Central Hospital, Sichuan Province between 2015 and 2023. Six models, including support vector machine (SVM), K-nearest neighbors (KNN), least absolute shrinkage and selection operator (Lasso), extreme gradient boosting (XGBoost), random forest (RF), and multilayer perceptron (MLP), based on ML algorithms were developed. The model with optimal performance was validated and the explainable SHapley Additive exPlanations (SHAP) analysis was used.</p><p><strong>Results: </strong>A total of 327 children diagnosed with KD were included in the training set and validation set. Receiver operator characteristic curve analysis showed that XGBoost based model exhibited an optimal performance among the six models. Moreover, for a given CAA positive sample, the sum of the SHAP values of all variables of XGBoost represented the individual deviation from the mean predicted from the entire dataset.</p><p><strong>Conclusions: </strong>The XGBoost algorithm-based explainable model might be used to predict the occurrence of CAA in children with KD.</p>\",\"PeriodicalId\":23294,\"journal\":{\"name\":\"Translational pediatrics\",\"volume\":\"14 2\",\"pages\":\"208-221\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11921264/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Translational pediatrics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/tp-24-359\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"PEDIATRICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tp-24-359","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}
Development and validation of an explainable machine learning-based prediction model for primary Kawasaki disease complicated with coronary artery aneurysms.
Background: Kawasaki disease (KD) can lead to coronary artery aneurysms (CAA) in approximately 1 in 5 untreated children despite intravenous immunoglobulin (IVIG) therapy in the acute phase. The aim of this study is to develop and validate an explainable machine learning (ML)-based prediction model for CAA in KD.
Methods: This study retrospectively analyzed the clinical data of children diagnosed with primary KD at Nanchong Central Hospital, Sichuan Province between 2015 and 2023. Six models, including support vector machine (SVM), K-nearest neighbors (KNN), least absolute shrinkage and selection operator (Lasso), extreme gradient boosting (XGBoost), random forest (RF), and multilayer perceptron (MLP), based on ML algorithms were developed. The model with optimal performance was validated and the explainable SHapley Additive exPlanations (SHAP) analysis was used.
Results: A total of 327 children diagnosed with KD were included in the training set and validation set. Receiver operator characteristic curve analysis showed that XGBoost based model exhibited an optimal performance among the six models. Moreover, for a given CAA positive sample, the sum of the SHAP values of all variables of XGBoost represented the individual deviation from the mean predicted from the entire dataset.
Conclusions: The XGBoost algorithm-based explainable model might be used to predict the occurrence of CAA in children with KD.