开发一个可解释的机器学习模型来预测新生儿筛查中假阴性的柠檬酸缺乏症病例。

IF 3.5 2区医学 Q2 GENETICS & HEREDITY

Orphanet Journal of Rare Diseases Pub Date : 2025-10-08 DOI:10.1186/s13023-025-04045-z

Peiyao Wang, Haomin Li, Xinjie Yang, Lingwei Hu, Yuhe Chen, Ziyan Cen, Pingping Ge, Qimin He, Benqing Wu, Xinwen Huang

{"title":"开发一个可解释的机器学习模型来预测新生儿筛查中假阴性的柠檬酸缺乏症病例。","authors":"Peiyao Wang, Haomin Li, Xinjie Yang, Lingwei Hu, Yuhe Chen, Ziyan Cen, Pingping Ge, Qimin He, Benqing Wu, Xinwen Huang","doi":"10.1186/s13023-025-04045-z","DOIUrl":null,"url":null,"abstract":"Background: Neonatal Intrahepatic Cholestasis caused by Citrin Deficiency (NICCD) is an autosomal recessive disorder affecting the urea cycle and energy metabolism. Newborn screening (NBS) usually relies on elevated citrulline, but some patients have normal citrulline, resulting in false negatives and delayed diagnosis. This study develops an explainable machine learning (ML) model to predict false-negative NICCD cases during NBS.Methods: Data from 53 false-negative NICCD patients and 212 controls, collected retrospectively between 2011 and 2024, were analyzed. The dataset was split into a training set (70%) and a test set (30%). External validation involved 48 participants from distinct time periods. Key predictors were identified using variable importance in projection (VIP > 1) and Lasso regression. Six ML models were trained for evaluation: Logistic Regression, Random Forest, Light Gradient Boosting Machine, Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor, and Support Vector Machines. Performance was evaluated using the area under the receiver operating characteristic curve (AUC) and F1 score. Shapley Additive exPlanations (SHAP) was applied to determine the importance of features and interpret the models.Results: Birth weight, citrulline, glycine, phenylalanine, ornithine, arginine, proline, succinylacetone, and C10:2 were selected as predictive features. Among the ML models, XGBoost demonstrated the most robust and consistent performance, achieving AUCs of 0.971(95%CI: 0.959-0.979), 0.968, and 0.977, and F1 scores of 0.786(95% CI: 0.744-0.820), 0.828, and 0.833 in the training, test, and external validation sets, respectively. SHAP analysis showed that the most important features are citrulline, glycine, phenylalanine, succinylacetone, birth weight, and ornithine. Feature pairs such as citrulline-phenylalanine, citrulline-glycine, succinylacetone-birth weight, and ornithine-glycine showed varying interactions. SHAP force plots, decision plots, and waterfall plots provided insightful patient-level interpretations. Finally, we built a network calculator for the prediction of false-negative NICCD cases ( https://myapp123.shinyapps.io/my_shiny_app/ ).Conclusion: An interpretable machine learning model utilizing metabolite and demographic data enhances the detection of false-negative NICCD cases, facilitates early identification and intervention, and ultimately improves the overall effectiveness of the newborn screening system.","PeriodicalId":19651,"journal":{"name":"Orphanet Journal of Rare Diseases","volume":"20 1","pages":"507"},"PeriodicalIF":3.5000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing an explainable machine learning model to predict false-negative citrin deficiency cases in newborn screening.\",\"authors\":\"Peiyao Wang, Haomin Li, Xinjie Yang, Lingwei Hu, Yuhe Chen, Ziyan Cen, Pingping Ge, Qimin He, Benqing Wu, Xinwen Huang\",\"doi\":\"10.1186/s13023-025-04045-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Neonatal Intrahepatic Cholestasis caused by Citrin Deficiency (NICCD) is an autosomal recessive disorder affecting the urea cycle and energy metabolism. Newborn screening (NBS) usually relies on elevated citrulline, but some patients have normal citrulline, resulting in false negatives and delayed diagnosis. This study develops an explainable machine learning (ML) model to predict false-negative NICCD cases during NBS.Methods: Data from 53 false-negative NICCD patients and 212 controls, collected retrospectively between 2011 and 2024, were analyzed. The dataset was split into a training set (70%) and a test set (30%). External validation involved 48 participants from distinct time periods. Key predictors were identified using variable importance in projection (VIP > 1) and Lasso regression. Six ML models were trained for evaluation: Logistic Regression, Random Forest, Light Gradient Boosting Machine, Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor, and Support Vector Machines. Performance was evaluated using the area under the receiver operating characteristic curve (AUC) and F1 score. Shapley Additive exPlanations (SHAP) was applied to determine the importance of features and interpret the models.Results: Birth weight, citrulline, glycine, phenylalanine, ornithine, arginine, proline, succinylacetone, and C10:2 were selected as predictive features. Among the ML models, XGBoost demonstrated the most robust and consistent performance, achieving AUCs of 0.971(95%CI: 0.959-0.979), 0.968, and 0.977, and F1 scores of 0.786(95% CI: 0.744-0.820), 0.828, and 0.833 in the training, test, and external validation sets, respectively. SHAP analysis showed that the most important features are citrulline, glycine, phenylalanine, succinylacetone, birth weight, and ornithine. Feature pairs such as citrulline-phenylalanine, citrulline-glycine, succinylacetone-birth weight, and ornithine-glycine showed varying interactions. SHAP force plots, decision plots, and waterfall plots provided insightful patient-level interpretations. Finally, we built a network calculator for the prediction of false-negative NICCD cases ( https://myapp123.shinyapps.io/my_shiny_app/ ).Conclusion: An interpretable machine learning model utilizing metabolite and demographic data enhances the detection of false-negative NICCD cases, facilitates early identification and intervention, and ultimately improves the overall effectiveness of the newborn screening system.\",\"PeriodicalId\":19651,\"journal\":{\"name\":\"Orphanet Journal of Rare Diseases\",\"volume\":\"20 1\",\"pages\":\"507\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Orphanet Journal of Rare Diseases\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s13023-025-04045-z\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Orphanet Journal of Rare Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13023-025-04045-z","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

摘要

背景：新生儿肝内胆汁淤积症（NICCD）是一种影响尿素循环和能量代谢的常染色体隐性遗传病。新生儿筛查（NBS）通常依赖于瓜氨酸升高，但一些患者瓜氨酸正常，导致假阴性和延误诊断。本研究开发了一个可解释的机器学习（ML）模型来预测NBS期间NICCD的假阴性病例。方法：回顾性分析2011年至2024年间收集的53例假阴性NICCD患者和212例对照组的数据。数据集分为训练集（70%）和测试集（30%）。外部验证涉及来自不同时期的48名参与者。使用变量重要性预测（VIP >1）和Lasso回归确定关键预测因子。6个ML模型被训练用于评估：逻辑回归、随机森林、光梯度增强机、极端梯度增强机（XGBoost）、k近邻和支持向量机。使用受者工作特征曲线下面积（AUC）和F1评分来评估性能。采用Shapley加性解释（SHAP）来确定特征的重要性并解释模型。结果：选择出生体重、瓜氨酸、甘氨酸、苯丙氨酸、鸟氨酸、精氨酸、脯氨酸、琥珀酰丙酮和C10:2作为预测特征。在ML模型中，XGBoost表现出最稳健和一致的性能，在训练集、测试集和外部验证集上的auc分别为0.971（95%CI: 0.959-0.979）、0.968和0.977，F1得分分别为0.786（95% CI: 0.744-0.820）、0.828和0.833。SHAP分析显示，最重要的特征是瓜氨酸、甘氨酸、苯丙氨酸、琥珀酰丙酮、出生体重和鸟氨酸。特征对如瓜氨酸-苯丙氨酸、瓜氨酸-甘氨酸、琥珀酰丙酮-出生体重和鸟氨酸-甘氨酸表现出不同的相互作用。SHAP力图、决策图和瀑布图提供了深刻的患者层面解释。最后，我们建立了一个预测NICCD假阴性病例的网络计算器（https://myapp123.shinyapps.io/my_shiny_app/）。结论：利用代谢物和人口统计学数据的可解释机器学习模型提高了NICCD假阴性病例的发现，促进了早期识别和干预，最终提高了新生儿筛查系统的整体有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Developing an explainable machine learning model to predict false-negative citrin deficiency cases in newborn screening.

Background: Neonatal Intrahepatic Cholestasis caused by Citrin Deficiency (NICCD) is an autosomal recessive disorder affecting the urea cycle and energy metabolism. Newborn screening (NBS) usually relies on elevated citrulline, but some patients have normal citrulline, resulting in false negatives and delayed diagnosis. This study develops an explainable machine learning (ML) model to predict false-negative NICCD cases during NBS.

Methods: Data from 53 false-negative NICCD patients and 212 controls, collected retrospectively between 2011 and 2024, were analyzed. The dataset was split into a training set (70%) and a test set (30%). External validation involved 48 participants from distinct time periods. Key predictors were identified using variable importance in projection (VIP > 1) and Lasso regression. Six ML models were trained for evaluation: Logistic Regression, Random Forest, Light Gradient Boosting Machine, Extreme Gradient Boosting (XGBoost), K-Nearest Neighbor, and Support Vector Machines. Performance was evaluated using the area under the receiver operating characteristic curve (AUC) and F1 score. Shapley Additive exPlanations (SHAP) was applied to determine the importance of features and interpret the models.

Results: Birth weight, citrulline, glycine, phenylalanine, ornithine, arginine, proline, succinylacetone, and C10:2 were selected as predictive features. Among the ML models, XGBoost demonstrated the most robust and consistent performance, achieving AUCs of 0.971(95%CI: 0.959-0.979), 0.968, and 0.977, and F1 scores of 0.786(95% CI: 0.744-0.820), 0.828, and 0.833 in the training, test, and external validation sets, respectively. SHAP analysis showed that the most important features are citrulline, glycine, phenylalanine, succinylacetone, birth weight, and ornithine. Feature pairs such as citrulline-phenylalanine, citrulline-glycine, succinylacetone-birth weight, and ornithine-glycine showed varying interactions. SHAP force plots, decision plots, and waterfall plots provided insightful patient-level interpretations. Finally, we built a network calculator for the prediction of false-negative NICCD cases ( https://myapp123.shinyapps.io/my_shiny_app/ ).

Conclusion: An interpretable machine learning model utilizing metabolite and demographic data enhances the detection of false-negative NICCD cases, facilitates early identification and intervention, and ultimately improves the overall effectiveness of the newborn screening system.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Orphanet Journal of Rare Diseases 医学-医学：研究与实验

CiteScore

6.30

自引率

8.10%

发文量

418

审稿时长

4-8 weeks

期刊介绍： Orphanet Journal of Rare Diseases is an open access, peer-reviewed journal that encompasses all aspects of rare diseases and orphan drugs. The journal publishes high-quality reviews on specific rare diseases. In addition, the journal may consider articles on clinical trial outcome reports, either positive or negative, and articles on public health issues in the field of rare diseases and orphan drugs. The journal does not accept case reports.