A prediction model for genetic cholestatic disease in infancy using the machine learning approach.

IF 2.6 3区 医学 Q3 GASTROENTEROLOGY & HEPATOLOGY
Chi-San Tai, Sung-Chu Ko, Chien-Chang Lee, Hui-Ru Yang, Chia-Ray Lin, Byung-Ho Choe, Suporn Treepongkaruna, Voranush Chongsrisawat, Chau-Chung Wu, Huey-Ling Chen
{"title":"A prediction model for genetic cholestatic disease in infancy using the machine learning approach.","authors":"Chi-San Tai, Sung-Chu Ko, Chien-Chang Lee, Hui-Ru Yang, Chia-Ray Lin, Byung-Ho Choe, Suporn Treepongkaruna, Voranush Chongsrisawat, Chau-Chung Wu, Huey-Ling Chen","doi":"10.1002/jpn3.70166","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Cholestasis in infancy poses a complex clinical conundrum for pediatric hepatologists, warranting timely diagnosis, especially for genetic diseases. This study aims to create machine learning (ML)-based prediction models, referred to as Jaundice Diagnosis Easy for Baby (JADE-B), to identify the subjects prone to genetic causes of cholestasis.</p><p><strong>Methods: </strong>We retrieved patient data from the Integrated Medical Database at a university-affiliated tertiary medical center from 2006 to 2018. Patients with cholestatic disease were identified using liver-disease-specific International Classification of Diseases codes. A total of 47 clinical and laboratory parameters were used for ML for predicting a positive genetic disease, defined by a disease-specific genetic diagnosis matched with phenotype. Four distinct classifiers: Logistic regression, XGBoost (XGB), LightGBM (LGBM), and Random Forests were utilized to build the models.</p><p><strong>Results: </strong>From a patient pool of 1845, 1008 infants below 1 year of age diagnosed with cholestatic liver disease were included in the analysis. A comprehensive set of 47 pertinent clinical and laboratory features was incorporated for training the ML models. We built five sets of models (Model 1-5), yielding an area under the receiver operating characteristic curve of 0.869, 0.884, 0.855, 0.852, and 0.836, respectively. A JADE-B model was built using 20 simple and widely accessible clinical parameters at disease onset, up to 1 month, to predict patients with genetic disorders.</p><p><strong>Conclusions: </strong>The machine learning model prioritizes cholestatic infants for the allocation of genetic diagnostic tools and patient referrals, as well as optimizes the utilization of genetic diagnostic resources.</p>","PeriodicalId":16694,"journal":{"name":"Journal of Pediatric Gastroenterology and Nutrition","volume":" ","pages":"933-942"},"PeriodicalIF":2.6000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12484703/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pediatric Gastroenterology and Nutrition","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/jpn3.70166","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: Cholestasis in infancy poses a complex clinical conundrum for pediatric hepatologists, warranting timely diagnosis, especially for genetic diseases. This study aims to create machine learning (ML)-based prediction models, referred to as Jaundice Diagnosis Easy for Baby (JADE-B), to identify the subjects prone to genetic causes of cholestasis.

Methods: We retrieved patient data from the Integrated Medical Database at a university-affiliated tertiary medical center from 2006 to 2018. Patients with cholestatic disease were identified using liver-disease-specific International Classification of Diseases codes. A total of 47 clinical and laboratory parameters were used for ML for predicting a positive genetic disease, defined by a disease-specific genetic diagnosis matched with phenotype. Four distinct classifiers: Logistic regression, XGBoost (XGB), LightGBM (LGBM), and Random Forests were utilized to build the models.

Results: From a patient pool of 1845, 1008 infants below 1 year of age diagnosed with cholestatic liver disease were included in the analysis. A comprehensive set of 47 pertinent clinical and laboratory features was incorporated for training the ML models. We built five sets of models (Model 1-5), yielding an area under the receiver operating characteristic curve of 0.869, 0.884, 0.855, 0.852, and 0.836, respectively. A JADE-B model was built using 20 simple and widely accessible clinical parameters at disease onset, up to 1 month, to predict patients with genetic disorders.

Conclusions: The machine learning model prioritizes cholestatic infants for the allocation of genetic diagnostic tools and patient referrals, as well as optimizes the utilization of genetic diagnostic resources.

使用机器学习方法的婴儿遗传性胆汁淤积症预测模型。
目的:婴儿胆汁淤积症是儿科肝病学家面临的一个复杂的临床难题,需要及时诊断,尤其是遗传性疾病。本研究旨在创建基于机器学习(ML)的预测模型,称为“婴儿黄疸诊断简易(JADE-B)”,以确定易受遗传原因影响的胆汁淤积的受试者。方法:从某大学附属三级医疗中心的综合医学数据库中检索2006 - 2018年的患者数据。使用肝脏疾病特异性国际疾病分类代码确定胆汁淤积症患者。ML共有47个临床和实验室参数用于预测阳性遗传疾病,由疾病特异性遗传诊断与表型匹配定义。四种不同的分类器:Logistic回归,XGBoost (XGB), LightGBM (LGBM)和随机森林来构建模型。结果:从1845名患者池中,1008名1岁以下诊断为胆汁淤积性肝病的婴儿被纳入分析。一套完整的47个相关临床和实验室特征被纳入ML模型的训练。我们建立了5组模型(Model 1-5),分别得到接收者工作特征曲线下的面积为0.869、0.884、0.855、0.852和0.836。使用20个简单且可广泛获取的疾病发病后1个月的临床参数建立JADE-B模型,用于预测遗传疾病患者。结论:该机器学习模型为胆汁淤积症患儿优先分配遗传诊断工具和患者转诊,优化遗传诊断资源的利用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.30
自引率
13.80%
发文量
467
审稿时长
3-6 weeks
期刊介绍: ​The Journal of Pediatric Gastroenterology and Nutrition (JPGN) provides a forum for original papers and reviews dealing with pediatric gastroenterology and nutrition, including normal and abnormal functions of the alimentary tract and its associated organs, including the salivary glands, pancreas, gallbladder, and liver. Particular emphasis is on development and its relation to infant and childhood nutrition.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信