{"title":"Integrating Etiological Insights With Machine Learning for Precision Diagnosis of Obstructive Jaundice: Findings From a High-Volume Center.","authors":"Ningyuan Wen, Yaoqun Wang, Xianze Xiong, Jianrong Xu, Shaofeng Wang, Yuan Tian, Di Zeng, Xingyu Pu, Bei Li, Jiong Lu, Geng Liu, Nansheng Cheng","doi":"10.14309/ctg.0000000000000849","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Large-scale cohort studies exploring the etiology of obstructive jaundice (OJ) are scarce, with current serum-based diagnostic markers offering suboptimal performance. This study leverages the largest retrospective cohort of patients with OJ to date to investigate its disease spectrum and to develop a novel diagnostic system.</p><p><strong>Methods: </strong>This study involves 2 retrospective observational cohorts. The biliary surgery cohort (BS cohort, n = 349) served for initial data exploration and external validation of machine learning (ML) models. The large general cohort (LG cohort, n = 5,726) enabled an in-depth analysis of etiologies and the determination of relevant diagnostic indicators, in addition to supporting ML model development. Interpretable ML techniques were used to derive insights from the models.</p><p><strong>Results: </strong>The LG cohort highlighted a diverse disease spectrum of OJ, including cholangiocarcinoma (10.39% distal, 10.01% perihilar, and 5.59% intrahepatic), pancreatic adenocarcinoma (19.11%), and common bile duct stones (18.27%) as leading causes. Traditional serum markers such as carbohydrate antigen 19-9 and carcinoembryonic antigen lacked stand-alone diagnostic accuracy. Two ML-based models (collectively termed the ML of OJ based on common laboratory tests model) were developed: a classifier to differentiate benign from malignant causes (AUROC = 0.862) and a multiclass model to further stratify malignant and benign diseases (ACC = 0.777). Interpretable ML tools provided clarity on critical features, offering actionable insights and enhancing transparency in the decision-making process.</p><p><strong>Discussion: </strong>This study elucidates the etiological spectrum of OJ, meanwhile providing a practical and interpretable ML-based diagnostic tool. By leveraging large-scale clinical data, our model provides a rapid and reliable primary assessment for patients with OJ, enabling clinicians to identify potential etiologies and guide further diagnostic workup.</p>","PeriodicalId":10278,"journal":{"name":"Clinical and Translational Gastroenterology","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.14309/ctg.0000000000000849","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Large-scale cohort studies exploring the etiology of obstructive jaundice (OJ) are scarce, with current serum-based diagnostic markers offering suboptimal performance. This study leverages the largest retrospective cohort of patients with OJ to date to investigate its disease spectrum and to develop a novel diagnostic system.
Methods: This study involves 2 retrospective observational cohorts. The biliary surgery cohort (BS cohort, n = 349) served for initial data exploration and external validation of machine learning (ML) models. The large general cohort (LG cohort, n = 5,726) enabled an in-depth analysis of etiologies and the determination of relevant diagnostic indicators, in addition to supporting ML model development. Interpretable ML techniques were used to derive insights from the models.
Results: The LG cohort highlighted a diverse disease spectrum of OJ, including cholangiocarcinoma (10.39% distal, 10.01% perihilar, and 5.59% intrahepatic), pancreatic adenocarcinoma (19.11%), and common bile duct stones (18.27%) as leading causes. Traditional serum markers such as carbohydrate antigen 19-9 and carcinoembryonic antigen lacked stand-alone diagnostic accuracy. Two ML-based models (collectively termed the ML of OJ based on common laboratory tests model) were developed: a classifier to differentiate benign from malignant causes (AUROC = 0.862) and a multiclass model to further stratify malignant and benign diseases (ACC = 0.777). Interpretable ML tools provided clarity on critical features, offering actionable insights and enhancing transparency in the decision-making process.
Discussion: This study elucidates the etiological spectrum of OJ, meanwhile providing a practical and interpretable ML-based diagnostic tool. By leveraging large-scale clinical data, our model provides a rapid and reliable primary assessment for patients with OJ, enabling clinicians to identify potential etiologies and guide further diagnostic workup.
期刊介绍:
Clinical and Translational Gastroenterology (CTG), published on behalf of the American College of Gastroenterology (ACG), is a peer-reviewed open access online journal dedicated to innovative clinical work in the field of gastroenterology and hepatology. CTG hopes to fulfill an unmet need for clinicians and scientists by welcoming novel cohort studies, early-phase clinical trials, qualitative and quantitative epidemiologic research, hypothesis-generating research, studies of novel mechanisms and methodologies including public health interventions, and integration of approaches across organs and disciplines. CTG also welcomes hypothesis-generating small studies, methods papers, and translational research with clear applications to human physiology or disease.
Colon and small bowel
Endoscopy and novel diagnostics
Esophagus
Functional GI disorders
Immunology of the GI tract
Microbiology of the GI tract
Inflammatory bowel disease
Pancreas and biliary tract
Liver
Pathology
Pediatrics
Preventative medicine
Nutrition/obesity
Stomach.