基于超声的机器学习和SHapley加性解释评估胆囊癌风险的方法:一项双中心验证研究。

IF 2.4 4区 医学 Q2 ACOUSTICS
Binqiong Chen, Huohu Zhong, Jiaojiao Lin, Guorong Lyu, Shanshan Su
{"title":"基于超声的机器学习和SHapley加性解释评估胆囊癌风险的方法:一项双中心验证研究。","authors":"Binqiong Chen, Huohu Zhong, Jiaojiao Lin, Guorong Lyu, Shanshan Su","doi":"10.1002/jum.70027","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study aims to construct and evaluate 8 machine learning models by integrating ultrasound imaging features, clinical characteristics, and serological features to assess the risk of gallbladder cancer (GBC) occurrence in patients.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on ultrasound and clinical data of 300 suspected GBC patients who visited the Second Affiliated Hospital of Fujian Medical University from January 2020 to January 2024 and 69 patients who visited the Zhongshan Hospital Affiliated to Xiamen University from January 2024 to January 2025. Key relevant features were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. Predictive models were constructed using XGBoost, logistic regression, support vector machine, k-nearest neighbors, random forest, decision tree, naive Bayes, and neural network, with the SHapley Additive exPlanations (SHAP) method employed to explain model interpretability.</p><p><strong>Results: </strong>The LASSO regression demonstrated that gender, age, alkaline phosphatase (ALP), clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions were key features for GBC. The XGBoost model demonstrated an area under receiver operating characteristic curve (AUC) of 0.934, 0.916, and 0.813 in the training, validating, and test sets. SHAP analysis revealed the importance ranking of factors as clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions, ALP, gender, and age. Personalized prediction explanations through SHAP values demonstrated the contribution of each feature to the final prediction, enhancing result interpretability. Furthermore, decision plots were generated to display the influence trajectory of each feature on model predictions, aiding in analyzing which features had the greatest impact on these mispredictions; thereby facilitating further model optimization or feature adjustment.</p><p><strong>Conclusion: </strong>This study proposed a GBC ML model based on ultrasound, clinical, and serological characteristics, indicating the superior performance of the XGBoost model and enhancing the interpretability of the model through the SHAP method.</p>","PeriodicalId":17563,"journal":{"name":"Journal of Ultrasound in Medicine","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ultrasound-Based Machine Learning and SHapley Additive exPlanations Method Evaluating Risk of Gallbladder Cancer: A Bicentric and Validation Study.\",\"authors\":\"Binqiong Chen, Huohu Zhong, Jiaojiao Lin, Guorong Lyu, Shanshan Su\",\"doi\":\"10.1002/jum.70027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>This study aims to construct and evaluate 8 machine learning models by integrating ultrasound imaging features, clinical characteristics, and serological features to assess the risk of gallbladder cancer (GBC) occurrence in patients.</p><p><strong>Methods: </strong>A retrospective analysis was conducted on ultrasound and clinical data of 300 suspected GBC patients who visited the Second Affiliated Hospital of Fujian Medical University from January 2020 to January 2024 and 69 patients who visited the Zhongshan Hospital Affiliated to Xiamen University from January 2024 to January 2025. Key relevant features were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. Predictive models were constructed using XGBoost, logistic regression, support vector machine, k-nearest neighbors, random forest, decision tree, naive Bayes, and neural network, with the SHapley Additive exPlanations (SHAP) method employed to explain model interpretability.</p><p><strong>Results: </strong>The LASSO regression demonstrated that gender, age, alkaline phosphatase (ALP), clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions were key features for GBC. The XGBoost model demonstrated an area under receiver operating characteristic curve (AUC) of 0.934, 0.916, and 0.813 in the training, validating, and test sets. SHAP analysis revealed the importance ranking of factors as clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions, ALP, gender, and age. Personalized prediction explanations through SHAP values demonstrated the contribution of each feature to the final prediction, enhancing result interpretability. Furthermore, decision plots were generated to display the influence trajectory of each feature on model predictions, aiding in analyzing which features had the greatest impact on these mispredictions; thereby facilitating further model optimization or feature adjustment.</p><p><strong>Conclusion: </strong>This study proposed a GBC ML model based on ultrasound, clinical, and serological characteristics, indicating the superior performance of the XGBoost model and enhancing the interpretability of the model through the SHAP method.</p>\",\"PeriodicalId\":17563,\"journal\":{\"name\":\"Journal of Ultrasound in Medicine\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2025-08-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Ultrasound in Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/jum.70027\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ultrasound in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/jum.70027","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

目的:本研究旨在综合超声影像特征、临床特征、血清学特征,构建并评价8种机器学习模型,评估胆囊癌(GBC)患者发生风险。方法:回顾性分析2020年1月至2024年1月在福建医科大学第二附属医院就诊的300例疑似GBC患者和2024年1月至2025年1月在厦门大学附属中山医院就诊的69例患者的超声及临床资料。使用最小绝对收缩和选择算子(LASSO)回归选择关键相关特征。利用XGBoost、逻辑回归、支持向量机、k近邻、随机森林、决策树、朴素贝叶斯和神经网络构建预测模型,并采用SHapley加性解释(SHAP)方法解释模型的可解释性。结果:LASSO回归显示,性别、年龄、碱性磷酸酶(ALP)、与肝脏交界面清晰度、胆囊壁分层、囊内无回声病变、囊内点状强病变是GBC的主要特征。在训练集、验证集和测试集上,XGBoost模型的受试者工作特征曲线下面积(AUC)分别为0.934、0.916和0.813。SHAP分析显示,影响因素的重要程度依次为肝界面清晰度、胆囊壁分层、囊内无回声病变、囊内点状强病变、ALP、性别、年龄。通过SHAP值进行个性化预测解释,展示了每个特征对最终预测的贡献,增强了结果的可解释性。此外,生成决策图来显示每个特征对模型预测的影响轨迹,有助于分析哪些特征对这些错误预测的影响最大;从而便于进一步的模型优化或特征调整。结论:本研究提出了基于超声、临床、血清学特征的GBC ML模型,表明XGBoost模型具有优越的性能,并通过SHAP方法增强了模型的可解释性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ultrasound-Based Machine Learning and SHapley Additive exPlanations Method Evaluating Risk of Gallbladder Cancer: A Bicentric and Validation Study.

Objectives: This study aims to construct and evaluate 8 machine learning models by integrating ultrasound imaging features, clinical characteristics, and serological features to assess the risk of gallbladder cancer (GBC) occurrence in patients.

Methods: A retrospective analysis was conducted on ultrasound and clinical data of 300 suspected GBC patients who visited the Second Affiliated Hospital of Fujian Medical University from January 2020 to January 2024 and 69 patients who visited the Zhongshan Hospital Affiliated to Xiamen University from January 2024 to January 2025. Key relevant features were selected using Least Absolute Shrinkage and Selection Operator (LASSO) regression. Predictive models were constructed using XGBoost, logistic regression, support vector machine, k-nearest neighbors, random forest, decision tree, naive Bayes, and neural network, with the SHapley Additive exPlanations (SHAP) method employed to explain model interpretability.

Results: The LASSO regression demonstrated that gender, age, alkaline phosphatase (ALP), clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions were key features for GBC. The XGBoost model demonstrated an area under receiver operating characteristic curve (AUC) of 0.934, 0.916, and 0.813 in the training, validating, and test sets. SHAP analysis revealed the importance ranking of factors as clarity of interface with liver, stratification of the gallbladder wall, intracapsular anechoic lesions, and intracapsular punctiform strong lesions, ALP, gender, and age. Personalized prediction explanations through SHAP values demonstrated the contribution of each feature to the final prediction, enhancing result interpretability. Furthermore, decision plots were generated to display the influence trajectory of each feature on model predictions, aiding in analyzing which features had the greatest impact on these mispredictions; thereby facilitating further model optimization or feature adjustment.

Conclusion: This study proposed a GBC ML model based on ultrasound, clinical, and serological characteristics, indicating the superior performance of the XGBoost model and enhancing the interpretability of the model through the SHAP method.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.10
自引率
4.30%
发文量
205
审稿时长
1.5 months
期刊介绍: The Journal of Ultrasound in Medicine (JUM) is dedicated to the rapid, accurate publication of original articles dealing with all aspects of medical ultrasound, particularly its direct application to patient care but also relevant basic science, advances in instrumentation, and biological effects. The journal is an official publication of the American Institute of Ultrasound in Medicine and publishes articles in a variety of categories, including Original Research papers, Review Articles, Pictorial Essays, Technical Innovations, Case Series, Letters to the Editor, and more, from an international bevy of countries in a continual effort to showcase and promote advances in the ultrasound community. Represented through these efforts are a wide variety of disciplines of ultrasound, including, but not limited to: -Basic Science- Breast Ultrasound- Contrast-Enhanced Ultrasound- Dermatology- Echocardiography- Elastography- Emergency Medicine- Fetal Echocardiography- Gastrointestinal Ultrasound- General and Abdominal Ultrasound- Genitourinary Ultrasound- Gynecologic Ultrasound- Head and Neck Ultrasound- High Frequency Clinical and Preclinical Imaging- Interventional-Intraoperative Ultrasound- Musculoskeletal Ultrasound- Neurosonology- Obstetric Ultrasound- Ophthalmologic Ultrasound- Pediatric Ultrasound- Point-of-Care Ultrasound- Public Policy- Superficial Structures- Therapeutic Ultrasound- Ultrasound Education- Ultrasound in Global Health- Urologic Ultrasound- Vascular Ultrasound
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信