Development of a High-Performance Ultrasound Prediction Model for the Diagnosis of Endometrial Cancer: An Interpretable XGBoost Algorithm Utilizing SHAP Analysis.

IF 2.4 4区 医学 Q2 ACOUSTICS
Hongwei Lai, Qiumei Wu, Zongjie Weng, Guorong Lyu, Wenmin Yang, Fengying Ye
{"title":"Development of a High-Performance Ultrasound Prediction Model for the Diagnosis of Endometrial Cancer: An Interpretable XGBoost Algorithm Utilizing SHAP Analysis.","authors":"Hongwei Lai, Qiumei Wu, Zongjie Weng, Guorong Lyu, Wenmin Yang, Fengying Ye","doi":"10.1002/jum.70082","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To develop and validate an ultrasonography-based machine learning (ML) model for predicting malignant endometrial and cavitary lesions.</p><p><strong>Methods: </strong>This retrospective study was conducted on patients with pathologically confirmed results following transvaginal or transrectal ultrasound from 2021 to 2023. Endometrial ultrasound features were characterized using the International Endometrial Tumor Analysis (IETA) terminology. The dataset was ranomly divided (7:3) into training and validation sets. LASSO (least absolute shrinkage and selection operator) regression was applied for feature selection, and an extreme gradient boosting (XGBoost) model was developed. Performance was assessed via receiver operating characteristic (ROC) analysis, calibration, decision curve analysis, sensitivity, specificity, and accuracy.</p><p><strong>Results: </strong>Among 1080 patients, 6 had a non-measurable endometrium. Of the remaining 1074 cases, 641 were premenopausal and 433 postmenopausal. Performance of the XGBoost model on the test set: The area under the curve (AUC) for the premenopausal group was 0.845 (0.781-0.909), with a relatively low sensitivity (0.588, 0.442-0.722) and a relatively high specificity (0.923, 0.863-0.959); the AUC for the postmenopausal group was 0.968 (0.944-0.992), with both sensitivity (0.895, 0.778-0.956) and specificity (0.931, 0.839-0.974) being relatively high. SHapley Additive exPlanations (SHAP) analysis identified key predictors: endometrial-myometrial junction, endometrial thickness, endometrial echogenicity, color Doppler flow score, and vascular pattern in premenopausal women; endometrial thickness, endometrial-myometrial junction, endometrial echogenicity, and color Doppler flow score in postmenopausal women.</p><p><strong>Conclusion: </strong>The XGBoost-based model exhibited excellent predictive performance, particularly in postmenopausal patients. SHAP analysis further enhances interpretability by identifying key ultrasonographic predictors of malignancy.</p>","PeriodicalId":17563,"journal":{"name":"Journal of Ultrasound in Medicine","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ultrasound in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/jum.70082","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: To develop and validate an ultrasonography-based machine learning (ML) model for predicting malignant endometrial and cavitary lesions.

Methods: This retrospective study was conducted on patients with pathologically confirmed results following transvaginal or transrectal ultrasound from 2021 to 2023. Endometrial ultrasound features were characterized using the International Endometrial Tumor Analysis (IETA) terminology. The dataset was ranomly divided (7:3) into training and validation sets. LASSO (least absolute shrinkage and selection operator) regression was applied for feature selection, and an extreme gradient boosting (XGBoost) model was developed. Performance was assessed via receiver operating characteristic (ROC) analysis, calibration, decision curve analysis, sensitivity, specificity, and accuracy.

Results: Among 1080 patients, 6 had a non-measurable endometrium. Of the remaining 1074 cases, 641 were premenopausal and 433 postmenopausal. Performance of the XGBoost model on the test set: The area under the curve (AUC) for the premenopausal group was 0.845 (0.781-0.909), with a relatively low sensitivity (0.588, 0.442-0.722) and a relatively high specificity (0.923, 0.863-0.959); the AUC for the postmenopausal group was 0.968 (0.944-0.992), with both sensitivity (0.895, 0.778-0.956) and specificity (0.931, 0.839-0.974) being relatively high. SHapley Additive exPlanations (SHAP) analysis identified key predictors: endometrial-myometrial junction, endometrial thickness, endometrial echogenicity, color Doppler flow score, and vascular pattern in premenopausal women; endometrial thickness, endometrial-myometrial junction, endometrial echogenicity, and color Doppler flow score in postmenopausal women.

Conclusion: The XGBoost-based model exhibited excellent predictive performance, particularly in postmenopausal patients. SHAP analysis further enhances interpretability by identifying key ultrasonographic predictors of malignancy.

子宫内膜癌诊断的高性能超声预测模型的开发:利用SHAP分析可解释的XGBoost算法。
目的:建立并验证基于超声的机器学习(ML)模型预测子宫内膜和子宫腔恶性病变。方法:对2021 ~ 2023年经阴道或经直肠超声病理证实的患者进行回顾性研究。使用国际子宫内膜肿瘤分析(IETA)术语对子宫内膜超声特征进行表征。数据集被随机分为训练集和验证集(7:3)。采用LASSO(最小绝对收缩和选择算子)回归进行特征选择,并建立了极端梯度增强(XGBoost)模型。通过受试者工作特征(ROC)分析、校准、决策曲线分析、敏感性、特异性和准确性来评估疗效。结果:1080例患者中,6例子宫内膜不可测。在剩余的1074例中,641例为绝经前,433例为绝经后。XGBoost模型在测试集上的表现:绝经前组曲线下面积(AUC)为0.845(0.781-0.909),敏感性较低(0.588,0.442-0.722),特异性较高(0.923,0.863-0.959);绝经后组AUC为0.968(0.944 ~ 0.992),敏感性(0.895,0.778 ~ 0.956)和特异性(0.931,0.839 ~ 0.974)均较高。SHapley加性解释(SHAP)分析确定了关键的预测因素:绝经前妇女子宫内膜-子宫肌交界处、子宫内膜厚度、子宫内膜回声性、彩色多普勒血流评分和血管模式;绝经后妇女子宫内膜厚度,子宫内膜-子宫内膜交界处,子宫内膜回声性和彩色多普勒血流评分。结论:基于xgboost的模型具有出色的预测性能,特别是对绝经后患者。SHAP分析通过识别关键的超声预测因素进一步提高了恶性肿瘤的可解释性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.10
自引率
4.30%
发文量
205
审稿时长
1.5 months
期刊介绍: The Journal of Ultrasound in Medicine (JUM) is dedicated to the rapid, accurate publication of original articles dealing with all aspects of medical ultrasound, particularly its direct application to patient care but also relevant basic science, advances in instrumentation, and biological effects. The journal is an official publication of the American Institute of Ultrasound in Medicine and publishes articles in a variety of categories, including Original Research papers, Review Articles, Pictorial Essays, Technical Innovations, Case Series, Letters to the Editor, and more, from an international bevy of countries in a continual effort to showcase and promote advances in the ultrasound community. Represented through these efforts are a wide variety of disciplines of ultrasound, including, but not limited to: -Basic Science- Breast Ultrasound- Contrast-Enhanced Ultrasound- Dermatology- Echocardiography- Elastography- Emergency Medicine- Fetal Echocardiography- Gastrointestinal Ultrasound- General and Abdominal Ultrasound- Genitourinary Ultrasound- Gynecologic Ultrasound- Head and Neck Ultrasound- High Frequency Clinical and Preclinical Imaging- Interventional-Intraoperative Ultrasound- Musculoskeletal Ultrasound- Neurosonology- Obstetric Ultrasound- Ophthalmologic Ultrasound- Pediatric Ultrasound- Point-of-Care Ultrasound- Public Policy- Superficial Structures- Therapeutic Ultrasound- Ultrasound Education- Ultrasound in Global Health- Urologic Ultrasound- Vascular Ultrasound
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信