基于超声的可解释机器学习模型的开发和验证,用于≤3cm肝细胞癌的分类:一项多中心回顾性诊断研究。

IF 9.6 1区 医学 Q1 MEDICINE, GENERAL & INTERNAL
EClinicalMedicine Pub Date : 2025-02-13 eCollection Date: 2025-03-01 DOI:10.1016/j.eclinm.2025.103098
Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang
{"title":"基于超声的可解释机器学习模型的开发和验证,用于≤3cm肝细胞癌的分类:一项多中心回顾性诊断研究。","authors":"Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang","doi":"10.1016/j.eclinm.2025.103098","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.</p><p><strong>Methods: </strong>A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (Model<sup>U</sup> utilizing US imaging features; Model<sup>UR</sup> adding US radiomics features; Model<sup>URC</sup> employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).</p><p><strong>Findings: </strong>Model<sup>URC</sup> based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, Model<sup>URC</sup> also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the Model<sup>URC</sup> neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. Model<sup>URC</sup> exhibited superior ability compared to all radiologists and Model<sup>URC</sup> assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).</p><p><strong>Interpretation: </strong>A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.</p><p><strong>Funding: </strong>This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).</p>","PeriodicalId":11393,"journal":{"name":"EClinicalMedicine","volume":"81 ","pages":"103098"},"PeriodicalIF":9.6000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872562/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of an ultrasound-based interpretable machine learning model for the classification of ≤3 cm hepatocellular carcinoma: a multicentre retrospective diagnostic study.\",\"authors\":\"Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang\",\"doi\":\"10.1016/j.eclinm.2025.103098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.</p><p><strong>Methods: </strong>A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (Model<sup>U</sup> utilizing US imaging features; Model<sup>UR</sup> adding US radiomics features; Model<sup>URC</sup> employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).</p><p><strong>Findings: </strong>Model<sup>URC</sup> based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, Model<sup>URC</sup> also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the Model<sup>URC</sup> neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. Model<sup>URC</sup> exhibited superior ability compared to all radiologists and Model<sup>URC</sup> assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).</p><p><strong>Interpretation: </strong>A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.</p><p><strong>Funding: </strong>This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).</p>\",\"PeriodicalId\":11393,\"journal\":{\"name\":\"EClinicalMedicine\",\"volume\":\"81 \",\"pages\":\"103098\"},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2025-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872562/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EClinicalMedicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.eclinm.2025.103098\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EClinicalMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.eclinm.2025.103098","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

摘要

背景:本研究旨在建立一种机器学习(ML)模型,利用灰度超声(US)来区分≤3cm的小肝细胞癌(sHCC)和非hcc病变。方法:2017年5月至2021年6月,共收集55家医院1052例肝病变≤3 cm的患者,其中756例肝病变按8:2的比例随机分配到训练组和内部验证组,用于基于多层感知器(MLP)和极端梯度增强(XGBoost)方法的ML模型的开发和评估(利用US成像特征的ModelU;ModelUR增加了美国放射组学功能;ModelURC进一步采用临床特征)。在外部验证队列(来自14家医院的312例肝脏病变)中评估了三种模型的诊断性能。将最优模型的诊断效果与外部验证队列放射科医师的诊断效果进行比较。采用SHapley加性解释(SHAP)方法通过对特征重要性排序来解释最优ML模型。该研究已在ClinicalTrials.gov注册(NCT03871140)。结果表明:基于ModelURC的XGBoost优化效果最佳(AUC = 0.934;95% CI: 0.894-0.974)。在外部验证队列中,ModelURC也达到了最佳AUC (AUC = 0.899, 95% CI: 0.861-0.931)。在进行亚组分析时,无论是肿瘤大小≤2.0 cm和2.1-3.0 cm之间,还是不同HCC危险分层之间,ModelURC的诊断性能均无统计学差异。与所有放射科医生相比,ModelURC表现出卓越的能力,ModelURC的辅助显著提高了所有放射科医生的诊断AUC(所有P)解释:使用ML和来自大型队列的灰度US开发并验证了sHCC的诊断模型。与专家相比,该模型显著提高了灰度US对sHCC的诊断性能。基金资助:国家重点研发计划项目(2022YFC2405500)、国家自然科学基金重大研究计划项目(92159305)、国家杰出青年科学基金项目(82325027)、国家自然科学基金重点项目(82030047)、军队老年病防治基金项目(20BJZ42)、国家自然科学基金专项项目(82441011)。国家自然科学基金项目(82402280),国家自然科学基金项目(32171363),云南省科技厅社会发展重点研究发展计划项目(202403AC100014)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development and validation of an ultrasound-based interpretable machine learning model for the classification of ≤3 cm hepatocellular carcinoma: a multicentre retrospective diagnostic study.

Background: Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.

Methods: A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (ModelU utilizing US imaging features; ModelUR adding US radiomics features; ModelURC employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).

Findings: ModelURC based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, ModelURC also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the ModelURC neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. ModelURC exhibited superior ability compared to all radiologists and ModelURC assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).

Interpretation: A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.

Funding: This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
EClinicalMedicine
EClinicalMedicine Medicine-Medicine (all)
CiteScore
18.90
自引率
1.30%
发文量
506
审稿时长
22 days
期刊介绍: eClinicalMedicine is a gold open-access clinical journal designed to support frontline health professionals in addressing the complex and rapid health transitions affecting societies globally. The journal aims to assist practitioners in overcoming healthcare challenges across diverse communities, spanning diagnosis, treatment, prevention, and health promotion. Integrating disciplines from various specialties and life stages, it seeks to enhance health systems as fundamental institutions within societies. With a forward-thinking approach, eClinicalMedicine aims to redefine the future of healthcare.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信