基于超声的可解释机器学习模型的开发和验证，用于≤3cm肝细胞癌的分类：一项多中心回顾性诊断研究。

IF 9.6 1区医学 Q1 MEDICINE, GENERAL & INTERNAL

EClinicalMedicine Pub Date : 2025-02-13 eCollection Date: 2025-03-01 DOI:10.1016/j.eclinm.2025.103098

Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang

{"title":"基于超声的可解释机器学习模型的开发和验证，用于≤3cm肝细胞癌的分类：一项多中心回顾性诊断研究。","authors":"Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang","doi":"10.1016/j.eclinm.2025.103098","DOIUrl":null,"url":null,"abstract":"Background: Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.Methods: A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (ModelU utilizing US imaging features; ModelUR adding US radiomics features; ModelURC employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).Findings: ModelURC based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, ModelURC also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the ModelURC neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. ModelURC exhibited superior ability compared to all radiologists and ModelURC assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).Interpretation: A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.Funding: This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).","PeriodicalId":11393,"journal":{"name":"EClinicalMedicine","volume":"81 ","pages":"103098"},"PeriodicalIF":9.6000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872562/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and validation of an ultrasound-based interpretable machine learning model for the classification of ≤3 cm hepatocellular carcinoma: a multicentre retrospective diagnostic study.\",\"authors\":\"Zhicheng Du, Fangying Fan, Jun Ma, Jing Liu, Xing Yan, Xuexue Chen, Yangfang Dong, Jiapeng Wu, Wenzhen Ding, Qinxian Zhao, Yuling Wang, Guojun Zhang, Jie Yu, Ping Liang\",\"doi\":\"10.1016/j.eclinm.2025.103098\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.Methods: A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (ModelU utilizing US imaging features; ModelUR adding US radiomics features; ModelURC employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).Findings: ModelURC based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, ModelURC also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the ModelURC neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. ModelURC exhibited superior ability compared to all radiologists and ModelURC assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).Interpretation: A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.Funding: This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).\",\"PeriodicalId\":11393,\"journal\":{\"name\":\"EClinicalMedicine\",\"volume\":\"81 \",\"pages\":\"103098\"},\"PeriodicalIF\":9.6000,\"publicationDate\":\"2025-02-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872562/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EClinicalMedicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.eclinm.2025.103098\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"MEDICINE, GENERAL & INTERNAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EClinicalMedicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.eclinm.2025.103098","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

摘要

背景：本研究旨在建立一种机器学习（ML）模型，利用灰度超声（US）来区分≤3cm的小肝细胞癌（sHCC）和非hcc病变。方法：2017年5月至2021年6月，共收集55家医院1052例肝病变≤3 cm的患者，其中756例肝病变按8:2的比例随机分配到训练组和内部验证组，用于基于多层感知器（MLP）和极端梯度增强（XGBoost）方法的ML模型的开发和评估(利用US成像特征的ModelU；ModelUR增加了美国放射组学功能；ModelURC进一步采用临床特征)。在外部验证队列（来自14家医院的312例肝脏病变）中评估了三种模型的诊断性能。将最优模型的诊断效果与外部验证队列放射科医师的诊断效果进行比较。采用SHapley加性解释（SHAP）方法通过对特征重要性排序来解释最优ML模型。该研究已在ClinicalTrials.gov注册（NCT03871140）。结果表明：基于ModelURC的XGBoost优化效果最佳(AUC = 0.934；95% CI: 0.894-0.974)。在外部验证队列中，ModelURC也达到了最佳AUC （AUC = 0.899, 95% CI: 0.861-0.931）。在进行亚组分析时，无论是肿瘤大小≤2.0 cm和2.1-3.0 cm之间，还是不同HCC危险分层之间，ModelURC的诊断性能均无统计学差异。与所有放射科医生相比，ModelURC表现出卓越的能力，ModelURC的辅助显著提高了所有放射科医生的诊断AUC（所有P）解释：使用ML和来自大型队列的灰度US开发并验证了sHCC的诊断模型。与专家相比，该模型显著提高了灰度US对sHCC的诊断性能。基金资助：国家重点研发计划项目（2022YFC2405500）、国家自然科学基金重大研究计划项目（92159305）、国家杰出青年科学基金项目（82325027）、国家自然科学基金重点项目（82030047）、军队老年病防治基金项目（20BJZ42）、国家自然科学基金专项项目（82441011）。国家自然科学基金项目（82402280），国家自然科学基金项目（32171363），云南省科技厅社会发展重点研究发展计划项目（202403AC100014）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Development and validation of an ultrasound-based interpretable machine learning model for the classification of ≤3 cm hepatocellular carcinoma: a multicentre retrospective diagnostic study.

Background: Our study aimed to develop a machine learning (ML) model utilizing grayscale ultrasound (US) to distinguish ≤3 cm small hepatocellular carcinoma (sHCC) from non-HCC lesions.

Methods: A total of 1052 patients with 1058 liver lesions ≤3 cm from 55 hospitals were collected between May 2017 and June 2021, and 756 liver lesions were randomly allocated into train and internal validation cohorts at a 8:2 ratio for the development and evaluation of ML models based on multilayer perceptron (MLP) and extreme gradient boosting (XGBoost) methods (Model^U utilizing US imaging features; Model^UR adding US radiomics features; Model^URC employing clinical features further). The diagnostic performance of three models was assessed in external validation cohort (312 liver lesions from 14 hospitals). The diagnostic efficacy of the optimal model was compared to that of radiologists in external validation cohort. The SHapley Additive exPlanations (SHAP) method was employed to interpret the optimal ML model by ranking feature importance. The study was registered at ClinicalTrials.gov (NCT03871140).

Findings: Model^URC based XGBoost showed the best performance (AUC = 0.934; 95% CI: 0.894-0.974) in the internal validation cohort. In the external validation cohort, Model^URC also achieved optimal AUC (AUC = 0.899, 95% CI: 0.861-0.931). Upon conducting a subgroup analysis, no statistically significant differences were observed in the diagnostic performance of the Model^URC neither between tumor sizes of ≤2.0 cm and 2.1-3.0 cm nor across different HCC risk stratifications. Model^URC exhibited superior ability compared to all radiologists and Model^URC assistance significantly improved the diagnostic AUC for all radiologists (all P < 0.0001).

Interpretation: A diagnostic model for sHCC was developed and validated using ML and grayscale US from large cohorts. This model significantly improved the diagnostic performance of grayscale US for sHCC compared with experts.

Funding: This work was supported by National Key Research and Development Program of China (2022YFC2405500), Major Research Program of the National Natural Science Foundation of China (92159305), National Science Fund for Distinguished Young Scholars (82325027), Key project of National Natural Science Foundation of China (82030047), Military Fund for Geriatric Diseases (20BJZ42), National Natural Science Foundation of China Special Program (82441011). National Natural Science Foundation of China (82402280), National Natural Science Foundation of China (32171363), Key Research and Development Program for Social Development of Yunnan Science and Technology Department (202403AC100014).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

EClinicalMedicine Medicine-Medicine (all)

CiteScore

18.90

自引率

1.30%

发文量

506

审稿时长

22 days

期刊介绍： eClinicalMedicine is a gold open-access clinical journal designed to support frontline health professionals in addressing the complex and rapid health transitions affecting societies globally. The journal aims to assist practitioners in overcoming healthcare challenges across diverse communities, spanning diagnosis, treatment, prevention, and health promotion. Integrating disciplines from various specialties and life stages, it seeks to enhance health systems as fundamental institutions within societies. With a forward-thinking approach, eClinicalMedicine aims to redefine the future of healthcare.