预测肝癌患者haic后的耐药性和生存率:基于Shapley加性解释和机器学习。

IF 4.2 3区 医学 Q2 ONCOLOGY
Journal of Hepatocellular Carcinoma Pub Date : 2025-05-31 eCollection Date: 2025-01-01 DOI:10.2147/JHC.S523806
Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin
{"title":"预测肝癌患者haic后的耐药性和生存率:基于Shapley加性解释和机器学习。","authors":"Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin","doi":"10.2147/JHC.S523806","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).</p><p><strong>Patients and methods: </strong>We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.</p><p><strong>Results: </strong>The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.</p><p><strong>Conclusion: </strong>Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.</p>","PeriodicalId":15906,"journal":{"name":"Journal of Hepatocellular Carcinoma","volume":"12 ","pages":"1111-1128"},"PeriodicalIF":4.2000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12140075/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting Resistance and Survival of HCC Patients Post-HAIC: Based on Shapley Additive exPlanations and Machine Learning.\",\"authors\":\"Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin\",\"doi\":\"10.2147/JHC.S523806\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).</p><p><strong>Patients and methods: </strong>We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.</p><p><strong>Results: </strong>The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.</p><p><strong>Conclusion: </strong>Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.</p>\",\"PeriodicalId\":15906,\"journal\":{\"name\":\"Journal of Hepatocellular Carcinoma\",\"volume\":\"12 \",\"pages\":\"1111-1128\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12140075/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hepatocellular Carcinoma\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2147/JHC.S523806\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hepatocellular Carcinoma","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JHC.S523806","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

目的:利用Shapley加性解释(SHAP)和多机器学习(ML)算法建立预测模型,识别影响肝动脉输注化疗(HAIC)耐药和肝细胞癌(HCC)患者生存的临床特征。患者和方法:我们招募了286例接受HAIC治疗的不可切除HCC患者。将患者分为训练数据集和验证数据集,比例为7:3。利用极限梯度增压(eXtreme Gradient boost, XGBoost)建立了初步的电阻预测模型。SHAP值解释了临床特征的重要性。采用交叉验证递归特征消除法(RFECV)选择最优特征数量。采用7种ML方法构建进一步的耐药预测模型,采用10种ML算法建立生存预后模型。结果:训练组和验证组XGBoost模型的曲线下面积(AUC)分别为1.000和0.812。SHAP确定了38个影响耐药的临床特征中的27个,其中haic前治疗是主要因素。RFECV在haic治疗前、肿瘤大小、HBV DNA、碱性磷酸酶(AKP)、凝血酶原时间(PT)、门静脉肿瘤血栓形成(PVTT) 6个指标上表现最佳。随机森林算法在7种ML算法中表现最好(训练的AUC=0.935,验证的AUC=0.876)。结合Stepcox [forward]和Gradient Boosting Machine预测生存率最佳(训练时AUC=0.98,验证时AUC=0.83)。基于上述临床特征,根据中位风险评分将患者分为高危组和低危组,发现这些特征在预测HCC患者生存的预后模型中也表现良好。结论:haic前治疗、肿瘤大小、HBV DNA、AKP、PT和PVTT是无法切除的晚期HCC患者haic后耐药和生存的有效预测因子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting Resistance and Survival of HCC Patients Post-HAIC: Based on Shapley Additive exPlanations and Machine Learning.

Purpose: To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).

Patients and methods: We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.

Results: The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.

Conclusion: Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
0.50
自引率
2.40%
发文量
108
审稿时长
16 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信