Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin
{"title":"预测肝癌患者haic后的耐药性和生存率:基于Shapley加性解释和机器学习。","authors":"Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin","doi":"10.2147/JHC.S523806","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).</p><p><strong>Patients and methods: </strong>We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.</p><p><strong>Results: </strong>The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.</p><p><strong>Conclusion: </strong>Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.</p>","PeriodicalId":15906,"journal":{"name":"Journal of Hepatocellular Carcinoma","volume":"12 ","pages":"1111-1128"},"PeriodicalIF":4.2000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12140075/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting Resistance and Survival of HCC Patients Post-HAIC: Based on Shapley Additive exPlanations and Machine Learning.\",\"authors\":\"Fan Yao, Jianliang Miao, Bing Quan, Jinghuan Li, Bei Tang, Shenxin Lu, Xin Yin\",\"doi\":\"10.2147/JHC.S523806\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).</p><p><strong>Patients and methods: </strong>We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.</p><p><strong>Results: </strong>The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.</p><p><strong>Conclusion: </strong>Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.</p>\",\"PeriodicalId\":15906,\"journal\":{\"name\":\"Journal of Hepatocellular Carcinoma\",\"volume\":\"12 \",\"pages\":\"1111-1128\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12140075/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hepatocellular Carcinoma\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2147/JHC.S523806\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hepatocellular Carcinoma","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/JHC.S523806","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
Predicting Resistance and Survival of HCC Patients Post-HAIC: Based on Shapley Additive exPlanations and Machine Learning.
Purpose: To establish prediction models using Shapley Additive exPlanations (SHAP) and multiple machine learning (ML) algorithms to identify clinical features influencing hepatic arterial infusion chemotherapy (HAIC) resistance and survival in patients with hepatocellular carcinoma (HCC).
Patients and methods: We recruited 286 patients with unresectable HCC who underwent HAIC. Patients were divided into training and validation datasets (7:3 ratio). eXtreme Gradient Boosting (XGBoost) was used to build the preliminary resistance prediction model. The SHAP values explained the importance of the clinical features. Recursive Feature Elimination with Cross-Validation (RFECV) was used to select the optimum number of features. Seven ML methods were used to construct further resistance prediction models, and ten ML algorithms were employed to establish the survival prognosis models.
Results: The areas under the curve (AUC) of the XGBoost model were 1.000 and 0.812 for the training and validation groups, respectively. SHAP identified 27 of the 38 clinical features affecting resistance, with pre-HAIC treatment being the main factor. RFECV showed the best model performance with six features (pre-HAIC treatment, tumor size, HBV DNA, alkaline phosphatase (AKP), prothrombin time (PT), and portal vein tumor thrombosis (PVTT)). Random Forest had the best performance among the seven ML algorithms (AUC=0.935 for training, AUC=0.876 for validation). The combination of Stepcox [forward] and Gradient Boosting Machine was the best for predicting survival (AUC=0.98 in training, AUC=0.83 in validation). Based on the above clinical characteristics, patients were categorized into high-risk and low-risk groups based on the median risk score, and it was found that these characteristics also performed well in the prognostic model for predicting the survival of patients with HCC.
Conclusion: Pre-HAIC treatment, tumor size, HBV DNA, AKP, PT, and PVTT are effective predictors of post-HAIC resistance and survival in patients with unresectable advanced HCC.