{"title":"结直肠癌化疗患者中至重度癌症相关疲劳风险预测模型的构建:可解释的机器学习方法","authors":"Tian Xiao, Fangyi Li, Linyu Zhou, Ruihan Xiao, Ting Chen, Xiaoli Huang, Qing Li, Ya Zhang, Ling Yang, Xueqin Qiu, Xiaoju Chen","doi":"10.1007/s00520-025-09950-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to analyze the influencing factors of moderate-to-severe cancer-related fatigue (CRF) in colorectal cancer (CRC) chemotherapy patients and to develop a predictive risk stratification model.</p><p><strong>Methods: </strong>A total of 630 CRC chemotherapy patients were selected from five hospitals in China. Data were collected using a general information forms, the Piper Fatigue Scale-Revised (PFS-R), the Hospital Anxiety and Depression Scale (HADS), and the Pittsburgh Sleep Quality Index (PSQI). The data was randomly divided into a training set and a test set in a 7:3 ratio, and feature selection was performed using univariate analysis and LASSO regression. Five machine learning algorithms were used to construct moderate-to-severe CRF models. The Shapley additive explanation (SHAP) method is used to increase the interpretability of the optimal performance model.</p><p><strong>Results: </strong>The overall incidence of moderate-to-severe CRF was 70.5%. The random forest (RF) model performed the best, with an AUC of 0.906, sensitivity of 0.943, accuracy of 0.931, precision of 0.977, specificity of 0.848, and F1 score of 0.960. Based on the analysis of the absolute mean SHAP values, the feature importance of the RF model, from highest to lowest, was sleep quality score, anxiety score, anorexia, magnesium ion concentration, smoking history, place of residence, and cancer stage.</p><p><strong>Conclusions: </strong>The RF model demonstrated superior predictive performance, positioning it as a viable screening tool for assessing the risk of moderate-to-severe CRF in CRC patients receiving chemotherapy. This approach may facilitate early intervention and improve clinical management of CRF symptoms.</p>","PeriodicalId":22046,"journal":{"name":"Supportive Care in Cancer","volume":"33 10","pages":"882"},"PeriodicalIF":3.0000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Construction of a predictive model for the risk of moderate-to-severe cancer-related fatigue in colorectal cancer chemotherapy patients: an interpretable machine learning approach.\",\"authors\":\"Tian Xiao, Fangyi Li, Linyu Zhou, Ruihan Xiao, Ting Chen, Xiaoli Huang, Qing Li, Ya Zhang, Ling Yang, Xueqin Qiu, Xiaoju Chen\",\"doi\":\"10.1007/s00520-025-09950-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>This study aimed to analyze the influencing factors of moderate-to-severe cancer-related fatigue (CRF) in colorectal cancer (CRC) chemotherapy patients and to develop a predictive risk stratification model.</p><p><strong>Methods: </strong>A total of 630 CRC chemotherapy patients were selected from five hospitals in China. Data were collected using a general information forms, the Piper Fatigue Scale-Revised (PFS-R), the Hospital Anxiety and Depression Scale (HADS), and the Pittsburgh Sleep Quality Index (PSQI). The data was randomly divided into a training set and a test set in a 7:3 ratio, and feature selection was performed using univariate analysis and LASSO regression. Five machine learning algorithms were used to construct moderate-to-severe CRF models. The Shapley additive explanation (SHAP) method is used to increase the interpretability of the optimal performance model.</p><p><strong>Results: </strong>The overall incidence of moderate-to-severe CRF was 70.5%. The random forest (RF) model performed the best, with an AUC of 0.906, sensitivity of 0.943, accuracy of 0.931, precision of 0.977, specificity of 0.848, and F1 score of 0.960. Based on the analysis of the absolute mean SHAP values, the feature importance of the RF model, from highest to lowest, was sleep quality score, anxiety score, anorexia, magnesium ion concentration, smoking history, place of residence, and cancer stage.</p><p><strong>Conclusions: </strong>The RF model demonstrated superior predictive performance, positioning it as a viable screening tool for assessing the risk of moderate-to-severe CRF in CRC patients receiving chemotherapy. This approach may facilitate early intervention and improve clinical management of CRF symptoms.</p>\",\"PeriodicalId\":22046,\"journal\":{\"name\":\"Supportive Care in Cancer\",\"volume\":\"33 10\",\"pages\":\"882\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Supportive Care in Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s00520-025-09950-4\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Supportive Care in Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00520-025-09950-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Construction of a predictive model for the risk of moderate-to-severe cancer-related fatigue in colorectal cancer chemotherapy patients: an interpretable machine learning approach.
Purpose: This study aimed to analyze the influencing factors of moderate-to-severe cancer-related fatigue (CRF) in colorectal cancer (CRC) chemotherapy patients and to develop a predictive risk stratification model.
Methods: A total of 630 CRC chemotherapy patients were selected from five hospitals in China. Data were collected using a general information forms, the Piper Fatigue Scale-Revised (PFS-R), the Hospital Anxiety and Depression Scale (HADS), and the Pittsburgh Sleep Quality Index (PSQI). The data was randomly divided into a training set and a test set in a 7:3 ratio, and feature selection was performed using univariate analysis and LASSO regression. Five machine learning algorithms were used to construct moderate-to-severe CRF models. The Shapley additive explanation (SHAP) method is used to increase the interpretability of the optimal performance model.
Results: The overall incidence of moderate-to-severe CRF was 70.5%. The random forest (RF) model performed the best, with an AUC of 0.906, sensitivity of 0.943, accuracy of 0.931, precision of 0.977, specificity of 0.848, and F1 score of 0.960. Based on the analysis of the absolute mean SHAP values, the feature importance of the RF model, from highest to lowest, was sleep quality score, anxiety score, anorexia, magnesium ion concentration, smoking history, place of residence, and cancer stage.
Conclusions: The RF model demonstrated superior predictive performance, positioning it as a viable screening tool for assessing the risk of moderate-to-severe CRF in CRC patients receiving chemotherapy. This approach may facilitate early intervention and improve clinical management of CRF symptoms.
期刊介绍:
Supportive Care in Cancer provides members of the Multinational Association of Supportive Care in Cancer (MASCC) and all other interested individuals, groups and institutions with the most recent scientific and social information on all aspects of supportive care in cancer patients. It covers primarily medical, technical and surgical topics concerning supportive therapy and care which may supplement or substitute basic cancer treatment at all stages of the disease.
Nursing, rehabilitative, psychosocial and spiritual issues of support are also included.