Construction of a predictive model for the risk of moderate-to-severe cancer-related fatigue in colorectal cancer chemotherapy patients: an interpretable machine learning approach.

IF 3 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Tian Xiao, Fangyi Li, Linyu Zhou, Ruihan Xiao, Ting Chen, Xiaoli Huang, Qing Li, Ya Zhang, Ling Yang, Xueqin Qiu, Xiaoju Chen
{"title":"Construction of a predictive model for the risk of moderate-to-severe cancer-related fatigue in colorectal cancer chemotherapy patients: an interpretable machine learning approach.","authors":"Tian Xiao, Fangyi Li, Linyu Zhou, Ruihan Xiao, Ting Chen, Xiaoli Huang, Qing Li, Ya Zhang, Ling Yang, Xueqin Qiu, Xiaoju Chen","doi":"10.1007/s00520-025-09950-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to analyze the influencing factors of moderate-to-severe cancer-related fatigue (CRF) in colorectal cancer (CRC) chemotherapy patients and to develop a predictive risk stratification model.</p><p><strong>Methods: </strong>A total of 630 CRC chemotherapy patients were selected from five hospitals in China. Data were collected using a general information forms, the Piper Fatigue Scale-Revised (PFS-R), the Hospital Anxiety and Depression Scale (HADS), and the Pittsburgh Sleep Quality Index (PSQI). The data was randomly divided into a training set and a test set in a 7:3 ratio, and feature selection was performed using univariate analysis and LASSO regression. Five machine learning algorithms were used to construct moderate-to-severe CRF models. The Shapley additive explanation (SHAP) method is used to increase the interpretability of the optimal performance model.</p><p><strong>Results: </strong>The overall incidence of moderate-to-severe CRF was 70.5%. The random forest (RF) model performed the best, with an AUC of 0.906, sensitivity of 0.943, accuracy of 0.931, precision of 0.977, specificity of 0.848, and F1 score of 0.960. Based on the analysis of the absolute mean SHAP values, the feature importance of the RF model, from highest to lowest, was sleep quality score, anxiety score, anorexia, magnesium ion concentration, smoking history, place of residence, and cancer stage.</p><p><strong>Conclusions: </strong>The RF model demonstrated superior predictive performance, positioning it as a viable screening tool for assessing the risk of moderate-to-severe CRF in CRC patients receiving chemotherapy. This approach may facilitate early intervention and improve clinical management of CRF symptoms.</p>","PeriodicalId":22046,"journal":{"name":"Supportive Care in Cancer","volume":"33 10","pages":"882"},"PeriodicalIF":3.0000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Supportive Care in Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00520-025-09950-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aimed to analyze the influencing factors of moderate-to-severe cancer-related fatigue (CRF) in colorectal cancer (CRC) chemotherapy patients and to develop a predictive risk stratification model.

Methods: A total of 630 CRC chemotherapy patients were selected from five hospitals in China. Data were collected using a general information forms, the Piper Fatigue Scale-Revised (PFS-R), the Hospital Anxiety and Depression Scale (HADS), and the Pittsburgh Sleep Quality Index (PSQI). The data was randomly divided into a training set and a test set in a 7:3 ratio, and feature selection was performed using univariate analysis and LASSO regression. Five machine learning algorithms were used to construct moderate-to-severe CRF models. The Shapley additive explanation (SHAP) method is used to increase the interpretability of the optimal performance model.

Results: The overall incidence of moderate-to-severe CRF was 70.5%. The random forest (RF) model performed the best, with an AUC of 0.906, sensitivity of 0.943, accuracy of 0.931, precision of 0.977, specificity of 0.848, and F1 score of 0.960. Based on the analysis of the absolute mean SHAP values, the feature importance of the RF model, from highest to lowest, was sleep quality score, anxiety score, anorexia, magnesium ion concentration, smoking history, place of residence, and cancer stage.

Conclusions: The RF model demonstrated superior predictive performance, positioning it as a viable screening tool for assessing the risk of moderate-to-severe CRF in CRC patients receiving chemotherapy. This approach may facilitate early intervention and improve clinical management of CRF symptoms.

结直肠癌化疗患者中至重度癌症相关疲劳风险预测模型的构建:可解释的机器学习方法
目的:本研究旨在分析结直肠癌(CRC)化疗患者中重度癌症相关疲劳(CRF)的影响因素,并建立预测风险分层模型。方法:选择全国5家医院的结直肠癌化疗患者630例。使用一般信息表、Piper疲劳量表修订版(PFS-R)、医院焦虑和抑郁量表(HADS)和匹兹堡睡眠质量指数(PSQI)收集数据。将数据按7:3的比例随机分为训练集和测试集,采用单变量分析和LASSO回归进行特征选择。使用五种机器学习算法构建中重度CRF模型。采用Shapley加性解释(SHAP)方法提高了最优性能模型的可解释性。结果:中重度CRF总发生率为70.5%。随机森林(random forest, RF)模型的AUC为0.906,灵敏度为0.943,准确度为0.931,精密度为0.977,特异性为0.848,F1评分为0.960。通过对绝对均值SHAP值的分析,RF模型的特征重要性由高到低依次为睡眠质量评分、焦虑评分、厌食症、镁离子浓度、吸烟史、居住地、癌症分期。结论:RF模型显示出优越的预测性能,将其定位为评估接受化疗的CRC患者中至重度CRF风险的可行筛选工具。这种方法可以促进早期干预和改善CRF症状的临床管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Supportive Care in Cancer
Supportive Care in Cancer 医学-康复医学
CiteScore
5.70
自引率
9.70%
发文量
751
审稿时长
3 months
期刊介绍: Supportive Care in Cancer provides members of the Multinational Association of Supportive Care in Cancer (MASCC) and all other interested individuals, groups and institutions with the most recent scientific and social information on all aspects of supportive care in cancer patients. It covers primarily medical, technical and surgical topics concerning supportive therapy and care which may supplement or substitute basic cancer treatment at all stages of the disease. Nursing, rehabilitative, psychosocial and spiritual issues of support are also included.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信