Regression analysis and validation of risk factors for upper limb dysfunction following modified radical mastectomy for breast cancer patients.

IF 1.7 4区医学 Q3 MEDICINE, RESEARCH & EXPERIMENTAL

American journal of translational research Pub Date : 2025-04-15 eCollection Date: 2025-01-01 DOI:10.62347/CZYA6232

Yonggang Li, Shuan Hui

{"title":"Regression analysis and validation of risk factors for upper limb dysfunction following modified radical mastectomy for breast cancer patients.","authors":"Yonggang Li, Shuan Hui","doi":"10.62347/CZYA6232","DOIUrl":null,"url":null,"abstract":"Objective: To develop and validate a predictive tool using machine learning models for identifying risk factors for upper limb dysfunction following modified radical mastectomy (MRM) in breast cancer patients.Methods: A total of 768 breast cancer patients who underwent Modified radical mastectomy (MRM) between January 2022 and December 2023 were included in this study. The dataset was divided into a training set (506 cases) and a validation set (262 cases). The collected data encompassed demographic characteristics, clinicopathological features, medical history, and postoperative rehabilitation plans. Predictive analyses were conducted using machine learning models, including support vector machine (SVM), extreme gradient boosting (XGBOOST), Gaussian naïve Bayes (GNB), adaptive boosting (ADABOOST), and random forest. Model evaluation was performed using ten-fold cross-validation, with performance metrics including receiver operating characteristic (ROC) curves, area under the curve (AUC) values, specificity, sensitivity, accuracy, and F1-score. DeLong's test was used to compare AUC values and identify the optimal predictive model.Results: Baseline characteristics showed no significant differences between the training and validation sets (P>0.05). Analysis of factors associated with upper limb dysfunction in the training set revealed significant differences in variables such as age, BMI, cancer type, axillary lymph node dissection, ipsilateral radiotherapy, postoperative rehabilitation plans, and monthly per capita household income (P<0.05). Low correlations were observed among these variables (R values close to 0), indicating minimal multicollinearity. Model performance evaluation showed that the XGBOOST and random forest models demonstrated high AUC values (0.817-0.884) across both the training and validation sets. These models also exhibited superior specificity and sensitivity, indicating strong predictive performance and robustness in identifying patients at risk of postoperative upper limb dysfunction.Conclusion: The XGBOOST and random forest models exhibited excellent predictive accuracy, offering valuable tools for the early identification and personalized management of high-risk patients. These models provide critical data support for postoperative rehabilitation planning and contribute to improving the quality of life for breast cancer patients.","PeriodicalId":7731,"journal":{"name":"American journal of translational research","volume":"17 4","pages":"2614-2628"},"PeriodicalIF":1.7000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12082522/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of translational research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.62347/CZYA6232","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To develop and validate a predictive tool using machine learning models for identifying risk factors for upper limb dysfunction following modified radical mastectomy (MRM) in breast cancer patients.

Methods: A total of 768 breast cancer patients who underwent Modified radical mastectomy (MRM) between January 2022 and December 2023 were included in this study. The dataset was divided into a training set (506 cases) and a validation set (262 cases). The collected data encompassed demographic characteristics, clinicopathological features, medical history, and postoperative rehabilitation plans. Predictive analyses were conducted using machine learning models, including support vector machine (SVM), extreme gradient boosting (XGBOOST), Gaussian naïve Bayes (GNB), adaptive boosting (ADABOOST), and random forest. Model evaluation was performed using ten-fold cross-validation, with performance metrics including receiver operating characteristic (ROC) curves, area under the curve (AUC) values, specificity, sensitivity, accuracy, and F1-score. DeLong's test was used to compare AUC values and identify the optimal predictive model.

Results: Baseline characteristics showed no significant differences between the training and validation sets (P>0.05). Analysis of factors associated with upper limb dysfunction in the training set revealed significant differences in variables such as age, BMI, cancer type, axillary lymph node dissection, ipsilateral radiotherapy, postoperative rehabilitation plans, and monthly per capita household income (P<0.05). Low correlations were observed among these variables (R values close to 0), indicating minimal multicollinearity. Model performance evaluation showed that the XGBOOST and random forest models demonstrated high AUC values (0.817-0.884) across both the training and validation sets. These models also exhibited superior specificity and sensitivity, indicating strong predictive performance and robustness in identifying patients at risk of postoperative upper limb dysfunction.

Conclusion: The XGBOOST and random forest models exhibited excellent predictive accuracy, offering valuable tools for the early identification and personalized management of high-risk patients. These models provide critical data support for postoperative rehabilitation planning and contribute to improving the quality of life for breast cancer patients.

查看原文本刊更多论文

乳腺癌改良乳房根治术后上肢功能障碍危险因素的回归分析与验证。

目的：开发并验证一种使用机器学习模型的预测工具，用于识别乳腺癌患者改良根治性乳房切除术（MRM）后上肢功能障碍的危险因素。方法：在2022年1月至2023年12月期间接受改良根治性乳房切除术（MRM）的768例乳腺癌患者纳入本研究。数据集分为训练集（506例）和验证集（262例）。收集的数据包括人口统计学特征、临床病理特征、病史和术后康复计划。使用机器学习模型进行预测分析，包括支持向量机（SVM）、极端梯度增强（XGBOOST）、高斯naïve贝叶斯（GNB）、自适应增强（ADABOOST）和随机森林。采用10倍交叉验证进行模型评价，性能指标包括受试者工作特征（ROC）曲线、曲线下面积（AUC）值、特异性、敏感性、准确性和f1评分。采用DeLong检验法比较AUC值，确定最优预测模型。结果：基线特征在训练集和验证集之间无显著差异（P < 0.05）。对训练集中与上肢功能障碍相关的因素进行分析，发现年龄、BMI、肿瘤类型、腋窝淋巴结清扫、同侧放疗、术后康复计划、家庭人均月收入等变量存在显著差异(p)。结论：XGBOOST和随机森林模型具有良好的预测准确性，为高危患者的早期识别和个性化管理提供了有价值的工具。这些模型为乳腺癌术后康复规划提供了重要的数据支持，有助于提高乳腺癌患者的生活质量。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

American journal of translational research ONCOLOGY-MEDICINE, RESEARCH & EXPERIMENTAL

自引率

0.00%

发文量

552

期刊介绍： Information not localized