使用机器学习算法对乳腺癌相关淋巴水肿进行预测建模。

IF 1.5 3区 医学 Q3 SURGERY
Gland surgery Pub Date : 2024-12-31 Epub Date: 2024-12-27 DOI:10.21037/gs-24-252
Yang Sun, Xiaomin Xia, Xia Liu
{"title":"使用机器学习算法对乳腺癌相关淋巴水肿进行预测建模。","authors":"Yang Sun, Xiaomin Xia, Xia Liu","doi":"10.21037/gs-24-252","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.</p><p><strong>Methods: </strong>Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.</p><p><strong>Results: </strong>Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.</p><p><strong>Conclusions: </strong>The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence.</p>","PeriodicalId":12760,"journal":{"name":"Gland surgery","volume":"13 12","pages":"2243-2252"},"PeriodicalIF":1.5000,"publicationDate":"2024-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11733644/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predictive modeling of breast cancer-related lymphedema using machine learning algorithms.\",\"authors\":\"Yang Sun, Xiaomin Xia, Xia Liu\",\"doi\":\"10.21037/gs-24-252\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.</p><p><strong>Methods: </strong>Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.</p><p><strong>Results: </strong>Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.</p><p><strong>Conclusions: </strong>The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence.</p>\",\"PeriodicalId\":12760,\"journal\":{\"name\":\"Gland surgery\",\"volume\":\"13 12\",\"pages\":\"2243-2252\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11733644/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Gland surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/gs-24-252\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/27 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"SURGERY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Gland surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/gs-24-252","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/27 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0

摘要

背景:乳腺癌相关性淋巴水肿(Breast cancer-related lymphodema, BCRL)是乳腺癌术后常见的并发症之一。易导致肢体肿胀、变形和上肢功能障碍,严重影响患者的身心健康和生活质量。以往的研究多采用线性回归、逻辑回归等统计方法分析影响因素,但都有一定的局限性。机器学习(ML)是人工智能的一个重要分支,它可以有效地克服多元交互和共线性问题。本研究旨在探讨乳腺癌患者发生BCRL的影响因素,并在此基础上构建ML算法预测模型,验证其预测价值。方法:回顾性收集海南省肿瘤医院2018年9月至2024年5月收治的乳腺癌患者的临床资料。以BCRL作为结果度量,将数据按7:3的比例分为训练集和验证集。在训练集中,采用随机森林(RF)、支持向量机(SVM)和极限梯度提升(XGBoost)算法构建预测模型。采用受试者工作特征(ROC)曲线分析、敏感性、特异性和F1评分评价模型的鉴别准确性。采用校正曲线和Hosmer-Lemeshow (H-L)卡方检验对模型的校正进行评估。结果:筛选出符合纳入标准的240例患者,按7:3的比例随机分为训练组(168例)和验证组(72例)。在训练集中,44例发生BCRL, 124例未发生。结论:基于ml的XGBoost模型预测BCRL具有良好的性能,可帮助医护人员快速准确地评估BCRL发生的风险。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive modeling of breast cancer-related lymphedema using machine learning algorithms.

Background: Breast cancer-related lymphedema (BCRL) is one of the common complications after breast cancer surgery. It can easily lead to limb swelling, deformation and upper limb dysfunction, which has a serious impact on the physical and mental health and quality of life of patients. Previous studies have mostly used statistical methods such as linear regression and logistic regression to analyze the influencing factors, but all of them have certain limitations. Machine learning (ML) is an important branch of artificial intelligence, which can effectively overcome the problems of multivariate interaction and collinearity. This study aimed to explore the influencing factors for the occurrence of BCRL in breast cancer patients, and construct a predictive model with ML algorithms and validate its predictive value on this basis.

Methods: Clinical data of breast cancer patients admitted to Hainan Cancer Hospital from September 2018 to May 2024 were retrospectively collected. BCRL was considered as the outcome measurement, and the data were divided into training and validation sets in a ratio of 7:3. In the training set, random forest (RF), support vector machine (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms were used to construct predictive models. The discrimination accuracy of the models was evaluated with receiver operating characteristic (ROC) curve analysis, sensitivity, specificity, and F1 score. The calibration of the models was assessed using calibration curves and the Hosmer-Lemeshow (H-L) Chi-squared test.

Results: Two hundred and forty patients who met the inclusion criteria were screened, and they were randomly divided into a training set (168 patients) and a validation set (72 patients) in a 7:3 ratio. In the training set, 44 cases developed BCRL, while 124 did not. There were statistically significant differences (P<0.05) in hypertension history, number of dissected lymph nodes, postoperative complications, postoperative functional exercises, chemotherapy, radiotherapy, tumor node metastasis (TNM) stage, and level of axillary lymph node dissection between the BCRL and non-BCRL groups. Among the four models, the XGBoost model showed the best predictive performance, with an area under the curve (AUC) of 0.99 in the training set and 0.89 in the validation set. The XGBoost model demonstrated good calibration in both the training and validation sets, showing good consistency with the ideal model.

Conclusions: The ML-based XGBoost model for predicting BCRL exhibits excellent performance and assists healthcare professionals in rapidly and accurately assessing the risk of BCRL occurrence.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Gland surgery
Gland surgery Medicine-Surgery
CiteScore
3.60
自引率
0.00%
发文量
113
期刊介绍: Gland Surgery (Gland Surg; GS, Print ISSN 2227-684X; Online ISSN 2227-8575) being indexed by PubMed/PubMed Central, is an open access, peer-review journal launched at May of 2012, published bio-monthly since February 2015.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信