Predictive models based on machine learning for early recurrence and metastasis in postoperative patients with colorectal cancer

Qian Dong, Minghui Mo, Xia Huang, Xia Sun, Peipei Jia, Ting Wang, Cuiping Liu
{"title":"Predictive models based on machine learning for early recurrence and metastasis in postoperative patients with colorectal cancer","authors":"Qian Dong, Minghui Mo, Xia Huang, Xia Sun, Peipei Jia, Ting Wang, Cuiping Liu","doi":"10.1097/nr9.0000000000000051","DOIUrl":null,"url":null,"abstract":"\n \n \n To construct and validate a prediction model based on machine learning algorithms for early recurrence and metastasis in patients with colorectal cancer after surgery.\n \n \n \n This study employed a prospective cohort design. A total of 498 postoperative patients with colorectal cancer, treated at an affiliated hospital of Qingdao University, were recruited using convenience sampling from June to December 2021. Data were collected during outpatient visits and hospitalizations. The risk factors for early recurrence and metastasis of colorectal cancer were determined through multivariate logistic regression analysis in SPSS 26.0 software. Using Python 3.7.0 software, four machine learning algorithms (logistic regression, Support Vector Machine, XGBoost, and LightGBM) were used to develop and validate prediction models for early recurrence and metastasis of colorectal cancer after surgery.\n \n \n \n Of the 498 patients, 51 (10.24%) had early recurrence and metastasis. Multivariate logistic regression analysis showed that personal traits (family history of cancer, histological type, degree of tumor differentiation, number of positive lymph nodes, and T stage), behaviour and/or lifestyle (intake of refined grains, whole grains, fish, shrimp, crab, and nuts, as well as resilience), and interpersonal networks (social support) were all associated with early recurrence and metastasis of colorectal cancer (P<0.05). The logistic regression prediction model showed the best prediction performance out of the four models, with an accuracy rate of 0.920, specificity of 0.982, F1 of 0.495, AUC of 0.867, Kappa of 0.056, and Brier score of 0.067.\n \n \n \n Our findings suggest that a prediction model based on logistic regression could accurately and scientifically predict which patients are likely to experience early recurrence and metastasis, helping to lessen the burden for both patients and the healthcare system.\n","PeriodicalId":73407,"journal":{"name":"Interdisciplinary nursing research","volume":"40 3","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Interdisciplinary nursing research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1097/nr9.0000000000000051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

To construct and validate a prediction model based on machine learning algorithms for early recurrence and metastasis in patients with colorectal cancer after surgery. This study employed a prospective cohort design. A total of 498 postoperative patients with colorectal cancer, treated at an affiliated hospital of Qingdao University, were recruited using convenience sampling from June to December 2021. Data were collected during outpatient visits and hospitalizations. The risk factors for early recurrence and metastasis of colorectal cancer were determined through multivariate logistic regression analysis in SPSS 26.0 software. Using Python 3.7.0 software, four machine learning algorithms (logistic regression, Support Vector Machine, XGBoost, and LightGBM) were used to develop and validate prediction models for early recurrence and metastasis of colorectal cancer after surgery. Of the 498 patients, 51 (10.24%) had early recurrence and metastasis. Multivariate logistic regression analysis showed that personal traits (family history of cancer, histological type, degree of tumor differentiation, number of positive lymph nodes, and T stage), behaviour and/or lifestyle (intake of refined grains, whole grains, fish, shrimp, crab, and nuts, as well as resilience), and interpersonal networks (social support) were all associated with early recurrence and metastasis of colorectal cancer (P<0.05). The logistic regression prediction model showed the best prediction performance out of the four models, with an accuracy rate of 0.920, specificity of 0.982, F1 of 0.495, AUC of 0.867, Kappa of 0.056, and Brier score of 0.067. Our findings suggest that a prediction model based on logistic regression could accurately and scientifically predict which patients are likely to experience early recurrence and metastasis, helping to lessen the burden for both patients and the healthcare system.
基于机器学习的结直肠癌术后患者早期复发和转移预测模型
构建并验证基于机器学习算法的结直肠癌术后早期复发和转移预测模型。 本研究采用前瞻性队列设计。从2021年6月至12月,通过便利抽样法共招募了498名在青岛大学附属医院接受治疗的结直肠癌术后患者。数据收集于门诊和住院期间。通过SPSS 26.0软件进行多变量Logistic回归分析,确定结直肠癌早期复发和转移的风险因素。利用 Python 3.7.0 软件,使用四种机器学习算法(逻辑回归、支持向量机、XGBoost 和 LightGBM)开发并验证了结直肠癌术后早期复发和转移的预测模型。 在 498 例患者中,有 51 例(10.24%)出现早期复发和转移。多变量逻辑回归分析表明,个人特征(癌症家族史、组织学类型、肿瘤分化程度、阳性淋巴结数量和T分期)、行为和/或生活方式(精制谷物、全谷物、鱼、虾、蟹和坚果的摄入量以及复原力)以及人际网络(社会支持)均与结直肠癌的早期复发和转移有关(P<0.05)。在四个模型中,逻辑回归预测模型的预测效果最好,准确率为 0.920,特异性为 0.982,F1 为 0.495,AUC 为 0.867,Kappa 为 0.056,Brier 得分为 0.067。 我们的研究结果表明,基于逻辑回归的预测模型可以准确、科学地预测哪些患者可能会出现早期复发和转移,有助于减轻患者和医疗系统的负担。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信