基于氧化应激预测老年食管鳞癌患者 3 年生存率的机器学习方法比较

IF 3.4 2区 医学 Q2 ONCOLOGY
Jin-Biao Xie, Shi-Jie Huang, Tian-Bao Yang, Wu Wang, Bo-Yang Chen, Lianyi Guo
{"title":"基于氧化应激预测老年食管鳞癌患者 3 年生存率的机器学习方法比较","authors":"Jin-Biao Xie, Shi-Jie Huang, Tian-Bao Yang, Wu Wang, Bo-Yang Chen, Lianyi Guo","doi":"10.1186/s12885-024-13115-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Oxidative stress process plays a key role in aging and cancer; however, currently, there is paucity of machine-learning model studies investigating the relationship between oxidative stress and prognosis of elderly patients with esophageal squamous cancer (ESCC).</p><p><strong>Methods: </strong>This study included elderly patients with ESCC who underwent curative ESCC resection surgery continuously from January 2013 to December 2020 and were stratified into the training and external validation cohorts. Using Cox stepwise regression analysis based on Akaike information criterion, the relationship between oxidative stress biomarkers and prognosis was explored, and a geriatric ESCC-related oxidative stress score (OSS) was constructed. To construct a predictive model for 3-year overall survival (OS), machine-learning strategies including decision tree (DT), random forest (RF), and support vector machine (SVM) were employed. These machine-learning strategies play a key role in data mining and pattern recognition tasks. Each model was tested in the external validation cohort through 1000 resampling iterations. Validation was conducted using receiver operating characteristic area under the curve (AUC) and calibration plots.</p><p><strong>Results: </strong>The training cohort and validation cohort consisted of 340 and 145 patients, respectively. In the training cohort, the 3-year OS rate for patients was 59.2%. We constructed the OSS based on systemic oxidative stress biomarkers using the training cohort. The study found that pathological N stage, pathological T stage, tumor histological type, lymphovascular invasion, CEA, OSS, CA 19 - 9, and the amount of bleeding were the most important factors influencing the 3-year OS. These eight important features were included in training the RF, DT, and SVM and trained on the training cohort and validated cohort, respectively. In the training cohort, the RF model demonstrated the highest predictive performance with an AUC of 0.975 (0.962-0.987), while the DT model is 0.784 (0.739-0.830) and the SVM is 0.879 (0.843-0.916). In the external validation cohort, the RF model again exhibited the highest performance with an AUC of 0.791 (0.717-0.864), compared to the DT model with an AUC of 0.717 (0.640-0.794) and 0.779 (0.702-0.856) in SVM.</p><p><strong>Conclusions: </strong>The random forest clinical prediction model constructed based on OSS can effectively predict the prognosis of elderly patients with ESCC after curative surgery.</p>","PeriodicalId":9131,"journal":{"name":"BMC Cancer","volume":"24 1","pages":"1432"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11580478/pdf/","citationCount":"0","resultStr":"{\"title\":\"Comparison of machine learning methods for Predicting 3-Year survival in elderly esophageal squamous cancer patients based on oxidative stress.\",\"authors\":\"Jin-Biao Xie, Shi-Jie Huang, Tian-Bao Yang, Wu Wang, Bo-Yang Chen, Lianyi Guo\",\"doi\":\"10.1186/s12885-024-13115-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Oxidative stress process plays a key role in aging and cancer; however, currently, there is paucity of machine-learning model studies investigating the relationship between oxidative stress and prognosis of elderly patients with esophageal squamous cancer (ESCC).</p><p><strong>Methods: </strong>This study included elderly patients with ESCC who underwent curative ESCC resection surgery continuously from January 2013 to December 2020 and were stratified into the training and external validation cohorts. Using Cox stepwise regression analysis based on Akaike information criterion, the relationship between oxidative stress biomarkers and prognosis was explored, and a geriatric ESCC-related oxidative stress score (OSS) was constructed. To construct a predictive model for 3-year overall survival (OS), machine-learning strategies including decision tree (DT), random forest (RF), and support vector machine (SVM) were employed. These machine-learning strategies play a key role in data mining and pattern recognition tasks. Each model was tested in the external validation cohort through 1000 resampling iterations. Validation was conducted using receiver operating characteristic area under the curve (AUC) and calibration plots.</p><p><strong>Results: </strong>The training cohort and validation cohort consisted of 340 and 145 patients, respectively. In the training cohort, the 3-year OS rate for patients was 59.2%. We constructed the OSS based on systemic oxidative stress biomarkers using the training cohort. The study found that pathological N stage, pathological T stage, tumor histological type, lymphovascular invasion, CEA, OSS, CA 19 - 9, and the amount of bleeding were the most important factors influencing the 3-year OS. These eight important features were included in training the RF, DT, and SVM and trained on the training cohort and validated cohort, respectively. In the training cohort, the RF model demonstrated the highest predictive performance with an AUC of 0.975 (0.962-0.987), while the DT model is 0.784 (0.739-0.830) and the SVM is 0.879 (0.843-0.916). In the external validation cohort, the RF model again exhibited the highest performance with an AUC of 0.791 (0.717-0.864), compared to the DT model with an AUC of 0.717 (0.640-0.794) and 0.779 (0.702-0.856) in SVM.</p><p><strong>Conclusions: </strong>The random forest clinical prediction model constructed based on OSS can effectively predict the prognosis of elderly patients with ESCC after curative surgery.</p>\",\"PeriodicalId\":9131,\"journal\":{\"name\":\"BMC Cancer\",\"volume\":\"24 1\",\"pages\":\"1432\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11580478/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Cancer\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12885-024-13115-7\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12885-024-13115-7","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:氧化应激过程在衰老和癌症中起着关键作用;然而,目前很少有机器学习模型研究探讨氧化应激与老年食管鳞癌(ESCC)患者预后之间的关系:本研究纳入了2013年1月至2020年12月期间连续接受治愈性ESCC切除手术的老年ESCC患者,并将其分为训练队列和外部验证队列。采用基于 Akaike 信息准则的 Cox 逐步回归分析,探讨氧化应激生物标志物与预后之间的关系,并构建了老年 ESCC 相关氧化应激评分(OSS)。为了构建3年总生存率(OS)预测模型,研究人员采用了机器学习策略,包括决策树(DT)、随机森林(RF)和支持向量机(SVM)。这些机器学习策略在数据挖掘和模式识别任务中发挥着关键作用。每个模型都在外部验证队列中进行了 1000 次重采样迭代测试。使用接收器操作特征曲线下面积(AUC)和校准图进行验证:训练队列和验证队列分别由 340 名和 145 名患者组成。在训练队列中,患者的 3 年 OS 率为 59.2%。我们利用训练队列构建了基于全身氧化应激生物标志物的OSS。研究发现,病理 N 分期、病理 T 分期、肿瘤组织学类型、淋巴管侵犯、CEA、OSS、CA 19 - 9 和出血量是影响 3 年 OS 的最重要因素。这八个重要特征被纳入了 RF、DT 和 SVM 的训练中,并分别在训练队列和验证队列中进行了训练。在训练队列中,RF 模型的预测性能最高,AUC 为 0.975(0.962-0.987),DT 模型为 0.784(0.739-0.830),SVM 为 0.879(0.843-0.916)。在外部验证队列中,RF 模型再次表现出最高的性能,AUC 为 0.791(0.717-0.864),而 DT 模型的 AUC 为 0.717(0.640-0.794),SVM 为 0.779(0.702-0.856):基于OSS构建的随机森林临床预测模型能有效预测老年ESCC患者治愈性手术后的预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Comparison of machine learning methods for Predicting 3-Year survival in elderly esophageal squamous cancer patients based on oxidative stress.

Background: Oxidative stress process plays a key role in aging and cancer; however, currently, there is paucity of machine-learning model studies investigating the relationship between oxidative stress and prognosis of elderly patients with esophageal squamous cancer (ESCC).

Methods: This study included elderly patients with ESCC who underwent curative ESCC resection surgery continuously from January 2013 to December 2020 and were stratified into the training and external validation cohorts. Using Cox stepwise regression analysis based on Akaike information criterion, the relationship between oxidative stress biomarkers and prognosis was explored, and a geriatric ESCC-related oxidative stress score (OSS) was constructed. To construct a predictive model for 3-year overall survival (OS), machine-learning strategies including decision tree (DT), random forest (RF), and support vector machine (SVM) were employed. These machine-learning strategies play a key role in data mining and pattern recognition tasks. Each model was tested in the external validation cohort through 1000 resampling iterations. Validation was conducted using receiver operating characteristic area under the curve (AUC) and calibration plots.

Results: The training cohort and validation cohort consisted of 340 and 145 patients, respectively. In the training cohort, the 3-year OS rate for patients was 59.2%. We constructed the OSS based on systemic oxidative stress biomarkers using the training cohort. The study found that pathological N stage, pathological T stage, tumor histological type, lymphovascular invasion, CEA, OSS, CA 19 - 9, and the amount of bleeding were the most important factors influencing the 3-year OS. These eight important features were included in training the RF, DT, and SVM and trained on the training cohort and validated cohort, respectively. In the training cohort, the RF model demonstrated the highest predictive performance with an AUC of 0.975 (0.962-0.987), while the DT model is 0.784 (0.739-0.830) and the SVM is 0.879 (0.843-0.916). In the external validation cohort, the RF model again exhibited the highest performance with an AUC of 0.791 (0.717-0.864), compared to the DT model with an AUC of 0.717 (0.640-0.794) and 0.779 (0.702-0.856) in SVM.

Conclusions: The random forest clinical prediction model constructed based on OSS can effectively predict the prognosis of elderly patients with ESCC after curative surgery.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Cancer
BMC Cancer 医学-肿瘤学
CiteScore
6.00
自引率
2.60%
发文量
1204
审稿时长
6.8 months
期刊介绍: BMC Cancer is an open access, peer-reviewed journal that considers articles on all aspects of cancer research, including the pathophysiology, prevention, diagnosis and treatment of cancers. The journal welcomes submissions concerning molecular and cellular biology, genetics, epidemiology, and clinical trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信