基于SEER数据库的机器学习肺癌生存预测模型的研究进展。

IF 1.9 4区 医学 Q3 ONCOLOGY
Ye Zhang, Jiaye Wang, Shiyu Hu, Yufen Xu, Qi Yang, Wenyu Chen
{"title":"基于SEER数据库的机器学习肺癌生存预测模型的研究进展。","authors":"Ye Zhang, Jiaye Wang, Shiyu Hu, Yufen Xu, Qi Yang, Wenyu Chen","doi":"10.1080/07357907.2025.2563716","DOIUrl":null,"url":null,"abstract":"<p><p>The SEER (Surveillance, Epidemiology, and End Results) database, a comprehensive public repository of clinical oncology data, has been increasingly used to construct clinical prediction models for predicting the prognosis of cancer. With the advances in machine learning, various algorithms including logistic regression (LR), support vector machines (SVM), decision trees (DT), random forest (RF), artificial neural networks (ANN), and extreme gradient boosting (XGBoost) have been successively employed in the development of lung cancer survival prediction models (LCSPMs). This study combs through the progress of these machine learning algorithms in constructing lung cancer survival prediction models, points out the problems of data imbalance, poor model interpretability, and lack of external validation, and clarifies the future development direction.</p>","PeriodicalId":9463,"journal":{"name":"Cancer Investigation","volume":" ","pages":"1-12"},"PeriodicalIF":1.9000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Progress in Development of Lung Cancer Survival Prediction Models Using Machine Learning Based on SEER Database.\",\"authors\":\"Ye Zhang, Jiaye Wang, Shiyu Hu, Yufen Xu, Qi Yang, Wenyu Chen\",\"doi\":\"10.1080/07357907.2025.2563716\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The SEER (Surveillance, Epidemiology, and End Results) database, a comprehensive public repository of clinical oncology data, has been increasingly used to construct clinical prediction models for predicting the prognosis of cancer. With the advances in machine learning, various algorithms including logistic regression (LR), support vector machines (SVM), decision trees (DT), random forest (RF), artificial neural networks (ANN), and extreme gradient boosting (XGBoost) have been successively employed in the development of lung cancer survival prediction models (LCSPMs). This study combs through the progress of these machine learning algorithms in constructing lung cancer survival prediction models, points out the problems of data imbalance, poor model interpretability, and lack of external validation, and clarifies the future development direction.</p>\",\"PeriodicalId\":9463,\"journal\":{\"name\":\"Cancer Investigation\",\"volume\":\" \",\"pages\":\"1-12\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cancer Investigation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1080/07357907.2025.2563716\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/07357907.2025.2563716","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

SEER(监测、流行病学和最终结果)数据库是临床肿瘤学数据的综合公共存储库,已越来越多地用于构建预测癌症预后的临床预测模型。随着机器学习技术的进步,各种算法,包括逻辑回归(LR)、支持向量机(SVM)、决策树(DT)、随机森林(RF)、人工神经网络(ANN)和极端梯度增强(XGBoost),相继被用于肺癌生存预测模型(LCSPMs)的开发。本研究梳理了这些机器学习算法在构建肺癌生存预测模型方面的进展,指出了数据不平衡、模型可解释性差、缺乏外部验证等问题,明确了未来的发展方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Progress in Development of Lung Cancer Survival Prediction Models Using Machine Learning Based on SEER Database.

The SEER (Surveillance, Epidemiology, and End Results) database, a comprehensive public repository of clinical oncology data, has been increasingly used to construct clinical prediction models for predicting the prognosis of cancer. With the advances in machine learning, various algorithms including logistic regression (LR), support vector machines (SVM), decision trees (DT), random forest (RF), artificial neural networks (ANN), and extreme gradient boosting (XGBoost) have been successively employed in the development of lung cancer survival prediction models (LCSPMs). This study combs through the progress of these machine learning algorithms in constructing lung cancer survival prediction models, points out the problems of data imbalance, poor model interpretability, and lack of external validation, and clarifies the future development direction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Cancer Investigation
Cancer Investigation 医学-肿瘤学
CiteScore
3.80
自引率
4.20%
发文量
71
审稿时长
8.5 months
期刊介绍: Cancer Investigation is one of the most highly regarded and recognized journals in the field of basic and clinical oncology. It is designed to give physicians a comprehensive resource on the current state of progress in the cancer field as well as a broad background of reliable information necessary for effective decision making. In addition to presenting original papers of fundamental significance, it also publishes reviews, essays, specialized presentations of controversies, considerations of new technologies and their applications to specific laboratory problems, discussions of public issues, miniseries on major topics, new and experimental drugs and therapies, and an innovative letters to the editor section. One of the unique features of the journal is its departmentalized editorial sections reporting on more than 30 subject categories covering the broad spectrum of specialized areas that together comprise the field of oncology. Edited by leading physicians and research scientists, these sections make Cancer Investigation the prime resource for clinicians seeking to make sense of the sometimes-overwhelming amount of information available throughout the field. In addition to its peer-reviewed clinical research, the journal also features translational studies that bridge the gap between the laboratory and the clinic.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信