Development of prediction models for liver metastasis in colorectal cancer based on machine learning: a population-level study.

IF 1.5 4区 医学 Q4 ONCOLOGY
Translational cancer research Pub Date : 2024-11-30 Epub Date: 2024-11-18 DOI:10.21037/tcr-24-1194
Yuncan Xing, Guanhua Yu, Zheng Jiang, Zheng Wang
{"title":"Development of prediction models for liver metastasis in colorectal cancer based on machine learning: a population-level study.","authors":"Yuncan Xing, Guanhua Yu, Zheng Jiang, Zheng Wang","doi":"10.21037/tcr-24-1194","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Liver metastasis (LM) is of vital importance in making treatment-related decisions in patients with colorectal cancer (CRC). The aim of our study was to develop and validate prediction models for LM in CRC by making use of machine learning.</p><p><strong>Methods: </strong>We selected patients diagnosed with CRC from 2010 to 2015 from the Surveillance, Epidemiology, and End Results (SEER) database. Four machine-learning methods, eXtreme gradient boost (XGB), decision tree (DT), random forest (RF), and support vector machine (SVM), were employed to develop a predictive model. The receiver operating characteristic (ROC) curves, decision curve analysis (DCA) curves and calibration curves were adopted to evaluate the model performance. The SHapley Additive exPlanation (SHAP) technique was chosen for visual analysis to enhance the interpretation of the outcomes of models.</p><p><strong>Results: </strong>A total of 51,632 patients suffering from CRC were selected from the SEER database. Excellent accuracy of machine learning models was showed from ROC curves. In both the training and validation cohorts, calibration curves for the likelihood of LM demonstrated a high degree of concordance between model prediction and actual observation. The DCA indicated that each machine learning model can yield net benefits for both treat-none and treat-all strategies. Carcinoembryonic antigen (CEA) and N stage were identified as the most significant risk factors for LM based on the SHAP summary plot of the RF and XGB models.</p><p><strong>Conclusions: </strong>The XGB and RF were the best machine learning models among the four algorithms, of which CEA and N stage were identified as the most important risk factors related to LM.</p>","PeriodicalId":23216,"journal":{"name":"Translational cancer research","volume":"13 11","pages":"5943-5952"},"PeriodicalIF":1.5000,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651848/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tcr-24-1194","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/18 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Liver metastasis (LM) is of vital importance in making treatment-related decisions in patients with colorectal cancer (CRC). The aim of our study was to develop and validate prediction models for LM in CRC by making use of machine learning.

Methods: We selected patients diagnosed with CRC from 2010 to 2015 from the Surveillance, Epidemiology, and End Results (SEER) database. Four machine-learning methods, eXtreme gradient boost (XGB), decision tree (DT), random forest (RF), and support vector machine (SVM), were employed to develop a predictive model. The receiver operating characteristic (ROC) curves, decision curve analysis (DCA) curves and calibration curves were adopted to evaluate the model performance. The SHapley Additive exPlanation (SHAP) technique was chosen for visual analysis to enhance the interpretation of the outcomes of models.

Results: A total of 51,632 patients suffering from CRC were selected from the SEER database. Excellent accuracy of machine learning models was showed from ROC curves. In both the training and validation cohorts, calibration curves for the likelihood of LM demonstrated a high degree of concordance between model prediction and actual observation. The DCA indicated that each machine learning model can yield net benefits for both treat-none and treat-all strategies. Carcinoembryonic antigen (CEA) and N stage were identified as the most significant risk factors for LM based on the SHAP summary plot of the RF and XGB models.

Conclusions: The XGB and RF were the best machine learning models among the four algorithms, of which CEA and N stage were identified as the most important risk factors related to LM.

求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.10
自引率
0.00%
发文量
252
期刊介绍: Translational Cancer Research (Transl Cancer Res TCR; Print ISSN: 2218-676X; Online ISSN 2219-6803; http://tcr.amegroups.com/) is an Open Access, peer-reviewed journal, indexed in Science Citation Index Expanded (SCIE). TCR publishes laboratory studies of novel therapeutic interventions as well as clinical trials which evaluate new treatment paradigms for cancer; results of novel research investigations which bridge the laboratory and clinical settings including risk assessment, cellular and molecular characterization, prevention, detection, diagnosis and treatment of human cancers with the overall goal of improving the clinical care of cancer patients. The focus of TCR is original, peer-reviewed, science-based research that successfully advances clinical medicine toward the goal of improving patients'' quality of life. The editors and an international advisory group of scientists and clinician-scientists as well as other experts will hold TCR articles to the high-quality standards. We accept Original Articles as well as Review Articles, Editorials and Brief Articles.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信