Development and validation of machine learning models for young-onset colorectal cancer risk stratification

IF 6.8 1区 医学 Q1 ONCOLOGY
Junhai Zhen, Jiao Li, Fei Liao, Jixiang Zhang, Chuan Liu, Huabing Xie, Cheng Tan, Weiguo Dong
{"title":"Development and validation of machine learning models for young-onset colorectal cancer risk stratification","authors":"Junhai Zhen, Jiao Li, Fei Liao, Jixiang Zhang, Chuan Liu, Huabing Xie, Cheng Tan, Weiguo Dong","doi":"10.1038/s41698-024-00719-2","DOIUrl":null,"url":null,"abstract":"Incidence of young-onset colorectal cancer (YOCRC, younger than 50) has significantly increased worldwide. The performance of fecal immunochemical test in detecting YOCRC is unsatisfactory. Using routine clinical data, we aimed to develop machine learning (ML) models to identify individuals with high-risk YOCRC who require further colonoscopy. We retrospectively extracted data of 10,874 young individuals. Multiple supervised ML techniques were devised to distinguish individuals with and without CRC, classifiers were trained, internally validated and temporally validated. In internal validation cohort, Random Forest (RF) ML model demonstrated good performance with AUC of 0.859 and highest recall of 0.840. In temporal validation cohort, the RF ML model also exhibited good classification performance, achieving AUC of 0.888 and highest recall of 0.872. RF algorithm-based approach is effective and feasible in YOCRC risk stratification. This could be valuable in assessing the risk of YOCRC so that clinical management, including further colonoscopy, can be subsequently made. (Registration: This study was registered with ClinicalTrials.gov (NCT06342622) on March 15, 2024.).","PeriodicalId":19433,"journal":{"name":"NPJ Precision Oncology","volume":" ","pages":"1-14"},"PeriodicalIF":6.8000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11496529/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Precision Oncology","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41698-024-00719-2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Incidence of young-onset colorectal cancer (YOCRC, younger than 50) has significantly increased worldwide. The performance of fecal immunochemical test in detecting YOCRC is unsatisfactory. Using routine clinical data, we aimed to develop machine learning (ML) models to identify individuals with high-risk YOCRC who require further colonoscopy. We retrospectively extracted data of 10,874 young individuals. Multiple supervised ML techniques were devised to distinguish individuals with and without CRC, classifiers were trained, internally validated and temporally validated. In internal validation cohort, Random Forest (RF) ML model demonstrated good performance with AUC of 0.859 and highest recall of 0.840. In temporal validation cohort, the RF ML model also exhibited good classification performance, achieving AUC of 0.888 and highest recall of 0.872. RF algorithm-based approach is effective and feasible in YOCRC risk stratification. This could be valuable in assessing the risk of YOCRC so that clinical management, including further colonoscopy, can be subsequently made. (Registration: This study was registered with ClinicalTrials.gov (NCT06342622) on March 15, 2024.).

Abstract Image

开发和验证用于年轻发病结直肠癌风险分层的机器学习模型。
全球年轻发病结直肠癌(YOCRC,50 岁以下)的发病率大幅上升。粪便免疫化学检验在检测 YOCRC 方面的效果并不理想。利用常规临床数据,我们旨在开发机器学习(ML)模型,以识别需要进一步进行结肠镜检查的高风险 YOCRC 患者。我们回顾性地提取了 10,874 名年轻人的数据。我们设计了多种有监督的 ML 技术来区分是否患有 CRC,并对分类器进行了训练、内部验证和时间验证。在内部验证队列中,随机森林(RF)ML 模型表现良好,AUC 为 0.859,最高召回率为 0.840。在时间验证队列中,RF ML 模型也表现出良好的分类性能,AUC 达到 0.888,最高召回率为 0.872。基于射频算法的方法在 YOCRC 风险分层中是有效和可行的。这对评估YOCRC的风险很有价值,以便随后进行包括进一步结肠镜检查在内的临床管理。(注册:本研究已于2024年3月15日在ClinicalTrials.gov(NCT06342622)注册)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
9.90
自引率
1.30%
发文量
87
审稿时长
18 weeks
期刊介绍: Online-only and open access, npj Precision Oncology is an international, peer-reviewed journal dedicated to showcasing cutting-edge scientific research in all facets of precision oncology, spanning from fundamental science to translational applications and clinical medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信