Yuepeng Cao, Nannan Wang, Xuxiaochen Wu, Wanxiangfu Tang, Hua Bao, Chengshuai Si, Peng Shao, Dongzheng Li, Xin Zhou, Dongqin Zhu, Shanshan Yang, Fufeng Wang, Guoqing Su, Ke Wang, Qifan Wang, Yao Zhang, Qiangcheng Wang, Dongsheng Yu, Qian Jiang, Jun Bao, Liu Yang
{"title":"Multidimensional Fragmentomics Enables Early and Accurate Detection of Colorectal Cancer.","authors":"Yuepeng Cao, Nannan Wang, Xuxiaochen Wu, Wanxiangfu Tang, Hua Bao, Chengshuai Si, Peng Shao, Dongzheng Li, Xin Zhou, Dongqin Zhu, Shanshan Yang, Fufeng Wang, Guoqing Su, Ke Wang, Qifan Wang, Yao Zhang, Qiangcheng Wang, Dongsheng Yu, Qian Jiang, Jun Bao, Liu Yang","doi":"10.1158/0008-5472.CAN-23-3486","DOIUrl":null,"url":null,"abstract":"<p><p>Colorectal cancer is frequently diagnosed in advanced stages, highlighting the need for developing approaches for early detection. Liquid biopsy using cell-free DNA (cfDNA) fragmentomics is a promising approach, but the clinical application is hindered by complexity and cost. This study aimed to develop an integrated model using cfDNA fragmentomics for accurate, cost-effective early-stage colorectal cancer detection. Plasma cfDNA was extracted and sequenced from a training cohort of 360 participants, including 176 patients with colorectal cancer and 184 healthy controls. An ensemble stacked model comprising five machine learning models was employed to distinguish patients with colorectal cancer from healthy controls using five cfDNA fragmentomic features. The model was validated in an independent cohort of 236 participants (117 patients with colorectal cancer and 119 controls) and a prospective cohort of 242 participants (129 patients with colorectal cancer and 113 controls). The ensemble stacked model showed remarkable discriminatory power between patients with colorectal cancer and controls, outperforming all base models and achieving a high area under the receiver operating characteristic curve of 0.986 in the validation cohort. It reached 94.88% sensitivity and 98% specificity for detecting colorectal cancer in the validation cohort, with sensitivity increasing as the cancer progressed. The model also demonstrated consistently high accuracy in within-run and between-run tests and across various conditions in healthy individuals. In the prospective cohort, it achieved 91.47% sensitivity and 95.58% specificity. This integrated model capitalizes on the multiplex nature of cfDNA fragmentomics to achieve high sensitivity and robustness, offering significant promise for early colorectal cancer detection and broad patient benefit. Significance: The development of a minimally invasive, efficient approach for early colorectal cancer detection using advanced machine learning to analyze cfDNA fragment patterns could expedite diagnosis and improve treatment outcomes for patients. See related commentary by Rolfo and Russo, p. 3128.</p>","PeriodicalId":9441,"journal":{"name":"Cancer research","volume":" ","pages":"3286-3295"},"PeriodicalIF":12.5000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1158/0008-5472.CAN-23-3486","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Colorectal cancer is frequently diagnosed in advanced stages, highlighting the need for developing approaches for early detection. Liquid biopsy using cell-free DNA (cfDNA) fragmentomics is a promising approach, but the clinical application is hindered by complexity and cost. This study aimed to develop an integrated model using cfDNA fragmentomics for accurate, cost-effective early-stage colorectal cancer detection. Plasma cfDNA was extracted and sequenced from a training cohort of 360 participants, including 176 patients with colorectal cancer and 184 healthy controls. An ensemble stacked model comprising five machine learning models was employed to distinguish patients with colorectal cancer from healthy controls using five cfDNA fragmentomic features. The model was validated in an independent cohort of 236 participants (117 patients with colorectal cancer and 119 controls) and a prospective cohort of 242 participants (129 patients with colorectal cancer and 113 controls). The ensemble stacked model showed remarkable discriminatory power between patients with colorectal cancer and controls, outperforming all base models and achieving a high area under the receiver operating characteristic curve of 0.986 in the validation cohort. It reached 94.88% sensitivity and 98% specificity for detecting colorectal cancer in the validation cohort, with sensitivity increasing as the cancer progressed. The model also demonstrated consistently high accuracy in within-run and between-run tests and across various conditions in healthy individuals. In the prospective cohort, it achieved 91.47% sensitivity and 95.58% specificity. This integrated model capitalizes on the multiplex nature of cfDNA fragmentomics to achieve high sensitivity and robustness, offering significant promise for early colorectal cancer detection and broad patient benefit. Significance: The development of a minimally invasive, efficient approach for early colorectal cancer detection using advanced machine learning to analyze cfDNA fragment patterns could expedite diagnosis and improve treatment outcomes for patients. See related commentary by Rolfo and Russo, p. 3128.
期刊介绍:
Cancer Research, published by the American Association for Cancer Research (AACR), is a journal that focuses on impactful original studies, reviews, and opinion pieces relevant to the broad cancer research community. Manuscripts that present conceptual or technological advances leading to insights into cancer biology are particularly sought after. The journal also places emphasis on convergence science, which involves bridging multiple distinct areas of cancer research.
With primary subsections including Cancer Biology, Cancer Immunology, Cancer Metabolism and Molecular Mechanisms, Translational Cancer Biology, Cancer Landscapes, and Convergence Science, Cancer Research has a comprehensive scope. It is published twice a month and has one volume per year, with a print ISSN of 0008-5472 and an online ISSN of 1538-7445.
Cancer Research is abstracted and/or indexed in various databases and platforms, including BIOSIS Previews (R) Database, MEDLINE, Current Contents/Life Sciences, Current Contents/Clinical Medicine, Science Citation Index, Scopus, and Web of Science.