Tae-Rim Lee, Jin Mo Ahn, Junnam Lee, Dasom Kim, Juntae Park, Byeong-Ho Jeong, Dongryul Oh, Sang Man Kim, Gyou-Chul Jung, Beom Hee Choi, Min-Jung Kwon, Mengchi Wang, Michael Salmans, Andrew Carson, Bryan Leatham, Kristin Fathe, Byung In Lee, Byoungsok Jung, Chang-Seok Ki, Young Sik Park, Eun-Hae Cho
{"title":"Integrating Plasma Cell-Free DNA Fragment End Motif and Size with Genomic Features Enables Lung Cancer Detection.","authors":"Tae-Rim Lee, Jin Mo Ahn, Junnam Lee, Dasom Kim, Juntae Park, Byeong-Ho Jeong, Dongryul Oh, Sang Man Kim, Gyou-Chul Jung, Beom Hee Choi, Min-Jung Kwon, Mengchi Wang, Michael Salmans, Andrew Carson, Bryan Leatham, Kristin Fathe, Byung In Lee, Byoungsok Jung, Chang-Seok Ki, Young Sik Park, Eun-Hae Cho","doi":"10.1158/0008-5472.CAN-24-1517","DOIUrl":null,"url":null,"abstract":"<p><p>Early detection of lung cancer is important for improving patient survival rates. Liquid biopsy using whole-genome sequencing of cell-free DNA (cfDNA) offers a promising avenue for lung cancer screening, providing a potential alternative or complementary approach to current screening modalities. Here, we aimed to develop and validate an approach by integrating fragment and genomic features of cfDNA to enhance lung cancer detection accuracy across diverse populations. Deep learning-based classifiers were trained using comprehensive cfDNA fragmentomic features from participants in multi-institutional studies, including a Korean discovery dataset (218 patients with lung cancer and 2,559 controls), a Korean validation dataset (111 patients with lung cancer and 1,136 controls), and an independent Caucasian validation cohort (50 patients with lung cancer and 50 controls). In the discovery dataset, classifiers using fragment end motif by size, a feature that captures both fragment end motif and size profiles, outperformed standalone fragment end motif and fragment size classifiers, achieving an area under the curve (AUC) of 0.917. The ensemble classifier integrating fragment end motif by size and genomic coverage achieved an improved performance, with an AUC of 0.937. This performance extended to the Korean validation dataset and demonstrated ethnic generalizability in the Caucasian validation cohort. Overall, the development of a deep learning-based classifier integrating cfDNA fragmentomic and genomic features in this study highlights the potential for accurate lung cancer detection across diverse populations. Significance: Evaluating fragment-based features and genomic coverage in cell-free DNA offers an accurate lung cancer screening method, promising improvements in early cancer detection and addressing challenges associated with current screening methods.</p>","PeriodicalId":9441,"journal":{"name":"Cancer research","volume":" ","pages":"1696-1707"},"PeriodicalIF":12.5000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1158/0008-5472.CAN-24-1517","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Early detection of lung cancer is important for improving patient survival rates. Liquid biopsy using whole-genome sequencing of cell-free DNA (cfDNA) offers a promising avenue for lung cancer screening, providing a potential alternative or complementary approach to current screening modalities. Here, we aimed to develop and validate an approach by integrating fragment and genomic features of cfDNA to enhance lung cancer detection accuracy across diverse populations. Deep learning-based classifiers were trained using comprehensive cfDNA fragmentomic features from participants in multi-institutional studies, including a Korean discovery dataset (218 patients with lung cancer and 2,559 controls), a Korean validation dataset (111 patients with lung cancer and 1,136 controls), and an independent Caucasian validation cohort (50 patients with lung cancer and 50 controls). In the discovery dataset, classifiers using fragment end motif by size, a feature that captures both fragment end motif and size profiles, outperformed standalone fragment end motif and fragment size classifiers, achieving an area under the curve (AUC) of 0.917. The ensemble classifier integrating fragment end motif by size and genomic coverage achieved an improved performance, with an AUC of 0.937. This performance extended to the Korean validation dataset and demonstrated ethnic generalizability in the Caucasian validation cohort. Overall, the development of a deep learning-based classifier integrating cfDNA fragmentomic and genomic features in this study highlights the potential for accurate lung cancer detection across diverse populations. Significance: Evaluating fragment-based features and genomic coverage in cell-free DNA offers an accurate lung cancer screening method, promising improvements in early cancer detection and addressing challenges associated with current screening methods.
期刊介绍:
Cancer Research, published by the American Association for Cancer Research (AACR), is a journal that focuses on impactful original studies, reviews, and opinion pieces relevant to the broad cancer research community. Manuscripts that present conceptual or technological advances leading to insights into cancer biology are particularly sought after. The journal also places emphasis on convergence science, which involves bridging multiple distinct areas of cancer research.
With primary subsections including Cancer Biology, Cancer Immunology, Cancer Metabolism and Molecular Mechanisms, Translational Cancer Biology, Cancer Landscapes, and Convergence Science, Cancer Research has a comprehensive scope. It is published twice a month and has one volume per year, with a print ISSN of 0008-5472 and an online ISSN of 1538-7445.
Cancer Research is abstracted and/or indexed in various databases and platforms, including BIOSIS Previews (R) Database, MEDLINE, Current Contents/Life Sciences, Current Contents/Clinical Medicine, Science Citation Index, Scopus, and Web of Science.