Genetics, primary care records and lifestyle factors for short-term dynamic risk prediction of colorectal cancer: prospective study of asymptomatic and symptomatic UK Biobank participants.
Samantha Ip, Hannah Harrison, Juliet A Usher-Smith, Matthew E Barclay, Jonathan Tyrer, Joe Dennis, Xin Yang, Michael Lush, Cristina Renzi, Nora Pashayan, Spiros Denaxas, Georgios Lyratzopoulos, Antonis C Antoniou, Angela M Wood
{"title":"Genetics, primary care records and lifestyle factors for short-term dynamic risk prediction of colorectal cancer: prospective study of asymptomatic and symptomatic UK Biobank participants.","authors":"Samantha Ip, Hannah Harrison, Juliet A Usher-Smith, Matthew E Barclay, Jonathan Tyrer, Joe Dennis, Xin Yang, Michael Lush, Cristina Renzi, Nora Pashayan, Spiros Denaxas, Georgios Lyratzopoulos, Antonis C Antoniou, Angela M Wood","doi":"10.1136/bmjonc-2024-000336","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To quantify the contributions of polygenic scores, primary care records (presenting symptoms, medical history and common blood tests) and lifestyle factors, for short-term risk prediction of colorectal cancer (CRC) in general and symptomatic individuals.</p><p><strong>Methods and analysis: </strong>This prospective cohort study used data from the UK Biobank with follow-up until 2018. It included 160 507 participants with linked primary care records and a subcohort of 42 782 participants with recent CRC-related symptoms. The outcome was the first-recorded CRC diagnosis within 2 years. Dynamic risk models with time-varying predictors were derived using a super-landmark framework. Model discrimination was assessed through Harrel's C-index, and predictor contributions to model discrimination were evaluated using inclusion-order-agnostic Shapley values.</p><p><strong>Results: </strong>C-indices (95% CIs) were 0.73 (0.72 to 0.73) and 0.69 (0.68 to 0.70) for the general and symptomatic participants, respectively. Shapley contributions to model discrimination (95% CIs) were core predictors (eg, age, sex) 33% (25% to 42%) (symptomatic: 34% (9% to 75%)), polygenic scores 16% (8% to 26%) (8% (-21% to 35%)), primary care blood tests 32% (19% to 43%) (41% (16% to 73%)), medical history 11% (4% to 17%) (9% (-25% to 37%)), lifestyle factors 6% (0% to 11%) (-5% (-32% to 13.4%)) and symptoms 3% (-2% to 7%) (13% (-19% to 41%)).</p><p><strong>Conclusions: </strong>Polygenic scores contribute substantially to short-term risk prediction for CRC in both general and symptomatic populations; however, the contribution of information in primary care records (including presenting symptoms, medical history and common blood tests) is greater. Lifestyle factors not routinely collected in primary care contribute minimally.</p>","PeriodicalId":72436,"journal":{"name":"BMJ oncology","volume":"4 1","pages":"e000336"},"PeriodicalIF":0.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11880779/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ oncology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjonc-2024-000336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: To quantify the contributions of polygenic scores, primary care records (presenting symptoms, medical history and common blood tests) and lifestyle factors, for short-term risk prediction of colorectal cancer (CRC) in general and symptomatic individuals.
Methods and analysis: This prospective cohort study used data from the UK Biobank with follow-up until 2018. It included 160 507 participants with linked primary care records and a subcohort of 42 782 participants with recent CRC-related symptoms. The outcome was the first-recorded CRC diagnosis within 2 years. Dynamic risk models with time-varying predictors were derived using a super-landmark framework. Model discrimination was assessed through Harrel's C-index, and predictor contributions to model discrimination were evaluated using inclusion-order-agnostic Shapley values.
Results: C-indices (95% CIs) were 0.73 (0.72 to 0.73) and 0.69 (0.68 to 0.70) for the general and symptomatic participants, respectively. Shapley contributions to model discrimination (95% CIs) were core predictors (eg, age, sex) 33% (25% to 42%) (symptomatic: 34% (9% to 75%)), polygenic scores 16% (8% to 26%) (8% (-21% to 35%)), primary care blood tests 32% (19% to 43%) (41% (16% to 73%)), medical history 11% (4% to 17%) (9% (-25% to 37%)), lifestyle factors 6% (0% to 11%) (-5% (-32% to 13.4%)) and symptoms 3% (-2% to 7%) (13% (-19% to 41%)).
Conclusions: Polygenic scores contribute substantially to short-term risk prediction for CRC in both general and symptomatic populations; however, the contribution of information in primary care records (including presenting symptoms, medical history and common blood tests) is greater. Lifestyle factors not routinely collected in primary care contribute minimally.