{"title":"Derivation and validation of lifestyle-based and microbiota-based models for colorectal adenoma risk evaluation and self-prediction.","authors":"Yi-Lu Zhou, Jia-Wen Deng, Zhu-Hui Liu, Xin-Yue Ma, Chun-Qi Zhu, Yuan-Hong Xie, Cheng-Bei Zhou, Jing-Yuan Fang","doi":"10.1136/bmjgast-2024-001597","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Early warning and screening of colorectal adenoma (CRA) is important for colorectal cancer (CRC) prevention. This study aimed to construct a non-invasive prediction model to improve CRA screening efficacy.</p><p><strong>Methods: </strong>This study incorporated three cohorts, comprising 9747 participants who underwent colonoscopy. In cohort 1, 683 participants were prospectively recruited with comprehensive lifestyle information and faecal samples. CRA-associated bacteria were identified through 16S rRNA sequencing and quantitative real-time PCR. CRA prediction models were established using lifestyle and gut microbiota information. Cohort 2 prospectively enrolled 1529 participants to validate the lifestyle-based model, while cohort 3 retrospectively analysed 7535 individuals to determine the recommended initial colonoscopy screening ages for different risk groups based on age-specific CRA incidence rates.</p><p><strong>Results: </strong>Multivariable logistic regression yielded a prediction model incorporating 14 variables, demonstrating robust discrimination (c-statistic=0.79, 95% CI 0.75, 0.82). Other machine learning approaches showed comparable performance (random forest: 0.78, 95% CI 0.73, 0.81; gradient boosting: 0.78, 95% CI 0.76, 0.83). The ages for starting colonoscopy screening were established at 42 years for the high-risk group vs 53 years for the low-risk group. The inclusion of <i>Fusobacterium nucleatum</i> and <i>pks<sup>+</sup> Escherichia</i> <i>coli</i> enhanced the model's performance (c-statistic=0.84-0.86).</p><p><strong>Conclusion: </strong>Integrated mathematical modelling incorporating lifestyle parameters and gut microbial signatures provides an effective non-invasive strategy for CRA risk stratification, while the accompanying machine learning-assisted prediction application enables cost-effective, population-level screening implementation to optimise CRC prevention protocols.</p>","PeriodicalId":9235,"journal":{"name":"BMJ Open Gastroenterology","volume":"12 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Open Gastroenterology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjgast-2024-001597","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Early warning and screening of colorectal adenoma (CRA) is important for colorectal cancer (CRC) prevention. This study aimed to construct a non-invasive prediction model to improve CRA screening efficacy.
Methods: This study incorporated three cohorts, comprising 9747 participants who underwent colonoscopy. In cohort 1, 683 participants were prospectively recruited with comprehensive lifestyle information and faecal samples. CRA-associated bacteria were identified through 16S rRNA sequencing and quantitative real-time PCR. CRA prediction models were established using lifestyle and gut microbiota information. Cohort 2 prospectively enrolled 1529 participants to validate the lifestyle-based model, while cohort 3 retrospectively analysed 7535 individuals to determine the recommended initial colonoscopy screening ages for different risk groups based on age-specific CRA incidence rates.
Results: Multivariable logistic regression yielded a prediction model incorporating 14 variables, demonstrating robust discrimination (c-statistic=0.79, 95% CI 0.75, 0.82). Other machine learning approaches showed comparable performance (random forest: 0.78, 95% CI 0.73, 0.81; gradient boosting: 0.78, 95% CI 0.76, 0.83). The ages for starting colonoscopy screening were established at 42 years for the high-risk group vs 53 years for the low-risk group. The inclusion of Fusobacterium nucleatum and pks+ Escherichiacoli enhanced the model's performance (c-statistic=0.84-0.86).
Conclusion: Integrated mathematical modelling incorporating lifestyle parameters and gut microbial signatures provides an effective non-invasive strategy for CRA risk stratification, while the accompanying machine learning-assisted prediction application enables cost-effective, population-level screening implementation to optimise CRC prevention protocols.
期刊介绍:
BMJ Open Gastroenterology is an online-only, peer-reviewed, open access gastroenterology journal, dedicated to publishing high-quality medical research from all disciplines and therapeutic areas of gastroenterology. It is the open access companion journal of Gut and is co-owned by the British Society of Gastroenterology. The journal publishes all research study types, from study protocols to phase I trials to meta-analyses, including small or specialist studies. Publishing procedures are built around continuous publication, publishing research online as soon as the article is ready.