Improved diagnostic efficiency of CRC subgroups revealed using machine learning based on intestinal microbes

IF 2.5 3区 医学 Q2 GASTROENTEROLOGY & HEPATOLOGY
Guang Liu, Lili Su, Cheng Kong, Liang Huang, Xiaoyan Zhu, Xuanping Zhang, Yanlei Ma, Jiayin Wang
{"title":"Improved diagnostic efficiency of CRC subgroups revealed using machine learning based on intestinal microbes","authors":"Guang Liu, Lili Su, Cheng Kong, Liang Huang, Xiaoyan Zhu, Xuanping Zhang, Yanlei Ma, Jiayin Wang","doi":"10.1186/s12876-024-03408-3","DOIUrl":null,"url":null,"abstract":"Colorectal cancer (CRC) is a common cancer that causes millions of deaths worldwide each year. At present, numerous studies have confirmed that intestinal microbes play a crucial role in the process of CRC. Additionally, studies have shown that CRC can be divided into several consensus molecular subtypes (CMS) based on tumor gene expression, and CRC microbiomes have been reported related to CMS. However, most previous studies on intestinal microbiome of CRC have only compared patients with healthy controls, without classifying of CRC patients based on intestinal microbial composition. In this study, a CRC cohort including 339 CRC samples and 333 healthy controls was selected as the discovery set, and the CRC samples were divided into two subgroups (234 Subgroup1 and 105 Subgroup2) using PAM clustering algorithm based on the intestinal microbial composition. We found that not only the microbial diversity was significantly different (Shannon index, p-value < 0.05), but also 129 shared genera altered (p-value < 0.05) between the two CRC subgroups, including several marker genera in CRC, such as Fusobacterium and Bacteroides. A random forest algorithm was used to construct diagnostic models, which showed significantly higher efficiency when the CRC samples were divided into subgroups. Then an independent cohort including 187 CRC samples (divided into 153 Subgroup1 and 34 Subgroup2) and 123 healthy controls was chosen to validate the models, and confirmed the results. These results indicate that the divided CRC subgroups can improve the efficiency of disease diagnosis, with various microbial composition in the subgroups.","PeriodicalId":9129,"journal":{"name":"BMC Gastroenterology","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12876-024-03408-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Colorectal cancer (CRC) is a common cancer that causes millions of deaths worldwide each year. At present, numerous studies have confirmed that intestinal microbes play a crucial role in the process of CRC. Additionally, studies have shown that CRC can be divided into several consensus molecular subtypes (CMS) based on tumor gene expression, and CRC microbiomes have been reported related to CMS. However, most previous studies on intestinal microbiome of CRC have only compared patients with healthy controls, without classifying of CRC patients based on intestinal microbial composition. In this study, a CRC cohort including 339 CRC samples and 333 healthy controls was selected as the discovery set, and the CRC samples were divided into two subgroups (234 Subgroup1 and 105 Subgroup2) using PAM clustering algorithm based on the intestinal microbial composition. We found that not only the microbial diversity was significantly different (Shannon index, p-value < 0.05), but also 129 shared genera altered (p-value < 0.05) between the two CRC subgroups, including several marker genera in CRC, such as Fusobacterium and Bacteroides. A random forest algorithm was used to construct diagnostic models, which showed significantly higher efficiency when the CRC samples were divided into subgroups. Then an independent cohort including 187 CRC samples (divided into 153 Subgroup1 and 34 Subgroup2) and 123 healthy controls was chosen to validate the models, and confirmed the results. These results indicate that the divided CRC subgroups can improve the efficiency of disease diagnosis, with various microbial composition in the subgroups.
利用基于肠道微生物的机器学习提高对 CRC 亚组的诊断效率
结肠直肠癌(CRC)是一种常见癌症,每年导致全球数百万人死亡。目前,大量研究证实,肠道微生物在 CRC 的发病过程中起着至关重要的作用。此外,研究还表明,根据肿瘤基因表达,CRC 可分为几种共识分子亚型(CMS),并且有报道称 CRC 微生物组与 CMS 相关。然而,以往关于 CRC 肠道微生物组的研究大多只是将患者与健康对照进行比较,并没有根据肠道微生物组成对 CRC 患者进行分类。本研究选择了一个包括 339 例 CRC 样本和 333 例健康对照的 CRC 队列作为发现集,并根据肠道微生物组成采用 PAM 聚类算法将 CRC 样本分为两个亚组(234 Subgroup1 和 105 Subgroup2)。我们发现,两个 CRC 亚组之间不仅微生物多样性存在显著差异(香农指数,P 值<0.05),而且有 129 个共有属发生了改变(P 值<0.05),其中包括 CRC 中的几个标记属,如 Fusobacterium 和 Bacteroides。研究人员使用随机森林算法构建诊断模型,结果表明,将 CRC 样本分成亚组后,诊断效率明显提高。随后,研究人员选择了一个包括 187 个 CRC 样本(分为 153 个亚组 1 和 34 个亚组 2)和 123 个健康对照的独立队列来验证模型,结果证实了上述结论。这些结果表明,在不同微生物组成的亚组中划分 CRC 亚组可以提高疾病诊断的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Gastroenterology
BMC Gastroenterology 医学-胃肠肝病学
CiteScore
4.20
自引率
0.00%
发文量
465
审稿时长
6 months
期刊介绍: BMC Gastroenterology is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of gastrointestinal and hepatobiliary disorders, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信