基于粪便的蛋白质组学特征用于克罗恩病和溃疡性结肠炎的非侵入性分类。

IF 3 3区 医学 Q2 GASTROENTEROLOGY & HEPATOLOGY
Elmira Shajari, David Gagné, Francis Bourassa, Mandy Malick, Patricia Roy, Jean-François Noël, Hugo Gagnon, Maxime Delisle, François-Michel Boisvert, Marie Brunet, Jean-François Beaulieu
{"title":"基于粪便的蛋白质组学特征用于克罗恩病和溃疡性结肠炎的非侵入性分类。","authors":"Elmira Shajari, David Gagné, Francis Bourassa, Mandy Malick, Patricia Roy, Jean-François Noël, Hugo Gagnon, Maxime Delisle, François-Michel Boisvert, Marie Brunet, Jean-François Beaulieu","doi":"10.14309/ctg.0000000000000925","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Crohn's disease and ulcerative colitis have overlapping symptoms, but they differ in pathology and treatment. Currently, distinguishing between these diseases involves invasive procedures such as colonoscopy and histopathology. Fecal proteins, stable and in direct contact with inflammation, offer a non-invasive alternative. This study focuses on using high-throughput data-independent acquisition mass spectrometry and machine learning to develop an accurate biomarker signature from complex stool samples.</p><p><strong>Methods: </strong>Stool samples obtained from 69 active patients were analyzed. Analysis of the stool proteome led to the identification and quantification of approximately 1,250 proteins. The samples were divided into training and testing groups. After data processing, various feature selection algorithms were applied on the training group to determine proteins that were significantly different between the Crohn's disease and ulcerative colitis groups. Additionally, six machine learning algorithms were evaluated to identify the best-performing classifiers.</p><p><strong>Results: </strong>Sixteen proteins were selected based on several feature selection algorithms and six models were trained based on them. According to the performance metrics of each algorithm on the training dataset, the Naïve Bayes model was selected. For performance validation, the final predictive model was applied to 16 blind prospective samples as the test dataset. Notably, the model achieved an AUC of 0.96 on both the training and test datasets, highlighting its robustness and stability.</p><p><strong>Discussion: </strong>This study demonstrates the potential of combining multiple stool protein biomarkers via high-throughput data-independent acquisition mass spectrometry and machine learning tools to develop a predictive model for efficiently distinguishing Crohn's disease from ulcerative colitis.</p>","PeriodicalId":10278,"journal":{"name":"Clinical and Translational Gastroenterology","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Stool-Based Proteomic Signature for the Non-Invasive Classification of Crohn's Disease and Ulcerative Colitis Using Machine Learning.\",\"authors\":\"Elmira Shajari, David Gagné, Francis Bourassa, Mandy Malick, Patricia Roy, Jean-François Noël, Hugo Gagnon, Maxime Delisle, François-Michel Boisvert, Marie Brunet, Jean-François Beaulieu\",\"doi\":\"10.14309/ctg.0000000000000925\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Crohn's disease and ulcerative colitis have overlapping symptoms, but they differ in pathology and treatment. Currently, distinguishing between these diseases involves invasive procedures such as colonoscopy and histopathology. Fecal proteins, stable and in direct contact with inflammation, offer a non-invasive alternative. This study focuses on using high-throughput data-independent acquisition mass spectrometry and machine learning to develop an accurate biomarker signature from complex stool samples.</p><p><strong>Methods: </strong>Stool samples obtained from 69 active patients were analyzed. Analysis of the stool proteome led to the identification and quantification of approximately 1,250 proteins. The samples were divided into training and testing groups. After data processing, various feature selection algorithms were applied on the training group to determine proteins that were significantly different between the Crohn's disease and ulcerative colitis groups. Additionally, six machine learning algorithms were evaluated to identify the best-performing classifiers.</p><p><strong>Results: </strong>Sixteen proteins were selected based on several feature selection algorithms and six models were trained based on them. According to the performance metrics of each algorithm on the training dataset, the Naïve Bayes model was selected. For performance validation, the final predictive model was applied to 16 blind prospective samples as the test dataset. Notably, the model achieved an AUC of 0.96 on both the training and test datasets, highlighting its robustness and stability.</p><p><strong>Discussion: </strong>This study demonstrates the potential of combining multiple stool protein biomarkers via high-throughput data-independent acquisition mass spectrometry and machine learning tools to develop a predictive model for efficiently distinguishing Crohn's disease from ulcerative colitis.</p>\",\"PeriodicalId\":10278,\"journal\":{\"name\":\"Clinical and Translational Gastroenterology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Clinical and Translational Gastroenterology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.14309/ctg.0000000000000925\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.14309/ctg.0000000000000925","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

简介:克罗恩病和溃疡性结肠炎有重叠的症状,但在病理和治疗上有所不同。目前,区分这些疾病涉及侵入性手术,如结肠镜检查和组织病理学。粪便蛋白稳定且与炎症直接接触,提供了一种非侵入性的替代方法。本研究的重点是使用高通量数据独立采集质谱和机器学习从复杂的粪便样本中开发准确的生物标志物签名。方法:对69例活动性患者的粪便标本进行分析。对粪便蛋白质组的分析导致了大约1250种蛋白质的鉴定和定量。这些样本被分为训练组和测试组。数据处理后,对训练组应用各种特征选择算法,确定克罗恩病组和溃疡性结肠炎组之间存在显著差异的蛋白质。此外,还评估了六种机器学习算法,以确定性能最佳的分类器。结果:基于几种特征选择算法选择了16种蛋白质,并在此基础上训练了6个模型。根据各算法在训练数据集上的性能指标,选择Naïve贝叶斯模型。为了进行性能验证,将最终的预测模型应用于16个盲前瞻性样本作为测试数据集。值得注意的是,该模型在训练和测试数据集上的AUC都达到了0.96,突出了其鲁棒性和稳定性。讨论:本研究展示了通过高通量数据独立采集质谱和机器学习工具结合多种粪便蛋白生物标志物来开发有效区分克罗恩病和溃疡性结肠炎的预测模型的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Stool-Based Proteomic Signature for the Non-Invasive Classification of Crohn's Disease and Ulcerative Colitis Using Machine Learning.

Introduction: Crohn's disease and ulcerative colitis have overlapping symptoms, but they differ in pathology and treatment. Currently, distinguishing between these diseases involves invasive procedures such as colonoscopy and histopathology. Fecal proteins, stable and in direct contact with inflammation, offer a non-invasive alternative. This study focuses on using high-throughput data-independent acquisition mass spectrometry and machine learning to develop an accurate biomarker signature from complex stool samples.

Methods: Stool samples obtained from 69 active patients were analyzed. Analysis of the stool proteome led to the identification and quantification of approximately 1,250 proteins. The samples were divided into training and testing groups. After data processing, various feature selection algorithms were applied on the training group to determine proteins that were significantly different between the Crohn's disease and ulcerative colitis groups. Additionally, six machine learning algorithms were evaluated to identify the best-performing classifiers.

Results: Sixteen proteins were selected based on several feature selection algorithms and six models were trained based on them. According to the performance metrics of each algorithm on the training dataset, the Naïve Bayes model was selected. For performance validation, the final predictive model was applied to 16 blind prospective samples as the test dataset. Notably, the model achieved an AUC of 0.96 on both the training and test datasets, highlighting its robustness and stability.

Discussion: This study demonstrates the potential of combining multiple stool protein biomarkers via high-throughput data-independent acquisition mass spectrometry and machine learning tools to develop a predictive model for efficiently distinguishing Crohn's disease from ulcerative colitis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Clinical and Translational Gastroenterology
Clinical and Translational Gastroenterology GASTROENTEROLOGY & HEPATOLOGY-
CiteScore
7.00
自引率
0.00%
发文量
114
审稿时长
16 weeks
期刊介绍: Clinical and Translational Gastroenterology (CTG), published on behalf of the American College of Gastroenterology (ACG), is a peer-reviewed open access online journal dedicated to innovative clinical work in the field of gastroenterology and hepatology. CTG hopes to fulfill an unmet need for clinicians and scientists by welcoming novel cohort studies, early-phase clinical trials, qualitative and quantitative epidemiologic research, hypothesis-generating research, studies of novel mechanisms and methodologies including public health interventions, and integration of approaches across organs and disciplines. CTG also welcomes hypothesis-generating small studies, methods papers, and translational research with clear applications to human physiology or disease. Colon and small bowel Endoscopy and novel diagnostics Esophagus Functional GI disorders Immunology of the GI tract Microbiology of the GI tract Inflammatory bowel disease Pancreas and biliary tract Liver Pathology Pediatrics Preventative medicine Nutrition/obesity Stomach.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信