Identification and validation of microbial biomarkers from cross-cohort datasets using xMarkerFinder

IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Wenxing Gao, Weili Lin, Qiang Li, Wanning Chen, Wenjing Yin, Xinyue Zhu, Sheng Gao, Lei Liu, Wenjie Li, Dingfeng Wu, Guoqing Zhang, Ruixin Zhu, Na Jiao
{"title":"Identification and validation of microbial biomarkers from cross-cohort datasets using xMarkerFinder","authors":"Wenxing Gao, Weili Lin, Qiang Li, Wanning Chen, Wenjing Yin, Xinyue Zhu, Sheng Gao, Lei Liu, Wenjie Li, Dingfeng Wu, Guoqing Zhang, Ruixin Zhu, Na Jiao","doi":"10.1038/s41596-024-00999-9","DOIUrl":null,"url":null,"abstract":"Microbial signatures have emerged as promising biomarkers for disease diagnostics and prognostics, yet their variability across different studies calls for a standardized approach to biomarker research. Therefore, we introduce xMarkerFinder, a four-stage computational framework for microbial biomarker identification with comprehensive validations from cross-cohort datasets, including differential signature identification, model construction, model validation and biomarker interpretation. xMarkerFinder enables the identification and validation of reproducible biomarkers for cross-cohort studies, along with the establishment of classification models and potential microbiome-induced mechanisms. Originally developed for gut microbiome research, xMarkerFinder’s adaptable design makes it applicable to various microbial habitats and data types. Distinct from existing biomarker research tools that typically concentrate on a singular aspect, xMarkerFinder uniquely incorporates a sophisticated feature selection process, specifically designed to address the heterogeneity between different cohorts, extensive internal and external validations, and detailed specificity assessments. Execution time varies depending on the sample size, selected algorithm and computational resource. Accessible via GitHub ( https://github.com/tjcadd2020/xMarkerFinder ), xMarkerFinder supports users with diverse expertise levels through different execution options, including step-to-step scripts with detailed tutorials and frequently asked questions, a single-command execution script, a ready-to-use Docker image and a user-friendly web server ( https://www.biosino.org/xmarkerfinder ). This protocol is for using xMarkerFinder, a four-stage computational framework, to enable the identification and validation of reproducible microbial biomarkers from cross-cohort studies, and establish potential microbiome-induced mechanisms.","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":"19 9","pages":"2803-2830"},"PeriodicalIF":13.1000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Protocols","FirstCategoryId":"99","ListUrlMain":"https://www.nature.com/articles/s41596-024-00999-9","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Microbial signatures have emerged as promising biomarkers for disease diagnostics and prognostics, yet their variability across different studies calls for a standardized approach to biomarker research. Therefore, we introduce xMarkerFinder, a four-stage computational framework for microbial biomarker identification with comprehensive validations from cross-cohort datasets, including differential signature identification, model construction, model validation and biomarker interpretation. xMarkerFinder enables the identification and validation of reproducible biomarkers for cross-cohort studies, along with the establishment of classification models and potential microbiome-induced mechanisms. Originally developed for gut microbiome research, xMarkerFinder’s adaptable design makes it applicable to various microbial habitats and data types. Distinct from existing biomarker research tools that typically concentrate on a singular aspect, xMarkerFinder uniquely incorporates a sophisticated feature selection process, specifically designed to address the heterogeneity between different cohorts, extensive internal and external validations, and detailed specificity assessments. Execution time varies depending on the sample size, selected algorithm and computational resource. Accessible via GitHub ( https://github.com/tjcadd2020/xMarkerFinder ), xMarkerFinder supports users with diverse expertise levels through different execution options, including step-to-step scripts with detailed tutorials and frequently asked questions, a single-command execution script, a ready-to-use Docker image and a user-friendly web server ( https://www.biosino.org/xmarkerfinder ). This protocol is for using xMarkerFinder, a four-stage computational framework, to enable the identification and validation of reproducible microbial biomarkers from cross-cohort studies, and establish potential microbiome-induced mechanisms.

Abstract Image

Abstract Image

使用 xMarkerFinder 从交叉队列数据集中识别和验证微生物生物标记物。
微生物特征已成为疾病诊断和预后的有前途的生物标志物,但它们在不同研究中的变异性要求生物标志物研究采用标准化方法。因此,我们引入了 xMarkerFinder,这是一个用于微生物生物标记物鉴定的四阶段计算框架,可通过跨队列数据集进行全面验证,包括差异特征鉴定、模型构建、模型验证和生物标记物解释。xMarkerFinder 可为跨队列研究鉴定和验证可重复的生物标记物,同时建立分类模型和潜在的微生物诱导机制。xMarkerFinder 最初是为肠道微生物组研究而开发的,其适应性强的设计使其适用于各种微生物栖息地和数据类型。现有的生物标记物研究工具通常只专注于一个方面,而 xMarkerFinder 则与众不同,它采用了复杂的特征选择过程,专门用于解决不同队列之间的异质性、广泛的内部和外部验证以及详细的特异性评估。执行时间取决于样本大小、所选算法和计算资源。xMarkerFinder 可通过 GitHub ( https://github.com/tjcadd2020/xMarkerFinder ) 访问,通过不同的执行选项支持不同专业水平的用户,包括带有详细教程和常见问题的分步脚本、单指令执行脚本、即用型 Docker 映像和用户友好型 Web 服务器 ( https://www.biosino.org/xmarkerfinder )。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature Protocols
Nature Protocols 生物-生化研究方法
CiteScore
29.10
自引率
0.70%
发文量
128
审稿时长
4 months
期刊介绍: Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured. The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信