b宏蛋白质组学蛋白数据库构建及高分辨率分类注释的基因组参考。

IF 3.8 2区 生物学 Q2 MICROBIOLOGY
Microbiology spectrum Pub Date : 2025-02-04 Epub Date: 2024-12-12 DOI:10.1128/spectrum.01755-24
Xiaowei Gao, Hewei Liang, Tongyuan Hu, Yuanqiang Zou, Liang Xiao
{"title":"b宏蛋白质组学蛋白数据库构建及高分辨率分类注释的基因组参考。","authors":"Xiaowei Gao, Hewei Liang, Tongyuan Hu, Yuanqiang Zou, Liang Xiao","doi":"10.1128/spectrum.01755-24","DOIUrl":null,"url":null,"abstract":"<p><p>Metaproteomics offers a profound understanding of the functional dynamics of the gut microbiome, which is crucial for personalized healthcare strategies. The selection of an appropriate database is a critical step for the identification of peptides and proteins, as well as for the provision of accurate taxonomic and functional annotations. The matched metagenomic-derived database is considered to be the best, but its limitations include the identification of low-abundance organisms and taxonomic resolution. Herein, we constructed a protein database (DBCGR2) based on Cultivated Genome Reference 2 (CGR2) and developed a complete peptide-centric analysis workflow for database searching and for the annotation of taxonomy and function. This workflow was subsequently appraised in comparison with metagenomics-derived databases for the analysis of metaproteomic data. Our findings suggested that the performance of DBCGR2 in identification was comparable with metagenomics-derived databases with improvement in identification rates of peptides from low-abundance species. The database searching results could be fully annotated using the pepTaxa taxonomic annotation approach developed in this study, and the taxonomic resolution was enhanced to strain level. Additionally, the results demonstrated that the sensitivity of functional annotation could be enhanced by employing DBCGR2. Overall, the DBCGR2 combined with pepTaxa can be considered an alternative for metaproteomic data analysis with superior analysis performances.IMPORTANCEMass spectrometry-based metaproteomics offers a profound understanding of the gut microbial taxonomy and functionality. The databases utilized in the analysis of metaproteomic data are crucial, as they determine the identification of proteins that can be recognized and linked to overall human health, in addition to the quality of taxonomic and functional annotation. Among the most effective approaches for constructing protein databases is the utilization of metagenomic sequencing to create matched databases. However, the database, derived from isolated genomes, has yet to undergo rigorous testing for their efficacy and accuracy in protein identification and taxonomic and functional annotation. Here, we constructed a protein database DBCGR2 derived from Cultivated Genome Reference 2 (CGR2) and a complete workflow for data analysis. We compared the performances of DBCGR2 and metagenomics-derived databases. Our results indicated that DBCGR2 can be regarded as an alternative to metagenomics-derived databases, which contribute to metaproteomic data analysis.</p>","PeriodicalId":18670,"journal":{"name":"Microbiology spectrum","volume":" ","pages":"e0175524"},"PeriodicalIF":3.8000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11792528/pdf/","citationCount":"0","resultStr":"{\"title\":\"Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.\",\"authors\":\"Xiaowei Gao, Hewei Liang, Tongyuan Hu, Yuanqiang Zou, Liang Xiao\",\"doi\":\"10.1128/spectrum.01755-24\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Metaproteomics offers a profound understanding of the functional dynamics of the gut microbiome, which is crucial for personalized healthcare strategies. The selection of an appropriate database is a critical step for the identification of peptides and proteins, as well as for the provision of accurate taxonomic and functional annotations. The matched metagenomic-derived database is considered to be the best, but its limitations include the identification of low-abundance organisms and taxonomic resolution. Herein, we constructed a protein database (DBCGR2) based on Cultivated Genome Reference 2 (CGR2) and developed a complete peptide-centric analysis workflow for database searching and for the annotation of taxonomy and function. This workflow was subsequently appraised in comparison with metagenomics-derived databases for the analysis of metaproteomic data. Our findings suggested that the performance of DBCGR2 in identification was comparable with metagenomics-derived databases with improvement in identification rates of peptides from low-abundance species. The database searching results could be fully annotated using the pepTaxa taxonomic annotation approach developed in this study, and the taxonomic resolution was enhanced to strain level. Additionally, the results demonstrated that the sensitivity of functional annotation could be enhanced by employing DBCGR2. Overall, the DBCGR2 combined with pepTaxa can be considered an alternative for metaproteomic data analysis with superior analysis performances.IMPORTANCEMass spectrometry-based metaproteomics offers a profound understanding of the gut microbial taxonomy and functionality. The databases utilized in the analysis of metaproteomic data are crucial, as they determine the identification of proteins that can be recognized and linked to overall human health, in addition to the quality of taxonomic and functional annotation. Among the most effective approaches for constructing protein databases is the utilization of metagenomic sequencing to create matched databases. However, the database, derived from isolated genomes, has yet to undergo rigorous testing for their efficacy and accuracy in protein identification and taxonomic and functional annotation. Here, we constructed a protein database DBCGR2 derived from Cultivated Genome Reference 2 (CGR2) and a complete workflow for data analysis. We compared the performances of DBCGR2 and metagenomics-derived databases. Our results indicated that DBCGR2 can be regarded as an alternative to metagenomics-derived databases, which contribute to metaproteomic data analysis.</p>\",\"PeriodicalId\":18670,\"journal\":{\"name\":\"Microbiology spectrum\",\"volume\":\" \",\"pages\":\"e0175524\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-02-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11792528/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microbiology spectrum\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1128/spectrum.01755-24\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/12 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbiology spectrum","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1128/spectrum.01755-24","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/12 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

宏蛋白质组学提供了对肠道微生物组功能动态的深刻理解,这对于个性化医疗保健策略至关重要。选择合适的数据库是鉴定多肽和蛋白质以及提供准确的分类和功能注释的关键步骤。匹配的宏基因组衍生数据库被认为是最好的,但其局限性包括低丰度生物的鉴定和分类分辨率。为此,我们基于栽培基因组参考文献2 (CGR2)构建了一个蛋白质数据库(DBCGR2),并开发了一套完整的以肽为中心的分析工作流程,用于数据库检索、分类和功能标注。随后,将该工作流程与元基因组学衍生数据库进行比较,以分析元基因组学数据。我们的研究结果表明,DBCGR2在鉴定方面的性能与宏基因组衍生数据库相当,并且在低丰度物种的肽鉴定率方面有所提高。利用本研究开发的pepTaxa分类标注方法可以对数据库检索结果进行完整的标注,并将分类分辨率提高到品系水平。结果表明,DBCGR2可以提高功能标注的灵敏度。综上所述,DBCGR2与pepTaxa结合可以被认为是一种具有优越分析性能的元蛋白质组学数据分析的替代方法。基于质谱的宏蛋白质组学提供了对肠道微生物分类和功能的深刻理解。用于分析元蛋白质组学数据的数据库至关重要,因为除了分类和功能注释的质量外,它们还确定了可识别并与整体人类健康相关的蛋白质的鉴定。构建蛋白质数据库最有效的方法之一是利用宏基因组测序来创建匹配数据库。然而,该数据库来源于分离的基因组,尚未经过严格的测试,以确定其在蛋白质鉴定、分类和功能注释方面的有效性和准确性。在此,我们构建了一个来自栽培基因组参考资料2 (CGR2)的蛋白质数据库DBCGR2和一个完整的数据分析工作流程。我们比较了DBCGR2和宏基因组衍生数据库的性能。我们的研究结果表明,DBCGR2可以被视为元基因组学衍生数据库的替代方案,后者有助于元蛋白质组学数据分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.

Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.

Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.

Cultivated genome references for protein database construction and high-resolution taxonomic annotation in metaproteomics.

Metaproteomics offers a profound understanding of the functional dynamics of the gut microbiome, which is crucial for personalized healthcare strategies. The selection of an appropriate database is a critical step for the identification of peptides and proteins, as well as for the provision of accurate taxonomic and functional annotations. The matched metagenomic-derived database is considered to be the best, but its limitations include the identification of low-abundance organisms and taxonomic resolution. Herein, we constructed a protein database (DBCGR2) based on Cultivated Genome Reference 2 (CGR2) and developed a complete peptide-centric analysis workflow for database searching and for the annotation of taxonomy and function. This workflow was subsequently appraised in comparison with metagenomics-derived databases for the analysis of metaproteomic data. Our findings suggested that the performance of DBCGR2 in identification was comparable with metagenomics-derived databases with improvement in identification rates of peptides from low-abundance species. The database searching results could be fully annotated using the pepTaxa taxonomic annotation approach developed in this study, and the taxonomic resolution was enhanced to strain level. Additionally, the results demonstrated that the sensitivity of functional annotation could be enhanced by employing DBCGR2. Overall, the DBCGR2 combined with pepTaxa can be considered an alternative for metaproteomic data analysis with superior analysis performances.IMPORTANCEMass spectrometry-based metaproteomics offers a profound understanding of the gut microbial taxonomy and functionality. The databases utilized in the analysis of metaproteomic data are crucial, as they determine the identification of proteins that can be recognized and linked to overall human health, in addition to the quality of taxonomic and functional annotation. Among the most effective approaches for constructing protein databases is the utilization of metagenomic sequencing to create matched databases. However, the database, derived from isolated genomes, has yet to undergo rigorous testing for their efficacy and accuracy in protein identification and taxonomic and functional annotation. Here, we constructed a protein database DBCGR2 derived from Cultivated Genome Reference 2 (CGR2) and a complete workflow for data analysis. We compared the performances of DBCGR2 and metagenomics-derived databases. Our results indicated that DBCGR2 can be regarded as an alternative to metagenomics-derived databases, which contribute to metaproteomic data analysis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Microbiology spectrum
Microbiology spectrum Biochemistry, Genetics and Molecular Biology-Genetics
CiteScore
3.20
自引率
5.40%
发文量
1800
期刊介绍: Microbiology Spectrum publishes commissioned review articles on topics in microbiology representing ten content areas: Archaea; Food Microbiology; Bacterial Genetics, Cell Biology, and Physiology; Clinical Microbiology; Environmental Microbiology and Ecology; Eukaryotic Microbes; Genomics, Computational, and Synthetic Microbiology; Immunology; Pathogenesis; and Virology. Reviews are interrelated, with each review linking to other related content. A large board of Microbiology Spectrum editors aids in the development of topics for potential reviews and in the identification of an editor, or editors, who shepherd each collection.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信