STAVER: a standardized benchmark dataset-based algorithm for effective variation reduction in large-scale DIA-MS data.

IF 6.8 2区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

Briefings in bioinformatics Pub Date : 2024-09-23 DOI:10.1093/bib/bbae553

Peng Ran, Yunzhi Wang, Kai Li, Shiman He, Subei Tan, Jiacheng Lv, Jiajun Zhu, Shaoshuai Tang, Jinwen Feng, Zhaoyu Qin, Yan Li, Lin Huang, Yanan Yin, Lingli Zhu, Wenjun Yang, Chen Ding

{"title":"STAVER: a standardized benchmark dataset-based algorithm for effective variation reduction in large-scale DIA-MS data.","authors":"Peng Ran, Yunzhi Wang, Kai Li, Shiman He, Subei Tan, Jiacheng Lv, Jiajun Zhu, Shaoshuai Tang, Jinwen Feng, Zhaoyu Qin, Yan Li, Lin Huang, Yanan Yin, Lingli Zhu, Wenjun Yang, Chen Ding","doi":"10.1093/bib/bbae553","DOIUrl":null,"url":null,"abstract":"<p><p>Mass spectrometry (MS)-based proteomics has become instrumental in comprehensively investigating complex biological systems. Data-independent acquisition (DIA)-MS, utilizing hybrid spectral library search strategies, allows for the simultaneous quantification of thousands of proteins, showing promise in enhancing protein identification and quantification precision. However, low-quality profiles can considerably undermine quantitative precision, resulting in inaccurate protein quantification. To tackle this challenge, we introduced STAVER, a novel algorithm that leverages standardized benchmark datasets to reduce non-biological variation in large-scale DIA-MS analyses. By eliminating unwanted noise in MS signals, STAVER significantly improved protein quantification precision, especially in hybrid spectral library searches. Moreover, we validated STAVER's robustness and applicability across multiple large-scale DIA datasets, demonstrating significantly enhanced precision and reproducibility of protein quantification. STAVER offers an innovative and effective approach for enhancing the quality of large-scale DIA proteomic data, facilitating cross-platform and cross-laboratory comparative analyses. This advancement significantly enhances the consistency and reliability of findings in clinical research. The complete package is available at https://github.com/Ran485/STAVER.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"25 6","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11540132/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbae553","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Mass spectrometry (MS)-based proteomics has become instrumental in comprehensively investigating complex biological systems. Data-independent acquisition (DIA)-MS, utilizing hybrid spectral library search strategies, allows for the simultaneous quantification of thousands of proteins, showing promise in enhancing protein identification and quantification precision. However, low-quality profiles can considerably undermine quantitative precision, resulting in inaccurate protein quantification. To tackle this challenge, we introduced STAVER, a novel algorithm that leverages standardized benchmark datasets to reduce non-biological variation in large-scale DIA-MS analyses. By eliminating unwanted noise in MS signals, STAVER significantly improved protein quantification precision, especially in hybrid spectral library searches. Moreover, we validated STAVER's robustness and applicability across multiple large-scale DIA datasets, demonstrating significantly enhanced precision and reproducibility of protein quantification. STAVER offers an innovative and effective approach for enhancing the quality of large-scale DIA proteomic data, facilitating cross-platform and cross-laboratory comparative analyses. This advancement significantly enhances the consistency and reliability of findings in clinical research. The complete package is available at https://github.com/Ran485/STAVER.

查看原文本刊更多论文

STAVER：基于标准化基准数据集的算法，可有效减少大规模 DIA-MS 数据中的变异。

基于质谱（MS）的蛋白质组学已成为全面研究复杂生物系统的重要工具。数据独立采集（DIA）-质谱利用混合谱库搜索策略，可同时对数千种蛋白质进行定量分析，在提高蛋白质鉴定和定量精度方面大有可为。然而，低质量的图谱会大大降低定量精度，导致蛋白质定量不准确。为了应对这一挑战，我们引入了 STAVER 算法，这是一种利用标准化基准数据集来减少大规模 DIA-MS 分析中的非生物变异的新型算法。通过消除质谱信号中不必要的噪声，STAVER 显著提高了蛋白质定量精度，尤其是在混合谱库搜索中。此外，我们还在多个大规模 DIA 数据集上验证了 STAVER 的稳健性和适用性，证明其显著提高了蛋白质定量的精度和可重复性。STAVER 为提高大规模 DIA 蛋白质组学数据的质量提供了一种创新而有效的方法，促进了跨平台和跨实验室的比较分析。这一进步大大提高了临床研究结果的一致性和可靠性。完整的软件包可从 https://github.com/Ran485/STAVER 获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Briefings in bioinformatics 生物-生化研究方法

CiteScore

13.20

自引率

13.70%

发文量

549

审稿时长

6 months

期刊介绍： Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data. The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.