swap：超越匹配-运行之间的模块化深度学习授权肽身份传播框架

IF 3.6 2区生物学 Q1 BIOCHEMICAL RESEARCH METHODS

Journal of Proteome Research Pub Date : 2025-03-07 DOI:10.1021/acs.jproteome.4c0097210.1021/acs.jproteome.4c00972

Zixuan Xiao, Johanna Tüshaus, Bernhard Kuster, Matthew The and Mathias Wilhelm*,

{"title":"swap：超越匹配-运行之间的模块化深度学习授权肽身份传播框架","authors":"Zixuan Xiao, Johanna Tüshaus, Bernhard Kuster, Matthew The and Mathias Wilhelm*, ","doi":"10.1021/acs.jproteome.4c0097210.1021/acs.jproteome.4c00972","DOIUrl":null,"url":null,"abstract":"<p >Mass spectrometry (MS)-based proteomics relies heavily on MS/MS (MS2) data, which do not fully exploit the available MS1 information. Traditional peptide identity propagation (PIP) methods, such as match-between-runs (MBR), are limited to similar runs, particularly with the same liquid chromatography (LC) gradients, thus potentially underutilizing available proteomics libraries. We introduce SWAPS, a novel and modular MS1-centric framework incorporating advances in peptide property prediction, extensive proteomics libraries, and deep-learning-based postprocessing to enable and explore PIP across more diverse experimental conditions and LC gradients. SWAPS substantially enhances precursor identification, especially in shorter gradients. On the example of 30, 15, and 7.5 min gradients, SWAPS achieves increases of 46.3, 86.2, and 112.1% on precursor level over MaxQuant’s MS2-based identifications. Despite the inherent challenges in controlling false discovery rates (FDR) with MS1-based methods, SWAPS demonstrates strong efficacy in deconvoluting MS1 signals, offering powerful discrimination and deeper sequence exploration, while maintaining quantitative accuracy. By building on and applying peptide property predictions in practical contexts, SWAPS reveals that current models, while advanced, are still not fully comparable to experimental measurements, sparking the need for further research. Additionally, its modular design allows seamless integration of future improvements, positioning SWAPS as a forward-looking tool in proteomics.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":"24 4","pages":"1926–1940 1926–1940"},"PeriodicalIF":3.6000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acs.jproteome.4c00972","citationCount":"0","resultStr":"{\"title\":\"SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run\",\"authors\":\"Zixuan Xiao, Johanna Tüshaus, Bernhard Kuster, Matthew The and Mathias Wilhelm*, \",\"doi\":\"10.1021/acs.jproteome.4c0097210.1021/acs.jproteome.4c00972\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Mass spectrometry (MS)-based proteomics relies heavily on MS/MS (MS2) data, which do not fully exploit the available MS1 information. Traditional peptide identity propagation (PIP) methods, such as match-between-runs (MBR), are limited to similar runs, particularly with the same liquid chromatography (LC) gradients, thus potentially underutilizing available proteomics libraries. We introduce SWAPS, a novel and modular MS1-centric framework incorporating advances in peptide property prediction, extensive proteomics libraries, and deep-learning-based postprocessing to enable and explore PIP across more diverse experimental conditions and LC gradients. SWAPS substantially enhances precursor identification, especially in shorter gradients. On the example of 30, 15, and 7.5 min gradients, SWAPS achieves increases of 46.3, 86.2, and 112.1% on precursor level over MaxQuant’s MS2-based identifications. Despite the inherent challenges in controlling false discovery rates (FDR) with MS1-based methods, SWAPS demonstrates strong efficacy in deconvoluting MS1 signals, offering powerful discrimination and deeper sequence exploration, while maintaining quantitative accuracy. By building on and applying peptide property predictions in practical contexts, SWAPS reveals that current models, while advanced, are still not fully comparable to experimental measurements, sparking the need for further research. Additionally, its modular design allows seamless integration of future improvements, positioning SWAPS as a forward-looking tool in proteomics.</p>\",\"PeriodicalId\":48,\"journal\":{\"name\":\"Journal of Proteome Research\",\"volume\":\"24 4\",\"pages\":\"1926–1940 1926–1940\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2025-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/epdf/10.1021/acs.jproteome.4c00972\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Proteome Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.jproteome.4c00972\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jproteome.4c00972","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

摘要

基于质谱（MS）的蛋白质组学在很大程度上依赖于MS/MS （MS2）数据，不能充分利用现有的MS1信息。传统的多肽识别繁殖（PIP）方法，如序列间匹配（MBR），仅限于相似的序列，特别是在相同的液相色谱（LC）梯度下，因此可能未充分利用现有的蛋白质组学文库。我们引入了swap，这是一种以ms1为中心的新型模块化框架，结合了肽特性预测、广泛的蛋白质组学库和基于深度学习的后处理方面的进展，可以在更多样化的实验条件和LC梯度下实现和探索PIP。swap大大提高了前体识别，特别是在较短的梯度中。在梯度为30,15和7.5 min的例子中，swap在前体水平上比MaxQuant基于ms2的识别提高了46.3,86.2和112.1%。尽管基于MS1的方法在控制错误发现率（FDR）方面存在固有的挑战，但swap在反卷积MS1信号方面表现出强大的功效，在保持定量准确性的同时，提供强大的区分和更深层次的序列探索。通过在实际环境中建立和应用肽性质预测，swap表明，目前的模型虽然先进，但仍不能完全与实验测量相媲美，这引发了进一步研究的需要。此外，其模块化设计允许无缝集成未来的改进，将swap定位为蛋白质组学中的前瞻性工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run

Mass spectrometry (MS)-based proteomics relies heavily on MS/MS (MS2) data, which do not fully exploit the available MS1 information. Traditional peptide identity propagation (PIP) methods, such as match-between-runs (MBR), are limited to similar runs, particularly with the same liquid chromatography (LC) gradients, thus potentially underutilizing available proteomics libraries. We introduce SWAPS, a novel and modular MS1-centric framework incorporating advances in peptide property prediction, extensive proteomics libraries, and deep-learning-based postprocessing to enable and explore PIP across more diverse experimental conditions and LC gradients. SWAPS substantially enhances precursor identification, especially in shorter gradients. On the example of 30, 15, and 7.5 min gradients, SWAPS achieves increases of 46.3, 86.2, and 112.1% on precursor level over MaxQuant’s MS2-based identifications. Despite the inherent challenges in controlling false discovery rates (FDR) with MS1-based methods, SWAPS demonstrates strong efficacy in deconvoluting MS1 signals, offering powerful discrimination and deeper sequence exploration, while maintaining quantitative accuracy. By building on and applying peptide property predictions in practical contexts, SWAPS reveals that current models, while advanced, are still not fully comparable to experimental measurements, sparking the need for further research. Additionally, its modular design allows seamless integration of future improvements, positioning SWAPS as a forward-looking tool in proteomics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Proteome Research 生物-生化研究方法

CiteScore

9.00

自引率

4.50%

发文量

251

审稿时长

3 months

期刊介绍： Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".