P4PP: A Universal Shotgun Proteomics Data Analysis Pipeline for Virus Identification.

IF 5.5 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Molecular & Cellular Proteomics Pub Date : 2025-07-01 Epub Date: 2025-05-29 DOI:10.1016/j.mcpro.2025.101004
Armand Paauw, Evgeni Levin, Ingrid A I Voskamp-Visser, Ilka M F Marissen, Vincent Ramisse, Marine Eschlimann, Jiří Dresler, Petr Pajer, Christoph Stingl, Hans C van Leeuwen, Theo M Luider, Luc M Hornstra
{"title":"P4PP: A Universal Shotgun Proteomics Data Analysis Pipeline for Virus Identification.","authors":"Armand Paauw, Evgeni Levin, Ingrid A I Voskamp-Visser, Ilka M F Marissen, Vincent Ramisse, Marine Eschlimann, Jiří Dresler, Petr Pajer, Christoph Stingl, Hans C van Leeuwen, Theo M Luider, Luc M Hornstra","doi":"10.1016/j.mcpro.2025.101004","DOIUrl":null,"url":null,"abstract":"<p><p>Humans can be infected by a wide variety of virus species. We developed a data analysis approach for shotgun proteomic data to detect these viruses. A proteome for pandemic preparedness (P4PP) pipeline, a corresponding database (P4PP v01), and a web application (P4PP) were constructed. The P4PP pipeline enables the identification of 1896 virus species from the 32 virus families, based on multiple identified discriminatory peptides, in which at least one human infectious virus is described. P4PP was evaluated using different datasets of cell-cultivated viruses, generated at different institutes, measured with different instruments, and prepared with different sample preparation methods. In total, 174 mass spectrometry datasets of 160 and 14 protein trypsin digests of virus-infected and noninfected cell lines were analyzed, respectively. Of the 160 samples, 146 were correctly identified at the species level, and an additional four samples were identified at the family level. In the remaining 10 samples, no virus was detected. However, all these 10 samples tested positive in follow-up samples obtained later in time series were negative samples were measured, indicating that the number of peptides derived from the virus was initially too low in the samples obtained at the start of the experiment. Furthermore, results show that influenza A or severe acute respiratory syndrome coronavirus 2 can be subtyped if enough discriminative peptides of the virus are identified. In the noninfected cell lines, no virus was detected except in one sample where the in that experiment studied virus was detected. Shotgun proteomics, in combination with the developed data analysis approach, can identify all types of virus species after cultivation in a cell line. Implementing this agnostic virus proteome analysis capability in viral diagnostic laboratories has the potential to improve their capabilities to cope with unexpected, mutated, or re-emerging viruses.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101004"},"PeriodicalIF":5.5000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12418414/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2025.101004","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/29 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Humans can be infected by a wide variety of virus species. We developed a data analysis approach for shotgun proteomic data to detect these viruses. A proteome for pandemic preparedness (P4PP) pipeline, a corresponding database (P4PP v01), and a web application (P4PP) were constructed. The P4PP pipeline enables the identification of 1896 virus species from the 32 virus families, based on multiple identified discriminatory peptides, in which at least one human infectious virus is described. P4PP was evaluated using different datasets of cell-cultivated viruses, generated at different institutes, measured with different instruments, and prepared with different sample preparation methods. In total, 174 mass spectrometry datasets of 160 and 14 protein trypsin digests of virus-infected and noninfected cell lines were analyzed, respectively. Of the 160 samples, 146 were correctly identified at the species level, and an additional four samples were identified at the family level. In the remaining 10 samples, no virus was detected. However, all these 10 samples tested positive in follow-up samples obtained later in time series were negative samples were measured, indicating that the number of peptides derived from the virus was initially too low in the samples obtained at the start of the experiment. Furthermore, results show that influenza A or severe acute respiratory syndrome coronavirus 2 can be subtyped if enough discriminative peptides of the virus are identified. In the noninfected cell lines, no virus was detected except in one sample where the in that experiment studied virus was detected. Shotgun proteomics, in combination with the developed data analysis approach, can identify all types of virus species after cultivation in a cell line. Implementing this agnostic virus proteome analysis capability in viral diagnostic laboratories has the potential to improve their capabilities to cope with unexpected, mutated, or re-emerging viruses.

P4PP:用于病毒鉴定的通用散弹枪蛋白质组学数据分析管道。
人类可以被各种各样的病毒感染。我们开发了一种针对鸟枪蛋白质组学数据的数据分析方法来检测这些病毒。构建了大流行预防蛋白质组(P4PP)管道、相应的数据库(P4PP v01)和web应用程序(P4PP)。P4PP管道能够基于多个已鉴定的歧视性肽,从32个病毒科中鉴定出1896种病毒,其中至少描述了一种人类传染性病毒。采用不同的细胞培养病毒数据集、不同的研究所、不同的仪器和不同的样品制备方法对P4PP进行了评估。共分析了病毒感染细胞系160株和未感染细胞系14株蛋白胰蛋白酶酶切酶的174个MS数据集。在160个样本中,正确鉴定了物种水平的146个样本,正确鉴定了科水平的4个样本。在其余10个样本中,未检测到病毒。然而,在随后的时间序列中获得的后续样本中检测为阳性的这10个样本均为阴性样本,这表明在实验开始时获得的样本中,源自病毒的肽的数量最初过低。此外,研究结果表明,如果鉴定出足够的病毒区分肽,则可以对甲型流感或SARS-CoV-2进行亚型分型。在未感染的细胞系中,除了在一个样本中检测到该实验所研究的病毒外,未检测到病毒。霰弹枪蛋白质组学与开发的数据分析方法相结合,可以在细胞系中培养后识别所有类型的病毒物种。在病毒诊断实验室实施这种不可知的病毒蛋白质组分析能力,有可能提高他们应对意外、突变或重新出现的病毒的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular & Cellular Proteomics
Molecular & Cellular Proteomics 生物-生化研究方法
CiteScore
11.50
自引率
4.30%
发文量
131
审稿时长
84 days
期刊介绍: The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action. The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data. Scope: -Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights -Novel experimental and computational technologies -Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes -Pathway and network analyses of signaling that focus on the roles of post-translational modifications -Studies of proteome dynamics and quality controls, and their roles in disease -Studies of evolutionary processes effecting proteome dynamics, quality and regulation -Chemical proteomics, including mechanisms of drug action -Proteomics of the immune system and antigen presentation/recognition -Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease -Clinical and translational studies of human diseases -Metabolomics to understand functional connections between genes, proteins and phenotypes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信