Quantitative Proteogenomic Characterization of Inflamed Murine Colon Tissue Using an Integrated Discovery, Verification, and Validation Proteogenomic Workflow.
Andrew T Rajczewski, Qiyuan Han, Subina Mehta, Praveen Kumar, Pratik D Jagtap, Charles G Knutson, James G Fox, Natalia Y Tretyakova, Timothy J Griffin
{"title":"Quantitative Proteogenomic Characterization of Inflamed Murine Colon Tissue Using an Integrated Discovery, Verification, and Validation Proteogenomic Workflow.","authors":"Andrew T Rajczewski, Qiyuan Han, Subina Mehta, Praveen Kumar, Pratik D Jagtap, Charles G Knutson, James G Fox, Natalia Y Tretyakova, Timothy J Griffin","doi":"10.3390/proteomes10020011","DOIUrl":null,"url":null,"abstract":"<p><p>Chronic inflammation of the colon causes genomic and/or transcriptomic events, which can lead to expression of non-canonical protein sequences contributing to oncogenesis. To better understand these mechanisms, <i>Rag2</i><sup>-/-</sup><i>Il10</i><sup>-/-</sup> mice were infected with <i>Helicobacter hepaticus</i> to induce chronic inflammation of the cecum and the colon. Transcriptomic data from harvested proximal colon samples were used to generate a customized FASTA database containing non-canonical protein sequences. Using a proteogenomic approach, mass spectrometry data for proximal colon proteins were searched against this custom FASTA database using the Galaxy for Proteomics (Galaxy-P) platform. In addition to the increased abundance in inflammatory response proteins, we also discovered several non-canonical peptide sequences derived from unique proteoforms. We confirmed the veracity of these novel sequences using an automated bioinformatics verification workflow with targeted MS-based assays for peptide validation. Our bioinformatics discovery workflow identified 235 putative non-canonical peptide sequences, of which 58 were verified with high confidence and 39 were validated in targeted proteomics assays. This study provides insights into challenges faced when identifying non-canonical peptides using a proteogenomics approach and demonstrates an integrated workflow addressing these challenges. Our bioinformatic discovery and verification workflow is publicly available and accessible via the Galaxy platform and should be valuable in non-canonical peptide identification using proteogenomics.</p>","PeriodicalId":20877,"journal":{"name":"Proteomes","volume":null,"pages":null},"PeriodicalIF":4.0000,"publicationDate":"2022-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036229/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteomes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/proteomes10020011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Chronic inflammation of the colon causes genomic and/or transcriptomic events, which can lead to expression of non-canonical protein sequences contributing to oncogenesis. To better understand these mechanisms, Rag2-/-Il10-/- mice were infected with Helicobacter hepaticus to induce chronic inflammation of the cecum and the colon. Transcriptomic data from harvested proximal colon samples were used to generate a customized FASTA database containing non-canonical protein sequences. Using a proteogenomic approach, mass spectrometry data for proximal colon proteins were searched against this custom FASTA database using the Galaxy for Proteomics (Galaxy-P) platform. In addition to the increased abundance in inflammatory response proteins, we also discovered several non-canonical peptide sequences derived from unique proteoforms. We confirmed the veracity of these novel sequences using an automated bioinformatics verification workflow with targeted MS-based assays for peptide validation. Our bioinformatics discovery workflow identified 235 putative non-canonical peptide sequences, of which 58 were verified with high confidence and 39 were validated in targeted proteomics assays. This study provides insights into challenges faced when identifying non-canonical peptides using a proteogenomics approach and demonstrates an integrated workflow addressing these challenges. Our bioinformatic discovery and verification workflow is publicly available and accessible via the Galaxy platform and should be valuable in non-canonical peptide identification using proteogenomics.
结肠慢性炎症引起基因组和/或转录组事件,这可能导致非规范蛋白序列的表达,从而促进肿瘤的发生。为了更好地了解这些机制,Rag2−/−Il10−/−小鼠感染肝螺杆菌诱导盲肠和结肠的慢性炎症。来自近端结肠样本的转录组学数据用于生成包含非典型蛋白序列的定制FASTA数据库。使用蛋白质基因组学方法,使用Galaxy for Proteomics (Galaxy- p)平台对该自定义FASTA数据库检索近端结肠蛋白的质谱数据。除了炎症反应蛋白的丰度增加外,我们还发现了一些来自独特蛋白形态的非规范肽序列。我们使用自动化生物信息学验证工作流程和靶向MS-based肽验证分析来确认这些新序列的准确性。我们的生物信息学发现工作流程确定了235个假定的非规范肽序列,其中58个得到了高可信度的验证,39个在靶向蛋白质组学分析中得到了验证。本研究提供了使用蛋白质基因组学方法识别非规范肽时面临的挑战的见解,并展示了解决这些挑战的集成工作流程。我们的生物信息学发现和验证工作流程是公开的,可以通过Galaxy平台访问,在使用蛋白质基因组学鉴定非规范肽方面应该是有价值的。
ProteomesBiochemistry, Genetics and Molecular Biology-Clinical Biochemistry
CiteScore
6.50
自引率
3.00%
发文量
37
审稿时长
11 weeks
期刊介绍:
Proteomes (ISSN 2227-7382) is an open access, peer reviewed journal on all aspects of proteome science. Proteomes covers the multi-disciplinary topics of structural and functional biology, protein chemistry, cell biology, methodology used for protein analysis, including mass spectrometry, protein arrays, bioinformatics, HTS assays, etc. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. Therefore, there is no restriction on the length of papers. Scope: -whole proteome analysis of any organism -disease/pharmaceutical studies -comparative proteomics -protein-ligand/protein interactions -structure/functional proteomics -gene expression -methodology -bioinformatics -applications of proteomics