Proteome Informatics: Latest Literature

Chapter 14. R for Proteomics

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00321

L. Breckels, Sebastian Gibb, V. Petyuk, L. Gatto

Citations: 1

Chapter 9. Informatics Solutions for Selected Reaction Monitoring

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00178

B. Schilling, B. MacLean, Jason M. Held, B. Gibson

Abstract: Selected reaction monitoring (SRM) assays pose several specific bioinformatics challenges, including assay development, generation of acquisition methods, and data processing. Furthermore, SRM is often coupled to experimental designs using stable isotope dilution SRM mass spectrometry workflows (SID-SRM-MS) that use one or more stable isotope versions of the analyte as internal standards. Skyline, an open-source software suite of tools for targeted proteomics, has emerged as the most widely used platform for SRM-specific assays. Skyline is a freely available, comprehensive tool with high versatility for SRM assay development and subsequent processing of data acquired on triple quadrupole mass spectrometers. Skyline can be used for peptide and transition selection, assay optimization, retention time scheduling, SRM instrument method export, peak detection/integration, post-acquisition signal processing, and integration with statistical tools and algorithms to generate quantitative results for peptides and proteins. To highlight some of the Skyline SRM functionalities, we describe features such as important visual displays and statistical tools, including ‘External Tools’. We discuss Skyline features that are particularly valuable for system suitability assessments, as well as for data sets with post-translational modifications. Finally, an easy, point-and-click strategy is presented that supports dissemination of SRM data processed in Skyline to the Panorama web data repositories.

Citations: 0

Chapter 3. Peptide Spectrum Matching via Database Search and Spectral Library Search

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00039

Brian Netzel, S. Dasari

Citations: 0

Chapter 12. OpenMS: A Modular, Open-Source Workflow System for the Analysis of Quantitative Proteomics Data

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00259

L. Nilse

Citations: 2

Chapter 5. Protein Inference and Grouping

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00093

A. Jones

Abstract: A key process in many proteomics workflows is the identification of proteins, following analysis of tandem MS (MS/MS) spectra, for example by a database search. The core unit of identification from a database search is the peptide, yet most researchers wish to know which proteins have been confidently identified in their samples. As such, following peptide identification, a second stage of data analysis called protein inference is performed, either internally in the search engine or in a second package. Protein inference is challenging in the common case that proteins have been digested into peptides early in the proteomics workflow, and thus there is no direct link between a peptide and its parent protein. Many peptides could theoretically have been derived from more than one protein in the database searched, and thus it is not straightforward to determine which is the correct assignment. A variety of algorithms and implementations have been developed, which are reviewed in this chapter. Most approaches now report “protein groups” as the core unit of identification from protein inference, since it is common for more than one database protein to share the same set of evidence, and thus be indistinguishable. The chapter also describes scoring and statistical values that can be assigned during the protein identification process, to give confidence in the resulting values.
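The idea of a protein group can be illustrated with a minimal sketch. It assumes the simplest possible rule, namely that proteins supported by exactly the same set of identified peptides are indistinguishable and are reported together. The function name, accessions, and peptide sequences below are hypothetical toy values, and the algorithms reviewed in the chapter are considerably more elaborate (handling subset proteins, scoring, and so on).

```python
from collections import defaultdict


def group_indistinguishable(protein_to_peptides):
    """Group proteins whose sets of identified peptides are identical.

    protein_to_peptides: dict mapping a protein accession to a set of
    peptide sequences (hypothetical input layout for this sketch).
    Returns a sorted list of (protein_accessions, shared_peptides) pairs.
    """
    groups = defaultdict(list)
    for protein, peptides in protein_to_peptides.items():
        # A frozenset is hashable, so identical evidence maps to the same key.
        groups[frozenset(peptides)].append(protein)
    # Sort members and groups so the output is deterministic.
    return sorted(
        (sorted(members), sorted(peptides))
        for peptides, members in groups.items()
    )


# Toy evidence: P1 and P2 share all of their peptides, P3 stands alone.
evidence = {
    "P1": {"PEPTIDEA", "PEPTIDEB"},
    "P2": {"PEPTIDEA", "PEPTIDEB"},
    "P3": {"PEPTIDEC"},
}
protein_groups = group_indistinguishable(evidence)
for members, peptides in protein_groups:
    print(members, peptides)
```

Keying on the peptide set keeps the sketch short; it deliberately ignores proteins whose evidence is only a subset of another protein's evidence.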

Citations: 0

Chapter 15. Proteogenomics: Proteomics for Genome Annotation

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00365

F. Ghali, A. Jones

Abstract: One of the major bottlenecks in omics biology is the generation of accurate gene models, including correct calling of the start codon, splicing of introns (taking account of alternative splicing), and the stop codon, collectively called genome annotation. Current genome annotation approaches for newly sequenced genomes are generally based on automated or semi-automated methods, usually involving gene finding software to look for intrinsic gene-like signatures (motifs) in the DNA sequence, the propagation of annotations from other (better annotated) related species, and the mapping of experimental data sets, particularly from RNA Sequencing (RNA-Seq). Large-scale proteomics data can also play an important role in confirming and correcting gene models. While proteomics approaches tend not to have the same level of sensitivity as RNA-Seq, they have the advantage that they can provide evidence that a predicted gene/transcript is indeed protein-coding. The use of proteomics data for genome annotation is called proteogenomics, and forms the basis for this chapter. We describe the theoretical underpinnings, different software packages that have been developed for proteogenomics, statistical approaches for validating the evidence, and support for proteogenomics data in file formats, standards and databases.

Citations: 0

Chapter 11. Data Formats of the Proteomics Standards Initiative

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00229

J. Vizcaíno, S. Perkins, A. Jones, E. Deutsch

Abstract: The existence and adoption of data standards in computational proteomics, as in any other field, is generally perceived to be crucial for the further development of the discipline. Here we give an up-to-date overview of the open standard data formats that have been developed under the umbrella of the Proteomics Standards Initiative (PSI). We focus on those formats related to mass spectrometry (MS). Most of them are based on XML (Extensible Markup Language) schemas: mzML (for primary MS data, the output of mass spectrometers), mzIdentML (for peptide and protein identification data), mzQuantML (for peptide and protein quantification data) and TraML (for reporting transition lists for selected reaction monitoring approaches). In addition, mzTab was developed as a simpler tab-delimited file to support peptide, protein and small molecule identification and quantification data in the same file. In all cases, we explain the main characteristics of each format, describe the main existing software implementations and give an update on the ongoing work to extend the formats to support new use cases. Additionally, we discuss other data formats that have been inspired by the PSI formats. Finally, other PSI data standard formats (not MS related) are also outlined in brief.
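To make the "simpler tab-delimited file" point concrete, the sketch below splits mzTab-style text into sections. It assumes the mzTab convention that each line begins with a short line-type code (for example MTD for metadata, PRH for the protein header, PRT for a protein row, and COM for comments). The function name and the toy content are hypothetical, and this is not a validating parser; the PSI specification remains the authority on the format.

```python
import csv
import io
from collections import defaultdict


def split_mztab_sections(text):
    """Collect tab-separated rows by their leading line-type code.

    Comment lines (code COM) and blank lines are skipped. The returned
    dict maps each code to a list of rows with the code itself removed.
    """
    sections = defaultdict(list)
    for row in csv.reader(io.StringIO(text), delimiter="\t"):
        if not row or row[0] == "COM":
            continue
        sections[row[0]].append(row[1:])
    return dict(sections)


# Toy content for illustration only; real files carry many more columns.
toy = (
    "COM\tillustrative example, not a complete mzTab file\n"
    "MTD\tmzTab-version\t1.0.0\n"
    "PRH\taccession\tdescription\n"
    "PRT\tP12345\tExample protein\n"
)
sections = split_mztab_sections(toy)
print(sections["PRT"])
```

Because every record is a single tab-separated line, the standard csv module is enough here, whereas the XML-based formats named above would need an XML parser.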

Citations: 0

Chapter 13. Using Galaxy for Proteomics

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00289

Candace R. Guerrero, P. Jagtap, James E. Johnson, T. Griffin

Abstract: The area of informatics for mass spectrometry (MS)-based proteomics data has steadily grown over the last two decades. Numerous effective software programs now exist for various aspects of proteomic informatics. However, many researchers still have difficulties in using this software. These difficulties arise from problems with running and integrating disparate software programs, scalability issues when dealing with large data volumes, and lack of ability to share and reproduce workflows composed of different software. The Galaxy framework for bioinformatics provides an attractive option for solving many of these current issues in proteomic informatics. Galaxy was originally developed as a workbench to enable genomic data analysis, and numerous researchers are now turning to it to implement software for MS-based proteomics applications. Here, we provide an introduction to Galaxy and its features, and describe how software tools are deployed, published and shared via the scalable framework. We also describe some of the existing tools in Galaxy for basic MS-based proteomics data analysis and informatics. Finally, we describe how proteomics tools in Galaxy can be combined with other existing tools for genomic and transcriptomic data analysis to enable powerful multi-omic data analysis applications.

Citations: 2

Chapter 16. Proteomics Informed by Transcriptomics

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00385

Shyamasree Saha, D. Matthews, C. Bessant

Citations: 1

Chapter 6. Identification and Localization of Post-Translational Modifications by High-Resolution Mass Spectrometry

Proteome Informatics Pub Date : 2016-11-15 DOI: 10.1039/9781782626732-00116

R. Matthiesen, A. S. Carvalho

Abstract: Cells, whether responding to a stimulus or in homeostasis, require dynamic signaling through alterations in protein composition. Identification and temporospatial profiling of post-translational modifications constitute one of the most challenging tasks in biology. These challenges comprise both experimental and computational aspects. From the computational point of view, identification of post-translational modifications by mass spectrometry analysis frequently leads to algorithms with exponential complexity, which in practice is addressed with algorithms of lower complexity. Regulation of post-translational modifications has been implicated in a number of diseases such as cancer, neurodegenerative diseases and metabolic diseases. Furthermore, some post-translational modifications are considered as biomarkers and surrogate markers. Consequently, there is a high interest in methodologies that can identify and quantify post-translational modifications. We found few papers addressing the issue of which modifications should be considered in a standard database-dependent search of MS data for protein analysis. Furthermore, the few papers on the topic are from a time when MS instruments with high precision in both MS and MS/MS were not available. Therefore, based on literature search and extensive analysis, we provide recommendations on post-translational modifications to be included in mass spectrometry database searches of MS data with high precision in both MS and MS/MS (e.g. <5 ppm).
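For readers unfamiliar with the unit, the precision figure quoted above is a relative mass error in parts per million (ppm). The sketch below shows the usual calculation and a tolerance check against a 5 ppm threshold. The function names and m/z values are hypothetical illustrations and are not taken from the chapter.

```python
def ppm_error(observed_mz, theoretical_mz):
    """Relative mass error in parts per million (ppm)."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def within_tolerance(observed_mz, theoretical_mz, tol_ppm=5.0):
    """True if the absolute mass error is strictly below tol_ppm."""
    return abs(ppm_error(observed_mz, theoretical_mz)) < tol_ppm


# Toy values: at m/z 1000, an offset of 0.003 is about 3 ppm,
# while an offset of 0.010 is about 10 ppm.
close_match = within_tolerance(1000.003, 1000.0)
poor_match = within_tolerance(1000.010, 1000.0)
print(close_match, poor_match)
```

Because the error is relative, the same ppm tolerance corresponds to a smaller absolute m/z window at low m/z than at high m/z.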

Citations: 0