PepTiger:从从头序列中识别容错蛋白的搜索引擎

The open spectroscopy journal Pub Date : 2007-09-11 DOI:10.2174/1874383800701010001

Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang

{"title":"PepTiger:从从头序列中识别容错蛋白的搜索引擎","authors":"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang","doi":"10.2174/1874383800701010001","DOIUrl":null,"url":null,"abstract":"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.","PeriodicalId":88758,"journal":{"name":"The open spectroscopy journal","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences\",\"authors\":\"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang\",\"doi\":\"10.2174/1874383800701010001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.\",\"PeriodicalId\":88758,\"journal\":{\"name\":\"The open spectroscopy journal\",\"volume\":\"1 1\",\"pages\":\"1-8\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The open spectroscopy journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/1874383800701010001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open spectroscopy journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874383800701010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

近年来，许多从头开始的测序软件产品成为可用的，为肽的MS/MS光谱提供可能的部分或完整氨基酸序列标签。然而，由于各种原因，包括光谱化学噪声和不完善的碎片化，这些序列标签几乎总是包含错误。额外的困难来自于实际的蛋白质序列变异和翻译后修饰。我们提出了一个名为PepTiger的搜索引擎，它能够正确地将有错误的从头序列标签与蛋白质数据库中的蛋白质序列进行匹配。该算法基于近似字符串匹配，然后是一种新的评分程序，该程序考虑了质量差异和新生序列与匹配肽之间的字符串距离以及理论和实验MS/MS谱之间的相似性。PepTiger与其他蛋白质鉴定软件的比较表明，PepTiger能够更好地将有错误的从头序列标签分配到正确的肽序列。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences

In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The open spectroscopy journal

自引率

0.00%

发文量