PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences

Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang
{"title":"PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences","authors":"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang","doi":"10.2174/1874383800701010001","DOIUrl":null,"url":null,"abstract":"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.","PeriodicalId":88758,"journal":{"name":"The open spectroscopy journal","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open spectroscopy journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874383800701010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.
PepTiger:从从头序列中识别容错蛋白的搜索引擎
近年来,许多从头开始的测序软件产品成为可用的,为肽的MS/MS光谱提供可能的部分或完整氨基酸序列标签。然而,由于各种原因,包括光谱化学噪声和不完善的碎片化,这些序列标签几乎总是包含错误。额外的困难来自于实际的蛋白质序列变异和翻译后修饰。我们提出了一个名为PepTiger的搜索引擎,它能够正确地将有错误的从头序列标签与蛋白质数据库中的蛋白质序列进行匹配。该算法基于近似字符串匹配,然后是一种新的评分程序,该程序考虑了质量差异和新生序列与匹配肽之间的字符串距离以及理论和实验MS/MS谱之间的相似性。PepTiger与其他蛋白质鉴定软件的比较表明,PepTiger能够更好地将有错误的从头序列标签分配到正确的肽序列。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信