Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang
{"title":"PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences","authors":"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang","doi":"10.2174/1874383800701010001","DOIUrl":null,"url":null,"abstract":"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.","PeriodicalId":88758,"journal":{"name":"The open spectroscopy journal","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open spectroscopy journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874383800701010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.