SPIDER: software for protein identification from sequence tags with de novo sequencing error.

Proceedings. IEEE Computational Systems Bioinformatics Conference Pub Date : 2004-01-01 DOI:10.1109/csb.2004.1332434

Yonghua Han, Bin Ma, Kaizhong Zhang

引用次数: 30

Abstract

For the identification of novel proteins using MS/MS, de novo sequencing software computes one or several possible amino acid sequences (called sequence tags) for each MS/MS spectrum. Those tags are then used to match, accounting amino acid mutations, the sequences in a protein database. If the de novo sequencing gives correct tags, the homologs of the proteins can be identified by this approach and software such as MS-BLAST is available for the matching. However, de novo sequencing very often gives only partially correct tags. The most common error is that a segment of amino acids is replaced by another segment with approximately the same masses. We developed a new efficient algorithm to match sequence tags with errors to database sequences for the purpose of protein and peptide identification. A software package, SPIDER, was developed and made available on Internet for free public use. This paper describes the algorithms and features of the SPIDER software.

查看原文本刊更多论文

蜘蛛:软件蛋白质鉴定从序列标签与从头测序错误。

为了使用MS/MS鉴定新的蛋白质，de novo测序软件为每个MS/MS谱计算一个或几个可能的氨基酸序列(称为序列标签)。然后用这些标签来匹配，计算氨基酸突变，蛋白质数据库中的序列。如果从头测序给出了正确的标签，则可以通过这种方法识别蛋白质的同源物，并且可以使用MS-BLAST等软件进行匹配。然而，从头测序通常只能给出部分正确的标签。最常见的错误是一段氨基酸被另一段质量大致相同的氨基酸所取代。我们开发了一种新的高效算法，将序列标签与数据库序列相匹配，用于蛋白质和肽的鉴定。开发了一个软件包“信息平台”，并在互联网上提供给公众免费使用。本文介绍了SPIDER软件的算法和特点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings. IEEE Computational Systems Bioinformatics Conference

自引率

0.00%

发文量