A search method for homologs of small proteins. Ubiquitin-like proteins in prokaryotic cells?

Protein engineering Pub Date : 2003-12-01 DOI:10.1093/protein/gzg130

Jadwiga R Bienkowska, Hyman Hartman, Temple F Smith

{"title":"A search method for homologs of small proteins. Ubiquitin-like proteins in prokaryotic cells?","authors":"Jadwiga R Bienkowska, Hyman Hartman, Temple F Smith","doi":"10.1093/protein/gzg130","DOIUrl":null,"url":null,"abstract":"<p><p>The question of protein homology versus analogy arises when proteins share a common function or a common structural fold without any statistically significant amino acid sequence similarity. Even though two or more proteins do not have similar sequences but share a common fold and the same or closely related function, they are assumed to be homologs, descendant from a common ancestor. The problem of homolog identification is compounded in the case of proteins of 100 or less amino acids. This is due to a limited number of basic single domain folds and to a likelihood of identifying by chance sequence similarity. The latter arises from two conditions: first, any search of the currently very large protein database is likely to identify short regions of chance match; secondly, a direct sequence comparison among a small set of short proteins sharing a similar fold can detect many similar patterns of hydrophobicity even if proteins do not descend from a common ancestor. In an effort to identify distant homologs of the many ubiquitin proteins, we have developed a combined structure and sequence similarity approach that attempts to overcome the above limitations of homolog identification. This approach results in the identification of 90 probable ubiquitin-related proteins, including examples from the two prokaryotic domains of life, Archaea and Bacteria.</p>","PeriodicalId":20902,"journal":{"name":"Protein engineering","volume":"16 12","pages":"897-904"},"PeriodicalIF":0.0000,"publicationDate":"2003-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/protein/gzg130","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Protein engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/protein/gzg130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 21

Abstract

The question of protein homology versus analogy arises when proteins share a common function or a common structural fold without any statistically significant amino acid sequence similarity. Even though two or more proteins do not have similar sequences but share a common fold and the same or closely related function, they are assumed to be homologs, descendant from a common ancestor. The problem of homolog identification is compounded in the case of proteins of 100 or less amino acids. This is due to a limited number of basic single domain folds and to a likelihood of identifying by chance sequence similarity. The latter arises from two conditions: first, any search of the currently very large protein database is likely to identify short regions of chance match; secondly, a direct sequence comparison among a small set of short proteins sharing a similar fold can detect many similar patterns of hydrophobicity even if proteins do not descend from a common ancestor. In an effort to identify distant homologs of the many ubiquitin proteins, we have developed a combined structure and sequence similarity approach that attempts to overcome the above limitations of homolog identification. This approach results in the identification of 90 probable ubiquitin-related proteins, including examples from the two prokaryotic domains of life, Archaea and Bacteria.

查看原文本刊更多论文

小分子蛋白同源物的搜索方法。原核细胞中的泛素样蛋白?

当蛋白质具有共同的功能或共同的结构折叠而没有任何统计上显着的氨基酸序列相似性时，就会出现蛋白质同源性与类比性的问题。即使两个或两个以上的蛋白质没有相似的序列，但具有共同的折叠和相同或密切相关的功能，它们也被认为是同源的，来自共同祖先的后代。在含有100个或更少氨基酸的蛋白质的情况下，同源性鉴定的问题更加复杂。这是由于有限数量的基本单域折叠和偶然序列相似性识别的可能性。后者产生于两个条件:首先，对目前非常大的蛋白质数据库的任何搜索都可能识别出偶然匹配的短区域;其次，在一小组具有相似折叠的短蛋白之间进行直接序列比较可以检测到许多相似的疏水性模式，即使蛋白质不是来自一个共同的祖先。为了鉴定许多泛素蛋白的远端同源物，我们开发了一种结合结构和序列相似性的方法，试图克服同源物鉴定的上述限制。这种方法鉴定了90种可能的泛素相关蛋白，包括来自两个原核生物领域的例子，古细菌和细菌。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Protein engineering

自引率

0.00%

发文量