Functional analysis of the Escherichia coli genome for members of the α/β hydrolase family

Li Zhang , Adam Godzik , Jeffrey Skolnick , Jacquelyn S. Fetrow
{"title":"Functional analysis of the Escherichia coli genome for members of the α/β hydrolase family","authors":"Li Zhang ,&nbsp;Adam Godzik ,&nbsp;Jeffrey Skolnick ,&nbsp;Jacquelyn S. Fetrow","doi":"10.1016/S1359-0278(98)00069-8","DOIUrl":null,"url":null,"abstract":"<div><p><strong>Background:</strong> Database-searching methods based on sequence similarity have become the most commonly used tools for characterizing newly sequenced proteins. Due to the often underestimated functional diversity in protein families and superfamilies, however, it is difficult to make the characterization specific and accurate. In this work, we have extended a method for active-site identification from predicted protein structures.</p><p><strong>Results:</strong> The structural conservation and variation of the active sites of the <em>α</em>/<em>β</em> hydrolases with known structures were studied. The similarities were incorporated into a three-dimensional motif that specifies essential requirements for the enzymatic functions. A threading algorithm was used to align 651 <em>Escherichia coli</em> open reading frames (ORFs) to one of the members of the <em>α</em>/<em>β</em> hydrolase fold family. These ORFs were then screened according to our three-dimensional motif and with an extra requirement that demands conservation of the key active-site residues among the proteins that bear significant sequence similarity to the ORFs. 17 ORFs from <em>E. coli</em> were predicted to have hydrolase activity and their putative active-site residues were identified. Most were in agreement with the experiments and results of other database-searching methods. The study further suggests that YHET_ECOLI, a hypothetical protein classified as a member of the UPF0017 family (an uncharacterized protein family), bears all the hallmarks of the <em>α</em>/<em>β</em> hydrolase family.</p><p><strong>Conclusions:</strong>The novel feature of our method is that it uses three-dimensional structural information for function prediction. The results demonstrate the importance and necessity of such a method to fill the gap between sequence alignment and function prediction; furthermore, the method provides a way to verify the structure predictions, which enables an expansion of the applicable scope of the threading algorithms.</p></div>","PeriodicalId":79488,"journal":{"name":"Folding & design","volume":"3 6","pages":"Pages 535-548"},"PeriodicalIF":0.0000,"publicationDate":"1998-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1359-0278(98)00069-8","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Folding & design","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1359027898000698","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24

Abstract

Background: Database-searching methods based on sequence similarity have become the most commonly used tools for characterizing newly sequenced proteins. Due to the often underestimated functional diversity in protein families and superfamilies, however, it is difficult to make the characterization specific and accurate. In this work, we have extended a method for active-site identification from predicted protein structures.

Results: The structural conservation and variation of the active sites of the α/β hydrolases with known structures were studied. The similarities were incorporated into a three-dimensional motif that specifies essential requirements for the enzymatic functions. A threading algorithm was used to align 651 Escherichia coli open reading frames (ORFs) to one of the members of the α/β hydrolase fold family. These ORFs were then screened according to our three-dimensional motif and with an extra requirement that demands conservation of the key active-site residues among the proteins that bear significant sequence similarity to the ORFs. 17 ORFs from E. coli were predicted to have hydrolase activity and their putative active-site residues were identified. Most were in agreement with the experiments and results of other database-searching methods. The study further suggests that YHET_ECOLI, a hypothetical protein classified as a member of the UPF0017 family (an uncharacterized protein family), bears all the hallmarks of the α/β hydrolase family.

Conclusions:The novel feature of our method is that it uses three-dimensional structural information for function prediction. The results demonstrate the importance and necessity of such a method to fill the gap between sequence alignment and function prediction; furthermore, the method provides a way to verify the structure predictions, which enables an expansion of the applicable scope of the threading algorithms.

大肠杆菌α/β水解酶家族成员基因组功能分析
背景:基于序列相似性的数据库检索方法已成为鉴定新测序蛋白质最常用的工具。然而,由于蛋白质家族和超家族的功能多样性经常被低估,很难使表征特异性和准确性。在这项工作中,我们扩展了一种从预测蛋白质结构中识别活性位点的方法。结果:研究了已知结构的α/β水解酶的结构守恒和活性位点的变化。这些相似性被整合到一个三维基序中,该基序指定了酶功能的基本要求。采用线程算法将651个大肠杆菌开放阅读框(orf)与α/β水解酶折叠家族的一个成员进行比对。然后根据我们的三维基序筛选这些orf,并有一个额外的要求,要求保留与orf序列具有显著相似性的蛋白质中的关键活性位点残基。预测大肠杆菌中有17个orf具有水解酶活性,并鉴定了它们的推定活性位点残基。大多数与其他数据库检索方法的实验和结果一致。该研究进一步表明,YHET_ECOLI是一种假想的蛋白,被归类为UPF0017家族(一个未被鉴定的蛋白家族)的成员,具有α/β水解酶家族的所有特征。结论:该方法的新颖之处在于利用三维结构信息进行功能预测。结果表明,该方法填补了序列比对与功能预测之间的空白,具有重要意义和必要性;此外,该方法还提供了一种验证结构预测的方法,从而扩大了线程算法的适用范围。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信