Similarities and differences between nonhomologous proteins with similar folds: evaluation of threading strategies

Baohong Zhang , Lukasz Jaroszewski , Leszek Rychlewski , Adam Godzik
{"title":"Similarities and differences between nonhomologous proteins with similar folds: evaluation of threading strategies","authors":"Baohong Zhang ,&nbsp;Lukasz Jaroszewski ,&nbsp;Leszek Rychlewski ,&nbsp;Adam Godzik","doi":"10.1016/S1359-0278(97)00042-4","DOIUrl":null,"url":null,"abstract":"<div><p><strong>Background:</strong> There are many pairs and groups of proteins with similar folds and interaction patterns, but whose sequence similarity is below the threshold of easily recognizable sequence homology. The existence of multiple sequence solutions for a given fold has inspired fold prediction methods in which structural information from one protein is used to estimate the energy of another, putatively similar, structure.</p><p><strong>Results:</strong> A set of 68 pairs of proteins with similar folds and sequence identity in the 8–30% range is identified from the literature. For each pair, the energy of one protein, calculated using knowledge-based statistical potentials, is compared to the estimated energy, calculated with the same potentials but using the structural information (burial status and interaction pattern) of another protein with the same fold. Different energy estimates, corresponding to approximations used in various fold recognition algorithms, are calculated and compared to each other, as well as to the correct energy. It is shown that the local energy terms, based on burial and secondary structure preferences, can be reliably estimated with an accuracy close to 70%. At the same time, the two-body nonlocal energy loses over 60% of its value due to the repacking of the structure. Further approximations, such as the ‘frozen approximation’, can bring it to an essentially random value.</p><p><strong>Conclusions:</strong> Local energy terms could be used safely to improve fold recognition algorithms. To utilize pair interaction information, specially designed pair potentials and/or a self-consistent description of pair interactions is necessary.</p></div>","PeriodicalId":79488,"journal":{"name":"Folding & design","volume":"2 5","pages":"Pages 307-317"},"PeriodicalIF":0.0000,"publicationDate":"1997-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1359-0278(97)00042-4","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Folding & design","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1359027897000424","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26

Abstract

Background: There are many pairs and groups of proteins with similar folds and interaction patterns, but whose sequence similarity is below the threshold of easily recognizable sequence homology. The existence of multiple sequence solutions for a given fold has inspired fold prediction methods in which structural information from one protein is used to estimate the energy of another, putatively similar, structure.

Results: A set of 68 pairs of proteins with similar folds and sequence identity in the 8–30% range is identified from the literature. For each pair, the energy of one protein, calculated using knowledge-based statistical potentials, is compared to the estimated energy, calculated with the same potentials but using the structural information (burial status and interaction pattern) of another protein with the same fold. Different energy estimates, corresponding to approximations used in various fold recognition algorithms, are calculated and compared to each other, as well as to the correct energy. It is shown that the local energy terms, based on burial and secondary structure preferences, can be reliably estimated with an accuracy close to 70%. At the same time, the two-body nonlocal energy loses over 60% of its value due to the repacking of the structure. Further approximations, such as the ‘frozen approximation’, can bring it to an essentially random value.

Conclusions: Local energy terms could be used safely to improve fold recognition algorithms. To utilize pair interaction information, specially designed pair potentials and/or a self-consistent description of pair interactions is necessary.

具有相似折叠的非同源蛋白的异同:线程策略的评价
背景:有许多蛋白质对和蛋白质组具有相似的折叠和相互作用模式,但其序列相似性低于易于识别的序列同源性的阈值。给定折叠的多个序列解的存在启发了折叠预测方法,其中使用来自一个蛋白质的结构信息来估计另一个假定相似的结构的能量。结果:从文献中鉴定出68对具有相似折叠和序列同源性的蛋白,其同源性在8-30%范围内。对于每一对,使用基于知识的统计电位计算出的一个蛋白质的能量,与使用具有相同折叠的另一个蛋白质的结构信息(埋藏状态和相互作用模式)计算出的估计能量进行比较。计算不同的能量估计,对应于各种折叠识别算法中使用的近似,并相互比较,以及正确的能量。结果表明,基于埋藏和次级结构偏好的局部能量项可以可靠地估计,精度接近70%。同时,由于结构的重新包装,两体非定域能量损失超过60%。进一步的近似,如“冻结近似”,可以使它本质上是一个随机值。结论:局部能量项可以安全地用于改进折叠识别算法。为了利用成对相互作用信息,需要专门设计成对势和/或对成对相互作用的自一致描述。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信