Clustering and percolation in protein loop structures

IF 2.222 Q3 Biochemistry, Genetics and Molecular Biology
Xubiao Peng, Jianfeng He, Antti J. Niemi
{"title":"Clustering and percolation in protein loop structures","authors":"Xubiao Peng,&nbsp;Jianfeng He,&nbsp;Antti J. Niemi","doi":"10.1186/s12900-015-0049-x","DOIUrl":null,"url":null,"abstract":"<p>High precision protein loop modelling remains a challenge, both in template based and template independent approaches to protein structure prediction.</p><p>We introduce the concepts of protein loop clustering and percolation, to develop a quantitative approach to systematically classify the modular building blocks of loops in crystallographic folded proteins. These fragments are all different parameterisations of a unique kink solution to a generalised discrete nonlinear Schr?dinger (DNLS) equation. Accordingly, the fragments are also local energy minima of the ensuing energy function.</p><p>We show how the loop fragments cover practically all ultrahigh resolution crystallographic protein structures in Protein Data Bank (PDB), with a 0.2 ?ngstr?m root-mean-square (RMS) precision. We find that no more than 12 different loop fragments are needed, to describe around 38 % of ultrahigh resolution loops in PDB. But there is also a large number of loop fragments that are either unique, or very rare, and examples of unique fragments are found even in the structure of a myoglobin.</p><p>Protein loops are built in a modular fashion. The loops are composed of fragments that can be modelled by the kink of the DNLS equation. The majority of loop fragments are also common, which are shared by many proteins. These common fragments are probably important for supporting the overall protein conformation. But there are also several fragments that are either unique to a given protein, or very rare. Such fragments are probably related to the function of the protein. Furthermore, we have found that the amino acid sequence does not determine the structure in a unique fashion. There are many examples of loop fragments with an identical amino acid sequence, but with a very different structure.</p>","PeriodicalId":498,"journal":{"name":"BMC Structural Biology","volume":"15 1","pages":""},"PeriodicalIF":2.2220,"publicationDate":"2015-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s12900-015-0049-x","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Structural Biology","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/s12900-015-0049-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 9

Abstract

High precision protein loop modelling remains a challenge, both in template based and template independent approaches to protein structure prediction.

We introduce the concepts of protein loop clustering and percolation, to develop a quantitative approach to systematically classify the modular building blocks of loops in crystallographic folded proteins. These fragments are all different parameterisations of a unique kink solution to a generalised discrete nonlinear Schr?dinger (DNLS) equation. Accordingly, the fragments are also local energy minima of the ensuing energy function.

We show how the loop fragments cover practically all ultrahigh resolution crystallographic protein structures in Protein Data Bank (PDB), with a 0.2 ?ngstr?m root-mean-square (RMS) precision. We find that no more than 12 different loop fragments are needed, to describe around 38 % of ultrahigh resolution loops in PDB. But there is also a large number of loop fragments that are either unique, or very rare, and examples of unique fragments are found even in the structure of a myoglobin.

Protein loops are built in a modular fashion. The loops are composed of fragments that can be modelled by the kink of the DNLS equation. The majority of loop fragments are also common, which are shared by many proteins. These common fragments are probably important for supporting the overall protein conformation. But there are also several fragments that are either unique to a given protein, or very rare. Such fragments are probably related to the function of the protein. Furthermore, we have found that the amino acid sequence does not determine the structure in a unique fashion. There are many examples of loop fragments with an identical amino acid sequence, but with a very different structure.

Abstract Image

蛋白质环结构中的聚类和渗透
无论是基于模板还是不依赖模板的蛋白质结构预测方法,高精度的蛋白质环建模都是一个挑战。我们引入了蛋白质环聚类和渗透的概念,以开发一种定量方法来系统地分类晶体折叠蛋白质中环的模块构建块。这些片段都是广义离散非线性Schr?的唯一扭结解的不同参数化。丁格方程。因此,碎片也是后续能量函数的局部能量最小值。我们展示了环路片段如何覆盖蛋白质数据库(PDB)中几乎所有的超高分辨率晶体学蛋白质结构,具有0.2 ?均方根(RMS)精度。我们发现不需要超过12个不同的环路片段,就可以描述PDB中大约38%的超高分辨率环路。但也有大量的环片段是独特的,或者非常罕见的,甚至在肌红蛋白的结构中也发现了独特片段的例子。蛋白质环是以模块化的方式构建的。这些环是由片段组成的,这些片段可以通过DNLS方程的扭结来建模。大多数环片段也是共同的,这是许多蛋白质共享的。这些共同的片段可能对支持整个蛋白质构象很重要。但也有一些片段对特定的蛋白质来说是独一无二的,或者非常罕见。这些片段可能与蛋白质的功能有关。此外,我们发现氨基酸序列并不以独特的方式决定结构。有许多环片段具有相同的氨基酸序列,但具有非常不同的结构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Structural Biology
BMC Structural Biology 生物-生物物理
CiteScore
3.60
自引率
0.00%
发文量
0
期刊介绍: BMC Structural Biology is an open access, peer-reviewed journal that considers articles on investigations into the structure of biological macromolecules, including solving structures, structural and functional analyses, and computational modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信