Discrete representations of the protein Cα chain

Folding & design Pub Date : 1997-08-01 DOI:10.1016/S1359-0278(97)00031-X

Xavier F de la Cruz , Michael W Mahoney , Byungkook Lee

{"title":"Discrete representations of the protein Cα chain","authors":"Xavier F de la Cruz , Michael W Mahoney , Byungkook Lee","doi":"10.1016/S1359-0278(97)00031-X","DOIUrl":null,"url":null,"abstract":"<div>Background: When a large number of protein conformations are generated and screened, as in protein structure prediction studies, it is often advantageous to change the conformation in units of four consecutive residues at a time. The internal geometry of a chain of four consecutive Cα atoms is completely described by means of the three angles θ1, τ, and θ2, where τ is the virtual torsion angle defined by the four atoms and θ1 and θ2 are the virtual bond angles flanking the torsion angle on either side. In this paper, we examine the quality of the protein structures that can be obtained when they are represented by means of a set of discrete values for these angles (discrete states).Results: Different models were produced by selecting various different discrete states. The performance of these models was tested by rebuilding the Cα chains of 139 high-resolution nonhomologous protein structures using the build-up procedure of Park and Levitt. We find that the discrete state models introduce distortions at three levels, which can be measured by means of the ‘context-free’, ‘in-context’, and the overall root-mean-square deviation of the Cα coordinates (crms), respectively, and we find that these different levels of distortions are interrelated. As found by Park and Levitt, the overall crms decreases smoothly for most models with the complexity of the model. However, the decrease is significantly faster with our models than observed by Park and Levitt with their models. We also find that it is possible to choose models that perform considerably worse than expected from this smooth dependence on complexity.Conclusions: Of our models, the most suitable for use in initial protein folding studies appears to be model S8, in which the effective number of states available for a given residue quartet is 6.5. This model builds helices, β-strands, and coil/loop structures with approximately equal quality and gives the overall crms value of 1.9 Å on average with relatively little variation among the different proteins tried.</div>","PeriodicalId":79488,"journal":{"name":"Folding & design","volume":"2 4","pages":"Pages 223-234"},"PeriodicalIF":0.0000,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1359-0278(97)00031-X","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Folding & design","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S135902789700031X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Background: When a large number of protein conformations are generated and screened, as in protein structure prediction studies, it is often advantageous to change the conformation in units of four consecutive residues at a time. The internal geometry of a chain of four consecutive C_α atoms is completely described by means of the three angles θ1, τ, and θ2, where τ is the virtual torsion angle defined by the four atoms and θ1 and θ2 are the virtual bond angles flanking the torsion angle on either side. In this paper, we examine the quality of the protein structures that can be obtained when they are represented by means of a set of discrete values for these angles (discrete states).

Results: Different models were produced by selecting various different discrete states. The performance of these models was tested by rebuilding the C_α chains of 139 high-resolution nonhomologous protein structures using the build-up procedure of Park and Levitt. We find that the discrete state models introduce distortions at three levels, which can be measured by means of the ‘context-free’, ‘in-context’, and the overall root-mean-square deviation of the C_α coordinates (crms), respectively, and we find that these different levels of distortions are interrelated. As found by Park and Levitt, the overall crms decreases smoothly for most models with the complexity of the model. However, the decrease is significantly faster with our models than observed by Park and Levitt with their models. We also find that it is possible to choose models that perform considerably worse than expected from this smooth dependence on complexity.

Conclusions: Of our models, the most suitable for use in initial protein folding studies appears to be model S8, in which the effective number of states available for a given residue quartet is 6.5. This model builds helices, β-strands, and coil/loop structures with approximately equal quality and gives the overall crms value of 1.9 Å on average with relatively little variation among the different proteins tried.

查看原文本刊更多论文

蛋白质Cα链的离散表示

背景:当产生和筛选大量的蛋白质构象时，如在蛋白质结构预测研究中，一次以四个连续残基为单位改变构象通常是有利的。由四个连续的Cα原子组成的链的内部几何结构完全可以用三个角θ1、τ和θ2来描述，其中τ是由四个原子定义的虚扭角，θ1和θ2是两侧扭角两侧的虚键角。在本文中，我们研究了当它们由这些角度的一组离散值(离散状态)表示时可以获得的蛋白质结构的质量。结果:选取不同的离散状态产生不同的模型。通过使用Park和Levitt的构建程序重建139个高分辨率非同源蛋白结构的Cα链来测试这些模型的性能。我们发现离散状态模型在三个层次上引入了扭曲，这可以分别通过“上下文无关”、“上下文内”和Cα坐标(crms)的总体均方根偏差来测量，我们发现这些不同层次的扭曲是相互关联的。Park和Levitt发现，随着模型复杂度的增加，大多数模型的总体crm平滑下降。然而，我们的模型比帕克和莱维特用他们的模型观察到的下降速度要快得多。我们还发现，从这种对复杂性的平滑依赖中，有可能选择比预期执行得差得多的模型。结论:在我们的模型中，最适合用于初始蛋白质折叠研究的似乎是模型S8，其中给定残基四重奏的有效状态数为6.5。该模型构建了螺旋、β-链和线圈/环结构，质量大致相同，并给出了平均1.9 Å的总体crms值，不同蛋白质之间的差异相对较小。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Folding & design

自引率

0.00%

发文量