Xavier F de la Cruz , Michael W Mahoney , Byungkook Lee
{"title":"Discrete representations of the protein Cα chain","authors":"Xavier F de la Cruz , Michael W Mahoney , Byungkook Lee","doi":"10.1016/S1359-0278(97)00031-X","DOIUrl":null,"url":null,"abstract":"<div><p><strong>Background:</strong> When a large number of protein conformations are generated and screened, as in protein structure prediction studies, it is often advantageous to change the conformation in units of four consecutive residues at a time. The internal geometry of a chain of four consecutive C<sub><em>α</em></sub> atoms is completely described by means of the three angles <em>θ</em>1, <em>τ</em>, and <em>θ</em>2, where <em>τ</em> is the virtual torsion angle defined by the four atoms and <em>θ</em>1 and <em>θ</em>2 are the virtual bond angles flanking the torsion angle on either side. In this paper, we examine the quality of the protein structures that can be obtained when they are represented by means of a set of discrete values for these angles (discrete states).</p><p><strong>Results:</strong> Different models were produced by selecting various different discrete states. The performance of these models was tested by rebuilding the C<sub><em>α</em></sub> chains of 139 high-resolution nonhomologous protein structures using the build-up procedure of Park and Levitt. We find that the discrete state models introduce distortions at three levels, which can be measured by means of the ‘context-free’, ‘in-context’, and the overall root-mean-square deviation of the C<sub><em>α</em></sub> coordinates (crms), respectively, and we find that these different levels of distortions are interrelated. As found by Park and Levitt, the overall crms decreases smoothly for most models with the complexity of the model. However, the decrease is significantly faster with our models than observed by Park and Levitt with their models. We also find that it is possible to choose models that perform considerably worse than expected from this smooth dependence on complexity.</p><p><strong>Conclusions:</strong> Of our models, the most suitable for use in initial protein folding studies appears to be model S8, in which the effective number of states available for a given residue quartet is 6.5. This model builds helices, <em>β</em>-strands, and coil/loop structures with approximately equal quality and gives the overall crms value of 1.9 Å on average with relatively little variation among the different proteins tried.</p></div>","PeriodicalId":79488,"journal":{"name":"Folding & design","volume":"2 4","pages":"Pages 223-234"},"PeriodicalIF":0.0000,"publicationDate":"1997-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/S1359-0278(97)00031-X","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Folding & design","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S135902789700031X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Background: When a large number of protein conformations are generated and screened, as in protein structure prediction studies, it is often advantageous to change the conformation in units of four consecutive residues at a time. The internal geometry of a chain of four consecutive Cα atoms is completely described by means of the three angles θ1, τ, and θ2, where τ is the virtual torsion angle defined by the four atoms and θ1 and θ2 are the virtual bond angles flanking the torsion angle on either side. In this paper, we examine the quality of the protein structures that can be obtained when they are represented by means of a set of discrete values for these angles (discrete states).
Results: Different models were produced by selecting various different discrete states. The performance of these models was tested by rebuilding the Cα chains of 139 high-resolution nonhomologous protein structures using the build-up procedure of Park and Levitt. We find that the discrete state models introduce distortions at three levels, which can be measured by means of the ‘context-free’, ‘in-context’, and the overall root-mean-square deviation of the Cα coordinates (crms), respectively, and we find that these different levels of distortions are interrelated. As found by Park and Levitt, the overall crms decreases smoothly for most models with the complexity of the model. However, the decrease is significantly faster with our models than observed by Park and Levitt with their models. We also find that it is possible to choose models that perform considerably worse than expected from this smooth dependence on complexity.
Conclusions: Of our models, the most suitable for use in initial protein folding studies appears to be model S8, in which the effective number of states available for a given residue quartet is 6.5. This model builds helices, β-strands, and coil/loop structures with approximately equal quality and gives the overall crms value of 1.9 Å on average with relatively little variation among the different proteins tried.