A Confidence Measure for Model Fitting with X-Ray Crystallography Data

Y. Lei, Ramgopal R. Mettu
{"title":"A Confidence Measure for Model Fitting with X-Ray Crystallography Data","authors":"Y. Lei, Ramgopal R. Mettu","doi":"10.1145/2506583.2506609","DOIUrl":null,"url":null,"abstract":"Structure determination from X-ray crystallography requires numerous stages of iterative refinement between real and reciprocal space. Current methods that fit a model structure to X-ray data therefore utilize a refined experimental electron density map along with a scoring function that characterizes the fit of the density map to structure. Additional information (e.g., from an energy function or conformational statistics) may supplement this score. In this paper, we derive a novel confidence measure for fitting model fragments into X-ray crystallography data. Given any set of conformations under consideration (e.g., a set of sidechain rotamers, or backbone fragments), and a scoring function for those conformations (e.g., least squares fit of the associated model density maps), we give a general-purpose method for assessing the confidence of the best-fit model. For the commonly used least-squares measure of fit, our method analyzes the statistics of the matching scores and estimates the probability that the best-fit conformation is the correct underlying model. To our knowledge, ours is the first method for computing such a confidence measure. To demonstrate the practical utility of our method, we study the problem of sidechain placement and show that our confidence measure can be used to detect and correct incorrect conformational predictions. Over nine proteins with density maps of varying resolutions, the Pearson correlation between predictive accuracy (of least-squares fit) and our confidence measure is quite high, about .89. We show that our approach can guide the use of stereochemical restraints when confidence is low in predictions. We also propose a Bayesian data fusion scheme that integrates our confidence measure to weight the contributon of each source of data, which could potentially be used for combining experimental, modeling, and empirical data in automated structure determination.","PeriodicalId":287007,"journal":{"name":"Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2506583.2506609","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Structure determination from X-ray crystallography requires numerous stages of iterative refinement between real and reciprocal space. Current methods that fit a model structure to X-ray data therefore utilize a refined experimental electron density map along with a scoring function that characterizes the fit of the density map to structure. Additional information (e.g., from an energy function or conformational statistics) may supplement this score. In this paper, we derive a novel confidence measure for fitting model fragments into X-ray crystallography data. Given any set of conformations under consideration (e.g., a set of sidechain rotamers, or backbone fragments), and a scoring function for those conformations (e.g., least squares fit of the associated model density maps), we give a general-purpose method for assessing the confidence of the best-fit model. For the commonly used least-squares measure of fit, our method analyzes the statistics of the matching scores and estimates the probability that the best-fit conformation is the correct underlying model. To our knowledge, ours is the first method for computing such a confidence measure. To demonstrate the practical utility of our method, we study the problem of sidechain placement and show that our confidence measure can be used to detect and correct incorrect conformational predictions. Over nine proteins with density maps of varying resolutions, the Pearson correlation between predictive accuracy (of least-squares fit) and our confidence measure is quite high, about .89. We show that our approach can guide the use of stereochemical restraints when confidence is low in predictions. We also propose a Bayesian data fusion scheme that integrates our confidence measure to weight the contributon of each source of data, which could potentially be used for combining experimental, modeling, and empirical data in automated structure determination.
x射线晶体学数据模型拟合的置信度度量
从x射线晶体学中确定结构需要在实空间和倒易空间之间进行多次迭代细化。因此,目前将模型结构与x射线数据拟合的方法利用了一个精炼的实验电子密度图以及一个表征密度图与结构拟合的评分函数。额外的信息(例如,从能量函数或构象统计)可以补充这个分数。在本文中,我们推导了一种新的置信度方法来拟合模型碎片到x射线晶体学数据中。给定考虑的任何构象集(例如,一组侧链转子,或主干片段),以及这些构象的评分函数(例如,相关模型密度图的最小二乘拟合),我们给出了评估最佳拟合模型置信度的通用方法。对于常用的最小二乘拟合度量,我们的方法分析匹配分数的统计数据,并估计最佳拟合构象是正确底层模型的概率。据我们所知,我们的方法是计算这种置信度度量的第一种方法。为了证明我们的方法的实用性,我们研究了侧链放置问题,并表明我们的置信度可以用来检测和纠正错误的构象预测。对于具有不同分辨率密度图的9种蛋白质,预测精度(最小二乘拟合)与我们的置信度测量之间的Pearson相关性相当高,约为0.89。我们表明,我们的方法可以指导使用立体化学约束时,信心是低的预测。我们还提出了一个贝叶斯数据融合方案,该方案集成了我们的置信度度量来加权每个数据源的贡献,这可能潜在地用于将实验、建模和经验数据结合起来进行自动化结构确定。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信