Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment

Q3 Biochemistry, Genetics and Molecular Biology

BMC Structural Biology Pub Date : 2014-04-15 DOI:10.1186/1472-6807-14-13

Renzhi Cao, Zheng Wang, Jianlin Cheng

{"title":"Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment","authors":"Renzhi Cao, Zheng Wang, Jianlin Cheng","doi":"10.1186/1472-6807-14-13","DOIUrl":null,"url":null,"abstract":"Protein model quality assessment is an essential component of generating and using protein structural models. During the Tenth Critical Assessment of Techniques for Protein Structure Prediction (CASP10), we developed and tested four automated methods (MULTICOM-REFINE, MULTICOM-CLUSTER, MULTICOM-NOVEL, and MULTICOM-CONSTRUCT) that predicted both local and global quality of protein structural models.MULTICOM-REFINE was a clustering approach that used the average pairwise structural similarity between models to measure the global quality and the average Euclidean distance between a model and several top ranked models to measure the local quality. MULTICOM-CLUSTER and MULTICOM-NOVEL were two new support vector machine-based methods of predicting both the local and global quality of a single protein model. MULTICOM-CONSTRUCT was a new weighted pairwise model comparison (clustering) method that used the weighted average similarity between models in a pool to measure the global model quality. Our experiments showed that the pairwise model assessment methods worked better when a large portion of models in the pool were of good quality, whereas single-model quality assessment methods performed better on some hard targets when only a small portion of models in the pool were of reasonable quality.Since digging out a few good models from a large pool of low-quality models is a major challenge in protein structure prediction, single model quality assessment methods appear to be poised to make important contributions to protein structure modeling. The other interesting finding was that single-model quality assessment scores could be used to weight the models by the consensus pairwise model comparison method to improve its accuracy.","PeriodicalId":51240,"journal":{"name":"BMC Structural Biology","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2014-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-14-13","citationCount":"44","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Structural Biology","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/1472-6807-14-13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}

引用次数: 44

Abstract

Protein model quality assessment is an essential component of generating and using protein structural models. During the Tenth Critical Assessment of Techniques for Protein Structure Prediction (CASP10), we developed and tested four automated methods (MULTICOM-REFINE, MULTICOM-CLUSTER, MULTICOM-NOVEL, and MULTICOM-CONSTRUCT) that predicted both local and global quality of protein structural models.

MULTICOM-REFINE was a clustering approach that used the average pairwise structural similarity between models to measure the global quality and the average Euclidean distance between a model and several top ranked models to measure the local quality. MULTICOM-CLUSTER and MULTICOM-NOVEL were two new support vector machine-based methods of predicting both the local and global quality of a single protein model. MULTICOM-CONSTRUCT was a new weighted pairwise model comparison (clustering) method that used the weighted average similarity between models in a pool to measure the global model quality. Our experiments showed that the pairwise model assessment methods worked better when a large portion of models in the pool were of good quality, whereas single-model quality assessment methods performed better on some hard targets when only a small portion of models in the pool were of reasonable quality.

Since digging out a few good models from a large pool of low-quality models is a major challenge in protein structure prediction, single model quality assessment methods appear to be poised to make important contributions to protein structure modeling. The other interesting finding was that single-model quality assessment scores could be used to weight the models by the consensus pairwise model comparison method to improve its accuracy.

Abstract Image

查看原文本刊更多论文

在CASP10实验中设计和评价MULTICOM蛋白局部和全局模型质量预测方法

蛋白质模型质量评估是生成和使用蛋白质结构模型的重要组成部分。在第十届蛋白质结构预测技术关键评估(CASP10)期间，我们开发并测试了四种自动化方法(MULTICOM-REFINE、MULTICOM-CLUSTER、MULTICOM-NOVEL和MULTICOM-CONSTRUCT)，这些方法预测了蛋白质结构模型的局部和全局质量。MULTICOM-REFINE是一种聚类方法，它使用模型之间的平均两两结构相似度来衡量整体质量，使用模型与几个排名靠前的模型之间的平均欧几里得距离来衡量局部质量。MULTICOM-CLUSTER和MULTICOM-NOVEL是两种基于支持向量机的预测单个蛋白质模型局部和全局质量的新方法。MULTICOM-CONSTRUCT是一种新的加权两两模型比较(聚类)方法，它使用池中模型之间的加权平均相似度来衡量整体模型质量。我们的实验表明，当池中大部分模型质量良好时，两两模型评估方法效果更好，而当池中只有一小部分模型质量合理时，单模型质量评估方法在一些硬目标上表现更好。由于从大量低质量模型中挖掘出几个好的模型是蛋白质结构预测的主要挑战，单一模型质量评估方法似乎有望为蛋白质结构建模做出重要贡献。另一个有趣的发现是，单模型质量评估分数可以通过共识两两模型比较方法来对模型进行加权，以提高其准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Structural Biology BIOPHYSICS-

CiteScore

3.60

自引率

0.00%

发文量

审稿时长

>12 weeks

期刊介绍： BMC Structural Biology is an open access, peer-reviewed journal that considers articles on investigations into the structure of biological macromolecules, including solving structures, structural and functional analyses, and computational modeling.