{"title":"A Hybrid Consensus and Clustering Method for Protein Structure Selection","authors":"Qingguo Wang, Yingzi Shang, Dong Xu","doi":"10.1109/ICTAI.2011.10","DOIUrl":null,"url":null,"abstract":"In protein tertiary structure prediction, a crucial step is to select near-native structures from a large number of predicted structural models. Over the years, many methods have been proposed for the protein structure selection problem. Despite significant advances, the discerning power of current approaches is still unsatisfactory. In this paper, we propose a new algorithm, CC-Select, that combines consensus with clustering techniques. Given a set of predicted models, CC-Select first calculates a consensus score for each structure based on its average pair wise structural similarity to other models. Then, similar structures are grouped into clusters using multidimensional scaling and clustering algorithms. In each cluster, the one with the highest consensus score is selected as a candidate model. Using extensive benchmark sets of a large collection of predicted models, we compare CC-Select with existing state-of-the-art quality assessment methods and show significant improvement.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2011.10","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In protein tertiary structure prediction, a crucial step is to select near-native structures from a large number of predicted structural models. Over the years, many methods have been proposed for the protein structure selection problem. Despite significant advances, the discerning power of current approaches is still unsatisfactory. In this paper, we propose a new algorithm, CC-Select, that combines consensus with clustering techniques. Given a set of predicted models, CC-Select first calculates a consensus score for each structure based on its average pair wise structural similarity to other models. Then, similar structures are grouped into clusters using multidimensional scaling and clustering algorithms. In each cluster, the one with the highest consensus score is selected as a candidate model. Using extensive benchmark sets of a large collection of predicted models, we compare CC-Select with existing state-of-the-art quality assessment methods and show significant improvement.