{"title":"快速可靠地探索PDB宇宙寻求一种新的模板搜索算法","authors":"Sunil Nahata, Ashish Runthala","doi":"10.4172/2153-0602.1000206","DOIUrl":null,"url":null,"abstract":"Near-native protein structure prediction through Template Based Modelling (TBM) has been a major realistic goal of structural biology for several years. The TBM algorithms require the best-set of templates for a target protein sequence to maximally cover it and construct its correct topology. However, the accuracy of such prediction algorithms suffers from the algorithmic and logical problems of our template search measures which fail to quickly screen reliable structures for a target sequence. In this study, we employ the culled PDB95 dataset of 41,967 templates to predict the CASP10 target T0752 models for assessing the efficiency of the usually employ search engines PSI-BLAST and HHPred. Our analysis presents a detailed study in order to open new vistas for improving the accuracy of TBM prediction methodologies. It reveals weaknesses of most popular template search measures and thereby briefly provides a significant insight into the qualities of a foreseen template search algorithm to illustrate the need for a more reliable template search algorithm.","PeriodicalId":15630,"journal":{"name":"Journal of Data Mining in Genomics & Proteomics","volume":"52 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Quick Reliable Exploration of the PDB Universe Seeks a New Template Search Algorithm\",\"authors\":\"Sunil Nahata, Ashish Runthala\",\"doi\":\"10.4172/2153-0602.1000206\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Near-native protein structure prediction through Template Based Modelling (TBM) has been a major realistic goal of structural biology for several years. The TBM algorithms require the best-set of templates for a target protein sequence to maximally cover it and construct its correct topology. However, the accuracy of such prediction algorithms suffers from the algorithmic and logical problems of our template search measures which fail to quickly screen reliable structures for a target sequence. In this study, we employ the culled PDB95 dataset of 41,967 templates to predict the CASP10 target T0752 models for assessing the efficiency of the usually employ search engines PSI-BLAST and HHPred. Our analysis presents a detailed study in order to open new vistas for improving the accuracy of TBM prediction methodologies. It reveals weaknesses of most popular template search measures and thereby briefly provides a significant insight into the qualities of a foreseen template search algorithm to illustrate the need for a more reliable template search algorithm.\",\"PeriodicalId\":15630,\"journal\":{\"name\":\"Journal of Data Mining in Genomics & Proteomics\",\"volume\":\"52 1\",\"pages\":\"1-4\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Data Mining in Genomics & Proteomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4172/2153-0602.1000206\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Data Mining in Genomics & Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4172/2153-0602.1000206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Quick Reliable Exploration of the PDB Universe Seeks a New Template Search Algorithm
Near-native protein structure prediction through Template Based Modelling (TBM) has been a major realistic goal of structural biology for several years. The TBM algorithms require the best-set of templates for a target protein sequence to maximally cover it and construct its correct topology. However, the accuracy of such prediction algorithms suffers from the algorithmic and logical problems of our template search measures which fail to quickly screen reliable structures for a target sequence. In this study, we employ the culled PDB95 dataset of 41,967 templates to predict the CASP10 target T0752 models for assessing the efficiency of the usually employ search engines PSI-BLAST and HHPred. Our analysis presents a detailed study in order to open new vistas for improving the accuracy of TBM prediction methodologies. It reveals weaknesses of most popular template search measures and thereby briefly provides a significant insight into the qualities of a foreseen template search algorithm to illustrate the need for a more reliable template search algorithm.