{"title":"Performance analysis of protein structure clustering techniques and CUDA implementation of RMSD computation","authors":"Luibaiba Muhammad Kunhi, K. Raju, N. Chiplunkar","doi":"10.1109/DISCOVER.2016.7806220","DOIUrl":null,"url":null,"abstract":"Knowledge of the 3-dimensional structure of proteins is an important aspect in the field of structure based drug design. Structure prediction algorithms generally operate by computationally generating a large number of protein structures known as decoys and selecting the best candidates from among them. This is done by clustering the decoy set to identify the best models. RMSD (Root Mean Square Deviation) is the metric used for measuring similarity between protein structures. As the number of decoys becomes larger, the huge computational time of RMSD calculation affects the overall performance. This paper is about the performance analysis done on clustering techniques of SPICKER, Calibur and Hierarchical Ward's clustering and also describes 2 methods for parallelizing RMSD computation.","PeriodicalId":383554,"journal":{"name":"2016 IEEE Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DISCOVER.2016.7806220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Knowledge of the 3-dimensional structure of proteins is an important aspect in the field of structure based drug design. Structure prediction algorithms generally operate by computationally generating a large number of protein structures known as decoys and selecting the best candidates from among them. This is done by clustering the decoy set to identify the best models. RMSD (Root Mean Square Deviation) is the metric used for measuring similarity between protein structures. As the number of decoys becomes larger, the huge computational time of RMSD calculation affects the overall performance. This paper is about the performance analysis done on clustering techniques of SPICKER, Calibur and Hierarchical Ward's clustering and also describes 2 methods for parallelizing RMSD computation.