{"title":"Disulfide bonding state prediction with SVM based on protein types","authors":"Chih-Ying Lin, Chang-Biau Yang, Chiou-Yi Hor, Kuo-Si Huang","doi":"10.1109/BICTA.2010.5645282","DOIUrl":null,"url":null,"abstract":"Disulfide bonds play the key role for predicting the three-dimensional structure and the function of a protein. In this paper, we propose an algorithm for predicting the disulfide bonding state of each cysteine in a protein sequence. This method is based on the multi-stage framework and the multi-classifier of the support vector machine. We also design a new training strategy to increase the prediction accuracy. It appends the probabilities to the existing features and then starts a new training procedure repeatedly to improve performance. We perform the experiments on the data set derived from the well-known database Protein Data Bank (PDB). We get 94.2% accuracy for predicting disulfide bonding state, which gets improvement 3.5% compared with the previous best result 90.7%.","PeriodicalId":302619,"journal":{"name":"2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BICTA.2010.5645282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Disulfide bonds play the key role for predicting the three-dimensional structure and the function of a protein. In this paper, we propose an algorithm for predicting the disulfide bonding state of each cysteine in a protein sequence. This method is based on the multi-stage framework and the multi-classifier of the support vector machine. We also design a new training strategy to increase the prediction accuracy. It appends the probabilities to the existing features and then starts a new training procedure repeatedly to improve performance. We perform the experiments on the data set derived from the well-known database Protein Data Bank (PDB). We get 94.2% accuracy for predicting disulfide bonding state, which gets improvement 3.5% compared with the previous best result 90.7%.
二硫键在预测蛋白质的三维结构和功能方面起着关键作用。在本文中,我们提出了一种预测蛋白质序列中每个半胱氨酸的二硫键状态的算法。该方法基于支持向量机的多阶段框架和多分类器。我们还设计了一种新的训练策略来提高预测精度。它将概率附加到现有特征上,然后反复开始新的训练过程以提高性能。实验数据集来源于著名的蛋白质数据库(Protein data Bank, PDB)。对二硫键态的预测精度达到94.2%,比之前的最佳结果90.7%提高了3.5%。