Combining sequence and structural profiles for protein solvent accessibility prediction.

Computational systems bioinformatics. Computational Systems Bioinformatics Conference Pub Date : 2008-01-01

Rajkumar Bondugula, Dong Xu

引用次数: 0

Abstract

Solvent accessibility is an important structural feature for a protein. We propose a new method for solvent accessibility prediction that uses known structure and sequence information more efficiently. We first estimate the relative solvent accessibility of the query protein using fuzzy mean operator from the solvent accessibilities of known structure fragments that have similar sequences to the query protein. We then integrate the estimated solvent accessibility and the position specific scoring matrix of the query protein using a neural network. We tested our method on a large data set consisting of 3386 non-redundant proteins. The comparison with other methods show slightly improved prediction accuracies with our method. The resulting system does need not be re-trained when new data is available. We incorporated our method into the MUPRED system, which is available as a web server at http://digbio.missouri.edu/mupred.

本刊更多论文

结合序列和结构特征预测蛋白质溶剂可及性。

溶剂亲和性是蛋白质的重要结构特征。我们提出了一种利用已知结构和序列信息更有效地预测溶剂可及性的新方法。我们首先利用与查询蛋白序列相似的已知结构片段的溶剂可及性，利用模糊平均算子估计查询蛋白的相对溶剂可及性。然后，我们使用神经网络整合估计的溶剂可及性和查询蛋白的位置特定评分矩阵。我们在包含3386个非冗余蛋白的大型数据集上测试了我们的方法。与其他方法的比较表明，本文方法的预测精度略有提高。当有新的数据可用时，生成的系统不需要重新训练。我们将我们的方法合并到MUPRED系统中，该系统作为web服务器可在http://digbio.missouri.edu/mupred上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational systems bioinformatics. Computational Systems Bioinformatics Conference

自引率

0.00%

发文量