Interaction prediction of PDZ domains using a machine learning approach

Sibel Kalyoncu, O. Keskin, A. Gursoy
{"title":"Interaction prediction of PDZ domains using a machine learning approach","authors":"Sibel Kalyoncu, O. Keskin, A. Gursoy","doi":"10.1109/HIBIT.2010.5478896","DOIUrl":null,"url":null,"abstract":"Protein interaction domains play crucial roles in many complex cellular pathways. PDZ domains are one of the most common protein interaction domains. Prediction of binding specificity of PDZ domains by a computational manner could eliminate unnecessary, time-consuming experiments. In this study, interactions of PDZ domains are predicted by using a machine learning approach in which only primary sequences of PDZ domains and peptides are used. In order to encode feature vectors for each interaction, trigram frequencies of primary sequences of PDZ domains and corresponding peptides are calculated. After construction of numerical interaction dataset, we compared different classifiers and ended up with Random Forest (RF) algorithm which gave the top performance. We obtained very high prediction accuracy (91.4%) for binary interaction prediction which outperforms all previous similar methods.","PeriodicalId":215457,"journal":{"name":"2010 5th International Symposium on Health Informatics and Bioinformatics","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 5th International Symposium on Health Informatics and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HIBIT.2010.5478896","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Protein interaction domains play crucial roles in many complex cellular pathways. PDZ domains are one of the most common protein interaction domains. Prediction of binding specificity of PDZ domains by a computational manner could eliminate unnecessary, time-consuming experiments. In this study, interactions of PDZ domains are predicted by using a machine learning approach in which only primary sequences of PDZ domains and peptides are used. In order to encode feature vectors for each interaction, trigram frequencies of primary sequences of PDZ domains and corresponding peptides are calculated. After construction of numerical interaction dataset, we compared different classifiers and ended up with Random Forest (RF) algorithm which gave the top performance. We obtained very high prediction accuracy (91.4%) for binary interaction prediction which outperforms all previous similar methods.
使用机器学习方法的PDZ域交互预测
蛋白质相互作用域在许多复杂的细胞通路中起着至关重要的作用。PDZ结构域是最常见的蛋白质相互作用结构域之一。通过计算方式预测PDZ结构域的结合特异性可以消除不必要的,耗时的实验。在这项研究中,通过使用机器学习方法预测PDZ结构域的相互作用,其中仅使用PDZ结构域和肽的初级序列。为了编码每个相互作用的特征向量,计算了PDZ结构域和相应肽的初级序列的三元频率。在构建了数值交互数据集之后,我们比较了不同的分类器,最终选择了随机森林(Random Forest, RF)算法。我们获得了非常高的二元相互作用预测精度(91.4%),优于以往所有类似的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信