Tyrosine Kinase Ligand-Receptor Pair Prediction by Using Support Vector Machine.

Q1 Biochemistry, Genetics and Molecular Biology
Advances in Bioinformatics Pub Date : 2015-01-01 Epub Date: 2015-08-11 DOI:10.1155/2015/528097
Masayuki Yarimizu, Cao Wei, Yusuke Komiyama, Kokoro Ueki, Shugo Nakamura, Kazuya Sumikoshi, Tohru Terada, Kentaro Shimizu
{"title":"Tyrosine Kinase Ligand-Receptor Pair Prediction by Using Support Vector Machine.","authors":"Masayuki Yarimizu,&nbsp;Cao Wei,&nbsp;Yusuke Komiyama,&nbsp;Kokoro Ueki,&nbsp;Shugo Nakamura,&nbsp;Kazuya Sumikoshi,&nbsp;Tohru Terada,&nbsp;Kentaro Shimizu","doi":"10.1155/2015/528097","DOIUrl":null,"url":null,"abstract":"<p><p>Receptor tyrosine kinases are essential proteins involved in cellular differentiation and proliferation in vivo and are heavily involved in allergic diseases, diabetes, and onset/proliferation of cancerous cells. Identifying the interacting partner of this protein, a growth factor ligand, will provide a deeper understanding of cellular proliferation/differentiation and other cell processes. In this study, we developed a method for predicting tyrosine kinase ligand-receptor pairs from their amino acid sequences. We collected tyrosine kinase ligand-receptor pairs from the Database of Interacting Proteins (DIP) and UniProtKB, filtered them by removing sequence redundancy, and used them as a dataset for machine learning and assessment of predictive performance. Our prediction method is based on support vector machines (SVMs), and we evaluated several input features suitable for tyrosine kinase for machine learning and compared and analyzed the results. Using sequence pattern information and domain information extracted from sequences as input features, we obtained 0.996 of the area under the receiver operating characteristic curve. This accuracy is higher than that obtained from general protein-protein interaction pair predictions. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2015 ","pages":"528097"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2015/528097","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2015/528097","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2015/8/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 4

Abstract

Receptor tyrosine kinases are essential proteins involved in cellular differentiation and proliferation in vivo and are heavily involved in allergic diseases, diabetes, and onset/proliferation of cancerous cells. Identifying the interacting partner of this protein, a growth factor ligand, will provide a deeper understanding of cellular proliferation/differentiation and other cell processes. In this study, we developed a method for predicting tyrosine kinase ligand-receptor pairs from their amino acid sequences. We collected tyrosine kinase ligand-receptor pairs from the Database of Interacting Proteins (DIP) and UniProtKB, filtered them by removing sequence redundancy, and used them as a dataset for machine learning and assessment of predictive performance. Our prediction method is based on support vector machines (SVMs), and we evaluated several input features suitable for tyrosine kinase for machine learning and compared and analyzed the results. Using sequence pattern information and domain information extracted from sequences as input features, we obtained 0.996 of the area under the receiver operating characteristic curve. This accuracy is higher than that obtained from general protein-protein interaction pair predictions.

Abstract Image

Abstract Image

Abstract Image

基于支持向量机的酪氨酸激酶配体-受体对预测。
受体酪氨酸激酶是参与体内细胞分化和增殖的必需蛋白,在过敏性疾病、糖尿病和癌细胞的发生/增殖中有重要作用。识别这种蛋白质的相互作用伙伴,一种生长因子配体,将提供对细胞增殖/分化和其他细胞过程的更深入了解。在这项研究中,我们开发了一种从氨基酸序列预测酪氨酸激酶配体-受体对的方法。我们从相互作用蛋白数据库(DIP)和UniProtKB中收集了酪氨酸激酶配体-受体对,通过去除序列冗余进行过滤,并将其用作机器学习和预测性能评估的数据集。我们的预测方法基于支持向量机(svm),我们评估了几种适合酪氨酸激酶机器学习的输入特征,并对结果进行了比较和分析。以序列模式信息和从序列中提取的域信息作为输入特征,得到接收者工作特征曲线下面积的0.996。这种准确度高于一般的蛋白质-蛋白质相互作用对预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Bioinformatics
Advances in Bioinformatics Biochemistry, Genetics and Molecular Biology-Biochemistry, Genetics and Molecular Biology (miscellaneous)
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信