pLoc-mPlant:通过将最优GO信息整合到一般PseAAC†中来预测多位置植物蛋白的亚细胞定位

IF 3.743 Q2 Biochemistry, Genetics and Molecular Biology
Xiang Cheng, Xuan Xiao and Kuo-Chen Chou
{"title":"pLoc-mPlant:通过将最优GO信息整合到一般PseAAC†中来预测多位置植物蛋白的亚细胞定位","authors":"Xiang Cheng, Xuan Xiao and Kuo-Chen Chou","doi":"10.1039/C7MB00267J","DOIUrl":null,"url":null,"abstract":"<p >One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. To realize this, it is indispensable to develop an automated method for fast and accurate identification of the subcellular locations of uncharacterized proteins. The current study is focused on plant protein subcellular location prediction based on the sequence information alone. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most of the existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex protein is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called “pLoc-mPlant” by extracting the optimal GO (Gene Ontology) information into the Chou's general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on the same stringent benchmark dataset indicated that the proposed pLoc-mPlant predictor is remarkably superior to iLoc-Plant, the state-of-the-art method for predicting plant protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mPlant/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.</p>","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":" 9","pages":" 1722-1727"},"PeriodicalIF":3.7430,"publicationDate":"2017-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00267J","citationCount":"184","resultStr":"{\"title\":\"pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC†\",\"authors\":\"Xiang Cheng, Xuan Xiao and Kuo-Chen Chou\",\"doi\":\"10.1039/C7MB00267J\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. To realize this, it is indispensable to develop an automated method for fast and accurate identification of the subcellular locations of uncharacterized proteins. The current study is focused on plant protein subcellular location prediction based on the sequence information alone. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most of the existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex protein is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called “pLoc-mPlant” by extracting the optimal GO (Gene Ontology) information into the Chou's general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on the same stringent benchmark dataset indicated that the proposed pLoc-mPlant predictor is remarkably superior to iLoc-Plant, the state-of-the-art method for predicting plant protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mPlant/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.</p>\",\"PeriodicalId\":90,\"journal\":{\"name\":\"Molecular BioSystems\",\"volume\":\" 9\",\"pages\":\" 1722-1727\"},\"PeriodicalIF\":3.7430,\"publicationDate\":\"2017-07-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1039/C7MB00267J\",\"citationCount\":\"184\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular BioSystems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00267j\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Biochemistry, Genetics and Molecular Biology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00267j","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 184

摘要

细胞生物化学的基本目标之一是确定蛋白质在细胞环境中组织它们的区室中的功能。为了实现这一点,开发一种快速准确地识别未表征蛋白质亚细胞位置的自动化方法是必不可少的。目前的研究主要集中在基于序列信息的植物蛋白亚细胞定位预测上。虽然在这方面作出了相当大的努力,但问题还远未解决。大多数现有的方法只能用于处理单位置的蛋白质。实际上,具有多位点的蛋白质可能具有一些特殊的生物学功能。这种多重蛋白对基础研究和药物设计都具有重要意义。利用多标签理论,将最优的GO (Gene Ontology)信息提取到Chou的通用PseAAC (Pseudo Amino Acid Composition)中,提出了一种新的预测器“pLoc-mPlant”。在同样严格的基准数据集上进行严格的交叉验证表明,所提出的pLoc-mPlant预测器明显优于iLoc-Plant, iLoc-Plant是预测植物蛋白亚细胞定位的最先进方法。为了最大限度地方便大多数实验科学家,在http://www.jci-bioinfo.cn/pLoc-mPlant/上建立了一个用户友好的网络服务器,用户可以很容易地得到他们想要的结果,而不需要经过复杂的数学运算。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC†

pLoc-mPlant: predict subcellular localization of multi-location plant proteins by incorporating the optimal GO information into general PseAAC†

One of the fundamental goals in cellular biochemistry is to identify the functions of proteins in the context of compartments that organize them in the cellular environment. To realize this, it is indispensable to develop an automated method for fast and accurate identification of the subcellular locations of uncharacterized proteins. The current study is focused on plant protein subcellular location prediction based on the sequence information alone. Although considerable efforts have been made in this regard, the problem is far from being solved yet. Most of the existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions. This kind of multiplex protein is particularly important for both basic research and drug design. Using the multi-label theory, we present a new predictor called “pLoc-mPlant” by extracting the optimal GO (Gene Ontology) information into the Chou's general PseAAC (Pseudo Amino Acid Composition). Rigorous cross-validation on the same stringent benchmark dataset indicated that the proposed pLoc-mPlant predictor is remarkably superior to iLoc-Plant, the state-of-the-art method for predicting plant protein subcellular localization. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new predictor has been established at http://www.jci-bioinfo.cn/pLoc-mPlant/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Molecular BioSystems
Molecular BioSystems 生物-生化与分子生物学
CiteScore
2.94
自引率
0.00%
发文量
0
审稿时长
2.6 months
期刊介绍: Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信