{"title":"利用AA指数进行蛋白亚细胞定位预测的LVQ方法","authors":"Kok-Sin Toh, M. N. Nguyen, Jagath Rajapakse","doi":"10.1109/CIBCB.2005.1594932","DOIUrl":null,"url":null,"abstract":"Knowledge of subcellular localisation of proteins is important in determining their function and involvement in different pathways. A wide variety of methods has been proposed over the recent years in order to predict the subcellular localisation of proteins, mainly based on amino acid composition or single sequence inputs. We propose a Learning Vector Quantization (LVQ) method for protein subcellular localisation prediction based on N-terminal sorting signals by using the information derived from Amino Acid (AA) index database. The LVQ approach achieved overall prediction accuracies of 84.7% for 2427 eukaryotic protein sequences on Reinhardt and Hubbard dataset and upto 86.8% on the non-plant (eukaryotes) dataset of 2738 sequences from the TargetP website, which are comparable or better than the results of existing prediction methods.","PeriodicalId":330810,"journal":{"name":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"LVQ Approach Using AA Indices for Protein Subcellular Localisation Prediction\",\"authors\":\"Kok-Sin Toh, M. N. Nguyen, Jagath Rajapakse\",\"doi\":\"10.1109/CIBCB.2005.1594932\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowledge of subcellular localisation of proteins is important in determining their function and involvement in different pathways. A wide variety of methods has been proposed over the recent years in order to predict the subcellular localisation of proteins, mainly based on amino acid composition or single sequence inputs. We propose a Learning Vector Quantization (LVQ) method for protein subcellular localisation prediction based on N-terminal sorting signals by using the information derived from Amino Acid (AA) index database. The LVQ approach achieved overall prediction accuracies of 84.7% for 2427 eukaryotic protein sequences on Reinhardt and Hubbard dataset and upto 86.8% on the non-plant (eukaryotes) dataset of 2738 sequences from the TargetP website, which are comparable or better than the results of existing prediction methods.\",\"PeriodicalId\":330810,\"journal\":{\"name\":\"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2005.1594932\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2005.1594932","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
了解蛋白质的亚细胞定位对于确定它们的功能和参与不同的途径是重要的。近年来,为了预测蛋白质的亚细胞定位,提出了各种各样的方法,主要基于氨基酸组成或单序列输入。利用氨基酸(AA)索引数据库的信息,提出了一种基于n端排序信号的蛋白质亚细胞定位预测学习向量量化(LVQ)方法。LVQ方法对Reinhardt and Hubbard数据集上2427条真核蛋白序列的总体预测准确率为84.7%,对TargetP网站上2738条非植物(真核生物)数据集的总体预测准确率高达86.8%,与现有预测方法的结果相当或更好。
LVQ Approach Using AA Indices for Protein Subcellular Localisation Prediction
Knowledge of subcellular localisation of proteins is important in determining their function and involvement in different pathways. A wide variety of methods has been proposed over the recent years in order to predict the subcellular localisation of proteins, mainly based on amino acid composition or single sequence inputs. We propose a Learning Vector Quantization (LVQ) method for protein subcellular localisation prediction based on N-terminal sorting signals by using the information derived from Amino Acid (AA) index database. The LVQ approach achieved overall prediction accuracies of 84.7% for 2427 eukaryotic protein sequences on Reinhardt and Hubbard dataset and upto 86.8% on the non-plant (eukaryotes) dataset of 2738 sequences from the TargetP website, which are comparable or better than the results of existing prediction methods.