{"title":"一种预测多位点蛋白亚细胞定位的新特征融合方法","authors":"Dong Wang, Shiyuan Han, Xumi Qu, Wenzheng Bao, Yuehui Chen, Yuling Fan, Jin Zhou","doi":"10.1109/ICCSS.2015.7281141","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel feature fusion method for the protein subcellular multiple-site localization prediction. Several types of features are employed in this novel protein coding method. The first one is the composition of amino acids. The second is pseudo amino acid composition, which mainly extract the location information of each amino acid residues in protein sequence. Lastly, the information for local sequence of amino acids is taken into consideration in this research. Generally, k nearest neighbor, supporting vector machine and other methods, has been used in the field of protein subcellular localization prediction. In our research, the multi-label k nearest neighbor algorithm has been employed in the classification model. The overall accuracy rate may reach 66.7304% in Gnos-mploc dataset.","PeriodicalId":299619,"journal":{"name":"2015 International Conference on Informative and Cybernetics for Computational Social Systems (ICCSS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A novel feature fusion method for predicting protein subcellular localization with multiple sites\",\"authors\":\"Dong Wang, Shiyuan Han, Xumi Qu, Wenzheng Bao, Yuehui Chen, Yuling Fan, Jin Zhou\",\"doi\":\"10.1109/ICCSS.2015.7281141\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a novel feature fusion method for the protein subcellular multiple-site localization prediction. Several types of features are employed in this novel protein coding method. The first one is the composition of amino acids. The second is pseudo amino acid composition, which mainly extract the location information of each amino acid residues in protein sequence. Lastly, the information for local sequence of amino acids is taken into consideration in this research. Generally, k nearest neighbor, supporting vector machine and other methods, has been used in the field of protein subcellular localization prediction. In our research, the multi-label k nearest neighbor algorithm has been employed in the classification model. The overall accuracy rate may reach 66.7304% in Gnos-mploc dataset.\",\"PeriodicalId\":299619,\"journal\":{\"name\":\"2015 International Conference on Informative and Cybernetics for Computational Social Systems (ICCSS)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Informative and Cybernetics for Computational Social Systems (ICCSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCSS.2015.7281141\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Informative and Cybernetics for Computational Social Systems (ICCSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSS.2015.7281141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A novel feature fusion method for predicting protein subcellular localization with multiple sites
This paper proposes a novel feature fusion method for the protein subcellular multiple-site localization prediction. Several types of features are employed in this novel protein coding method. The first one is the composition of amino acids. The second is pseudo amino acid composition, which mainly extract the location information of each amino acid residues in protein sequence. Lastly, the information for local sequence of amino acids is taken into consideration in this research. Generally, k nearest neighbor, supporting vector machine and other methods, has been used in the field of protein subcellular localization prediction. In our research, the multi-label k nearest neighbor algorithm has been employed in the classification model. The overall accuracy rate may reach 66.7304% in Gnos-mploc dataset.