{"title":"一种新的流形学习算法预测蛋白质的四级结构","authors":"Tong Wang, Jian Wang, Lixiu Yao","doi":"10.1109/ISA.2011.5873419","DOIUrl":null,"url":null,"abstract":"With the explosion of protein sequences generated in the Post-Genomic Age, it is urgent to develop an automated method to predict protein quaternary structure. To explore this problem, we adopted an approach based on a sequence encoding descriptor by fusing PseAA (Pseudo Amino Acid) and DC (Dipeptide Composition) representing a protein sample. Here, a completely different approach, manifold learning algorithm MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional feature space. The dimension-reduced descriptor vector thus obtained is a compact representation of the original high dimensional vector. Our jackknife test results indicate that it is very promising to use the dimensionality reduction approaches to cope with complicated problems in biological systems, such as predicting the quaternary structure of proteins.","PeriodicalId":128163,"journal":{"name":"2011 3rd International Workshop on Intelligent Systems and Applications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of Protein Quaternary Structure by a Novel Manifold Learning Algorithm\",\"authors\":\"Tong Wang, Jian Wang, Lixiu Yao\",\"doi\":\"10.1109/ISA.2011.5873419\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the explosion of protein sequences generated in the Post-Genomic Age, it is urgent to develop an automated method to predict protein quaternary structure. To explore this problem, we adopted an approach based on a sequence encoding descriptor by fusing PseAA (Pseudo Amino Acid) and DC (Dipeptide Composition) representing a protein sample. Here, a completely different approach, manifold learning algorithm MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional feature space. The dimension-reduced descriptor vector thus obtained is a compact representation of the original high dimensional vector. Our jackknife test results indicate that it is very promising to use the dimensionality reduction approaches to cope with complicated problems in biological systems, such as predicting the quaternary structure of proteins.\",\"PeriodicalId\":128163,\"journal\":{\"name\":\"2011 3rd International Workshop on Intelligent Systems and Applications\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 3rd International Workshop on Intelligent Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISA.2011.5873419\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 3rd International Workshop on Intelligent Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISA.2011.5873419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Protein Quaternary Structure by a Novel Manifold Learning Algorithm
With the explosion of protein sequences generated in the Post-Genomic Age, it is urgent to develop an automated method to predict protein quaternary structure. To explore this problem, we adopted an approach based on a sequence encoding descriptor by fusing PseAA (Pseudo Amino Acid) and DC (Dipeptide Composition) representing a protein sample. Here, a completely different approach, manifold learning algorithm MVP (Maximum variance projection) is introduced to extract the key features from the high-dimensional feature space. The dimension-reduced descriptor vector thus obtained is a compact representation of the original high dimensional vector. Our jackknife test results indicate that it is very promising to use the dimensionality reduction approaches to cope with complicated problems in biological systems, such as predicting the quaternary structure of proteins.