{"title":"线粒体组织下酿酒酵母蛋白质的集成预测","authors":"D. Sumanaweera, Amal Perera","doi":"10.1109/BIBE.2016.61","DOIUrl":null,"url":null,"abstract":"Protein function annotation is vital for identifying disease causative factors and for solving mysteries behind biological system complexities. As manual annotation requires costly and laborious in vitro methods, in silico protein function prediction is preferred nowadays. According to literature, one in five yeast mitochondrial proteins are known to be human disease related. This paper presents a genetic algorithmically weighted heterogeneous data ensemble to classify Saccharomyces cerevisiae proteins under 'mitochondrion organization'(GO:0007005) function defined in Gene Ontology. It consists of five euclidean-distance based nearest neighbour models and three affinity-based neighbourhood models, utilizing protein properties data, four gene expression datasets and protein interactions. The overall prediction is the weighted average of the posterior probability outputs given by the base models. Weights are determined by the standard Genetic Algorithm. The constituted base models show a fair agreement and improve the best performing base classifier by ~ 14.3%.","PeriodicalId":377504,"journal":{"name":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Ensemble Based in Silico Prediction of Saccharomyces Cerevisiae Proteins under Mitochondrion Organization\",\"authors\":\"D. Sumanaweera, Amal Perera\",\"doi\":\"10.1109/BIBE.2016.61\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Protein function annotation is vital for identifying disease causative factors and for solving mysteries behind biological system complexities. As manual annotation requires costly and laborious in vitro methods, in silico protein function prediction is preferred nowadays. According to literature, one in five yeast mitochondrial proteins are known to be human disease related. This paper presents a genetic algorithmically weighted heterogeneous data ensemble to classify Saccharomyces cerevisiae proteins under 'mitochondrion organization'(GO:0007005) function defined in Gene Ontology. It consists of five euclidean-distance based nearest neighbour models and three affinity-based neighbourhood models, utilizing protein properties data, four gene expression datasets and protein interactions. The overall prediction is the weighted average of the posterior probability outputs given by the base models. Weights are determined by the standard Genetic Algorithm. The constituted base models show a fair agreement and improve the best performing base classifier by ~ 14.3%.\",\"PeriodicalId\":377504,\"journal\":{\"name\":\"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2016.61\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 16th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2016.61","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Ensemble Based in Silico Prediction of Saccharomyces Cerevisiae Proteins under Mitochondrion Organization
Protein function annotation is vital for identifying disease causative factors and for solving mysteries behind biological system complexities. As manual annotation requires costly and laborious in vitro methods, in silico protein function prediction is preferred nowadays. According to literature, one in five yeast mitochondrial proteins are known to be human disease related. This paper presents a genetic algorithmically weighted heterogeneous data ensemble to classify Saccharomyces cerevisiae proteins under 'mitochondrion organization'(GO:0007005) function defined in Gene Ontology. It consists of five euclidean-distance based nearest neighbour models and three affinity-based neighbourhood models, utilizing protein properties data, four gene expression datasets and protein interactions. The overall prediction is the weighted average of the posterior probability outputs given by the base models. Weights are determined by the standard Genetic Algorithm. The constituted base models show a fair agreement and improve the best performing base classifier by ~ 14.3%.