Hanaa M. Hussain, H. Seker, Malde Gorania, Newcastle Upon-Tyne United Kingdom Environment Newcastle
{"title":"Bioinformatics Approach to Classification of Four Classes of Organism in Relation to Their Optimal Growth Temperature","authors":"Hanaa M. Hussain, H. Seker, Malde Gorania, Newcastle Upon-Tyne United Kingdom Environment Newcastle","doi":"10.18178/ijpmbs.7.4.78-83","DOIUrl":null,"url":null,"abstract":" —Identifying the temperature class of proteins in prokaryotic organisms is one of the vital problems in enzyme and protein engineering. In this work, an efficient K-NN predictive models have been developed to discriminate hyperthermophilic, thermophilic, psychrophilic, and mesophilic proteins using Amino acid and Pseudo amino acid compositions. The two predictive models were built and tested with a large dataset consisting of 6631 hyperthermophiles, 11,700 thermophiles, 6267 psychrophiles, and 67,037 mesophiles. Implementation and analysis results showed that the proposed K-NN based predictive models were capable of discriminating the four classes efficiently and with high accuracies, whereby the Amino acid composition model achieved 94% accuracy when using 10-fold cross-validation, and 98% when using hold-out test. on the other hand, the Pseud amino acid composition based model achieved an accuracy of 99% using hold-out test.","PeriodicalId":281523,"journal":{"name":"International Journal of Pharma Medicine and Biological Sciences","volume":"341 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Pharma Medicine and Biological Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18178/ijpmbs.7.4.78-83","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
—Identifying the temperature class of proteins in prokaryotic organisms is one of the vital problems in enzyme and protein engineering. In this work, an efficient K-NN predictive models have been developed to discriminate hyperthermophilic, thermophilic, psychrophilic, and mesophilic proteins using Amino acid and Pseudo amino acid compositions. The two predictive models were built and tested with a large dataset consisting of 6631 hyperthermophiles, 11,700 thermophiles, 6267 psychrophiles, and 67,037 mesophiles. Implementation and analysis results showed that the proposed K-NN based predictive models were capable of discriminating the four classes efficiently and with high accuracies, whereby the Amino acid composition model achieved 94% accuracy when using 10-fold cross-validation, and 98% when using hold-out test. on the other hand, the Pseud amino acid composition based model achieved an accuracy of 99% using hold-out test.