{"title":"用监督学习方法进行蛋白质结构域注释的挑战:系统综述","authors":"A. A. Parikesit, Rizky Nurdiansyah","doi":"10.5614/jms.2019.24.1.1","DOIUrl":null,"url":null,"abstract":"The protein domain is still considered one of the most critical evolutionary unit in the cellular, molecular mechanism. Hence, providing the quantitative measurements of the domain contents of proteins would be essential to give insights on the dynamics of the cell’s biochemical machinery. Bioinformatics, as a quantitative science, has successfully provided several approaches to comprehend the domain contents and their dynamics. It was done mainly with supervised learning approach. In eukaryote domain, there is the tendency of transcription factor domain avoidance in the higher organism, and co-occurrence in a single cell has shed light on the complexity of domain functionality. It is widely assumed that protein domain tends to avoid each other as the organism gained more sophisticated molecular features. However, more sample organisms should be provided to obtain better insight on the domain co-occurrence in the cells. This systematic review was conducted by searching literature in Google Scholars and PubMed. To this end, automatic pipelines should be provided by software packages such as DOMOSAIC with the help of the R-based scripting to uphold the statistical significance. The latest updates on this topic are covering annotation on the orphan domain in Drosophila, and architecture plasticity in the eukaryote.","PeriodicalId":31765,"journal":{"name":"Jurnal Pendidikan Matematika dan Sains","volume":"87 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"The Challenge of Protein Domain Annotation with Supervised Learn-ing Approach: A Systematic Review\",\"authors\":\"A. A. Parikesit, Rizky Nurdiansyah\",\"doi\":\"10.5614/jms.2019.24.1.1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The protein domain is still considered one of the most critical evolutionary unit in the cellular, molecular mechanism. Hence, providing the quantitative measurements of the domain contents of proteins would be essential to give insights on the dynamics of the cell’s biochemical machinery. Bioinformatics, as a quantitative science, has successfully provided several approaches to comprehend the domain contents and their dynamics. It was done mainly with supervised learning approach. In eukaryote domain, there is the tendency of transcription factor domain avoidance in the higher organism, and co-occurrence in a single cell has shed light on the complexity of domain functionality. It is widely assumed that protein domain tends to avoid each other as the organism gained more sophisticated molecular features. However, more sample organisms should be provided to obtain better insight on the domain co-occurrence in the cells. This systematic review was conducted by searching literature in Google Scholars and PubMed. To this end, automatic pipelines should be provided by software packages such as DOMOSAIC with the help of the R-based scripting to uphold the statistical significance. The latest updates on this topic are covering annotation on the orphan domain in Drosophila, and architecture plasticity in the eukaryote.\",\"PeriodicalId\":31765,\"journal\":{\"name\":\"Jurnal Pendidikan Matematika dan Sains\",\"volume\":\"87 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Pendidikan Matematika dan Sains\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5614/jms.2019.24.1.1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Pendidikan Matematika dan Sains","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5614/jms.2019.24.1.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Challenge of Protein Domain Annotation with Supervised Learn-ing Approach: A Systematic Review
The protein domain is still considered one of the most critical evolutionary unit in the cellular, molecular mechanism. Hence, providing the quantitative measurements of the domain contents of proteins would be essential to give insights on the dynamics of the cell’s biochemical machinery. Bioinformatics, as a quantitative science, has successfully provided several approaches to comprehend the domain contents and their dynamics. It was done mainly with supervised learning approach. In eukaryote domain, there is the tendency of transcription factor domain avoidance in the higher organism, and co-occurrence in a single cell has shed light on the complexity of domain functionality. It is widely assumed that protein domain tends to avoid each other as the organism gained more sophisticated molecular features. However, more sample organisms should be provided to obtain better insight on the domain co-occurrence in the cells. This systematic review was conducted by searching literature in Google Scholars and PubMed. To this end, automatic pipelines should be provided by software packages such as DOMOSAIC with the help of the R-based scripting to uphold the statistical significance. The latest updates on this topic are covering annotation on the orphan domain in Drosophila, and architecture plasticity in the eukaryote.