{"title":"基于LCS的全局方法预测GPCR和酶的功能","authors":"L. M. Romão, J. C. Nievola","doi":"10.1109/BIBE.2012.6399665","DOIUrl":null,"url":null,"abstract":"The families of G-Protein Coupled Receptor (GPCR) and enzymes are among the main protein family. They represent to the scientific and medical communities, a significant target for bioactive and drug discovery programs. The model of classification of enzymes and GPCR is characterized by its hierarchical structure in format of tree and this makes more difficult its prediction. In this work we propose an adapted version of Learning Classifier Systems (LCS) data mining algorithm which tends to be more efficient than statistical methods based on homology used in tool such as PSI-BLAST. Hence, a new global model approach, called HLCS (Hierarchical Learning Classifier System) is used to predict the function of enzymes and GPCR, respecting its organizational structure of classes throughout the model development. The HLCS is expressed as a set of IF-THEN classification rules, which have the advantage of representing comprehensible knowledge to biologist users. The HLCS is evaluated with eight datasets from enzymes and GPCR, and compared with a Global Naive Bayes algorithm, named GMNB. In the tests realized the HLCS outperformed the GMNB in the databases of the GPCR proteins group type.","PeriodicalId":330164,"journal":{"name":"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)","volume":"47 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Predicting GPCR and enzymes function with a global approach based on LCS\",\"authors\":\"L. M. Romão, J. C. Nievola\",\"doi\":\"10.1109/BIBE.2012.6399665\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The families of G-Protein Coupled Receptor (GPCR) and enzymes are among the main protein family. They represent to the scientific and medical communities, a significant target for bioactive and drug discovery programs. The model of classification of enzymes and GPCR is characterized by its hierarchical structure in format of tree and this makes more difficult its prediction. In this work we propose an adapted version of Learning Classifier Systems (LCS) data mining algorithm which tends to be more efficient than statistical methods based on homology used in tool such as PSI-BLAST. Hence, a new global model approach, called HLCS (Hierarchical Learning Classifier System) is used to predict the function of enzymes and GPCR, respecting its organizational structure of classes throughout the model development. The HLCS is expressed as a set of IF-THEN classification rules, which have the advantage of representing comprehensible knowledge to biologist users. The HLCS is evaluated with eight datasets from enzymes and GPCR, and compared with a Global Naive Bayes algorithm, named GMNB. In the tests realized the HLCS outperformed the GMNB in the databases of the GPCR proteins group type.\",\"PeriodicalId\":330164,\"journal\":{\"name\":\"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)\",\"volume\":\"47 7\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2012.6399665\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2012.6399665","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting GPCR and enzymes function with a global approach based on LCS
The families of G-Protein Coupled Receptor (GPCR) and enzymes are among the main protein family. They represent to the scientific and medical communities, a significant target for bioactive and drug discovery programs. The model of classification of enzymes and GPCR is characterized by its hierarchical structure in format of tree and this makes more difficult its prediction. In this work we propose an adapted version of Learning Classifier Systems (LCS) data mining algorithm which tends to be more efficient than statistical methods based on homology used in tool such as PSI-BLAST. Hence, a new global model approach, called HLCS (Hierarchical Learning Classifier System) is used to predict the function of enzymes and GPCR, respecting its organizational structure of classes throughout the model development. The HLCS is expressed as a set of IF-THEN classification rules, which have the advantage of representing comprehensible knowledge to biologist users. The HLCS is evaluated with eight datasets from enzymes and GPCR, and compared with a Global Naive Bayes algorithm, named GMNB. In the tests realized the HLCS outperformed the GMNB in the databases of the GPCR proteins group type.