Weng Howe Chan, M. S. Mohamad, S. Deris, J. Corchado, S. Omatu, Z. Ibrahim, S. Kasim
{"title":"基于萤火虫算法的改进gSVM-SCADL2信息基因和路径识别","authors":"Weng Howe Chan, M. S. Mohamad, S. Deris, J. Corchado, S. Omatu, Z. Ibrahim, S. Kasim","doi":"10.1504/IJBRA.2016.075404","DOIUrl":null,"url":null,"abstract":"Incorporation of pathway knowledge into microarray analysis has been favoured by researchers owing to the improved biological interpretation of the analysis outcome. However, most of the pathway data are manually curated without specific biological context. Inclusion of non-informative genes in the analysis of context specific microarray data could lead to classifier with poor discriminative power. Thus, one of the main challenges is how to effectively identify informative genes from the pathway data. This paper proposes a firefly optimised penalised support vector machine with SCADL2 penalty function SVM-SCADL2-FFA in optimising tuning parameters for each pathway for efficient identification of informative genes and pathways. Experiments are done on lung cancer and gender data sets. Tenfold CV is used to evaluate the performance in terms of accuracy, specificity, sensitivity and F-score. The identified informative genes are validated through online databases. Our proposed method shows consistent improvements compared to previous works.","PeriodicalId":434900,"journal":{"name":"Int. J. Bioinform. Res. Appl.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"An improved gSVM-SCADL2 with firefly algorithm for identification of informative genes and pathways\",\"authors\":\"Weng Howe Chan, M. S. Mohamad, S. Deris, J. Corchado, S. Omatu, Z. Ibrahim, S. Kasim\",\"doi\":\"10.1504/IJBRA.2016.075404\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Incorporation of pathway knowledge into microarray analysis has been favoured by researchers owing to the improved biological interpretation of the analysis outcome. However, most of the pathway data are manually curated without specific biological context. Inclusion of non-informative genes in the analysis of context specific microarray data could lead to classifier with poor discriminative power. Thus, one of the main challenges is how to effectively identify informative genes from the pathway data. This paper proposes a firefly optimised penalised support vector machine with SCADL2 penalty function SVM-SCADL2-FFA in optimising tuning parameters for each pathway for efficient identification of informative genes and pathways. Experiments are done on lung cancer and gender data sets. Tenfold CV is used to evaluate the performance in terms of accuracy, specificity, sensitivity and F-score. The identified informative genes are validated through online databases. Our proposed method shows consistent improvements compared to previous works.\",\"PeriodicalId\":434900,\"journal\":{\"name\":\"Int. J. Bioinform. Res. Appl.\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Bioinform. Res. Appl.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJBRA.2016.075404\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Bioinform. Res. Appl.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJBRA.2016.075404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An improved gSVM-SCADL2 with firefly algorithm for identification of informative genes and pathways
Incorporation of pathway knowledge into microarray analysis has been favoured by researchers owing to the improved biological interpretation of the analysis outcome. However, most of the pathway data are manually curated without specific biological context. Inclusion of non-informative genes in the analysis of context specific microarray data could lead to classifier with poor discriminative power. Thus, one of the main challenges is how to effectively identify informative genes from the pathway data. This paper proposes a firefly optimised penalised support vector machine with SCADL2 penalty function SVM-SCADL2-FFA in optimising tuning parameters for each pathway for efficient identification of informative genes and pathways. Experiments are done on lung cancer and gender data sets. Tenfold CV is used to evaluate the performance in terms of accuracy, specificity, sensitivity and F-score. The identified informative genes are validated through online databases. Our proposed method shows consistent improvements compared to previous works.