Srirupa Dasgupta, Goutam Saha, Ritwik Mondal, R. Pal, A. Chanda
{"title":"从微阵列数据中产生差异表达基因用于疾病预测的方法的比较","authors":"Srirupa Dasgupta, Goutam Saha, Ritwik Mondal, R. Pal, A. Chanda","doi":"10.1109/C3IT.2015.7060148","DOIUrl":null,"url":null,"abstract":"Feature selection from microarray data has become an ever evolving area of research. Numerous techniques have widely been applied for extraction of genes which are expressed differentially in microarray data. Some of these comprise of studies related to fold-change approach, classical t-statistics and modified t-statistics. It has been found that the gene lists returned by these methods are dissimilar. In this work we compare the outputs of two different feature selection methods using three classifiers based on different algorithms namely the Random Forest Ensemble based method, the Support vector machine (SVM) and the KNN methods, using the prediction accuracy of the test datasets.","PeriodicalId":402311,"journal":{"name":"Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A comparision between methods for generating differentially expressed genes from microarray data for prediction of disease\",\"authors\":\"Srirupa Dasgupta, Goutam Saha, Ritwik Mondal, R. Pal, A. Chanda\",\"doi\":\"10.1109/C3IT.2015.7060148\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection from microarray data has become an ever evolving area of research. Numerous techniques have widely been applied for extraction of genes which are expressed differentially in microarray data. Some of these comprise of studies related to fold-change approach, classical t-statistics and modified t-statistics. It has been found that the gene lists returned by these methods are dissimilar. In this work we compare the outputs of two different feature selection methods using three classifiers based on different algorithms namely the Random Forest Ensemble based method, the Support vector machine (SVM) and the KNN methods, using the prediction accuracy of the test datasets.\",\"PeriodicalId\":402311,\"journal\":{\"name\":\"Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/C3IT.2015.7060148\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/C3IT.2015.7060148","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A comparision between methods for generating differentially expressed genes from microarray data for prediction of disease
Feature selection from microarray data has become an ever evolving area of research. Numerous techniques have widely been applied for extraction of genes which are expressed differentially in microarray data. Some of these comprise of studies related to fold-change approach, classical t-statistics and modified t-statistics. It has been found that the gene lists returned by these methods are dissimilar. In this work we compare the outputs of two different feature selection methods using three classifiers based on different algorithms namely the Random Forest Ensemble based method, the Support vector machine (SVM) and the KNN methods, using the prediction accuracy of the test datasets.