{"title":"基于相互近邻法的基因选择","authors":"H. Shashirekha, A. Wani","doi":"10.1109/ERECT.2015.7499048","DOIUrl":null,"url":null,"abstract":"Gene expression data suffer from the curse of dimensionality due to the presence of several thousands of genes (features) but a small number of samples. This problem of large feature space is addressed by feature selection algorithms which aim at finding a comparatively small set of significant features by removing the redundant and irrelevant features thereby increasing the performance (e.g., higher accuracy for classification), decreasing the computational cost and improving the model interpretability and comprehending the results in an better way. In this paper, we explore the possible application of Mutual Nearest Neighbor (MNN) and Mean Test approaches to select significant genes from high dimensional gene expression data and compare their performances with three other well known algorithms. kNN classifier is used to measure the performances of these algorithms and the results are illustrated.","PeriodicalId":140556,"journal":{"name":"2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Gene selection by Mutual Nearest Neighbor approach\",\"authors\":\"H. Shashirekha, A. Wani\",\"doi\":\"10.1109/ERECT.2015.7499048\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Gene expression data suffer from the curse of dimensionality due to the presence of several thousands of genes (features) but a small number of samples. This problem of large feature space is addressed by feature selection algorithms which aim at finding a comparatively small set of significant features by removing the redundant and irrelevant features thereby increasing the performance (e.g., higher accuracy for classification), decreasing the computational cost and improving the model interpretability and comprehending the results in an better way. In this paper, we explore the possible application of Mutual Nearest Neighbor (MNN) and Mean Test approaches to select significant genes from high dimensional gene expression data and compare their performances with three other well known algorithms. kNN classifier is used to measure the performances of these algorithms and the results are illustrated.\",\"PeriodicalId\":140556,\"journal\":{\"name\":\"2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ERECT.2015.7499048\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Emerging Research in Electronics, Computer Science and Technology (ICERECT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ERECT.2015.7499048","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Gene selection by Mutual Nearest Neighbor approach
Gene expression data suffer from the curse of dimensionality due to the presence of several thousands of genes (features) but a small number of samples. This problem of large feature space is addressed by feature selection algorithms which aim at finding a comparatively small set of significant features by removing the redundant and irrelevant features thereby increasing the performance (e.g., higher accuracy for classification), decreasing the computational cost and improving the model interpretability and comprehending the results in an better way. In this paper, we explore the possible application of Mutual Nearest Neighbor (MNN) and Mean Test approaches to select significant genes from high dimensional gene expression data and compare their performances with three other well known algorithms. kNN classifier is used to measure the performances of these algorithms and the results are illustrated.