{"title":"基因子集选择的正则化线性判别分析及其递归实现","authors":"K. Mao, Feng Yang, W. Tang","doi":"10.1109/CIBCB.2011.5948468","DOIUrl":null,"url":null,"abstract":"Although mostly used for pattern classification, linear discriminant analysis (LDA) may also be used for feature selection. When employed to select genes for microarray data, which has high dimensionality and small sample size, LDA encounters three problems, including singularity of scatter matrix, overfitting and prohibitive computational complexity. In this study, we propose a new regularization technique to address the singularity and overfitting problem. In addition, we develop a recursive implementation for LDA to reduce computational overhead. Experimental studies on 5 gene microarray problems show that the regularized linear discriminant analysis (RLDA) and its recursive implementation produce gene subsets with excellent classification performance.","PeriodicalId":395505,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","volume":"105 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Regularized linear discriminant analysis and its recursive implementation for gene subset selection\",\"authors\":\"K. Mao, Feng Yang, W. Tang\",\"doi\":\"10.1109/CIBCB.2011.5948468\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Although mostly used for pattern classification, linear discriminant analysis (LDA) may also be used for feature selection. When employed to select genes for microarray data, which has high dimensionality and small sample size, LDA encounters three problems, including singularity of scatter matrix, overfitting and prohibitive computational complexity. In this study, we propose a new regularization technique to address the singularity and overfitting problem. In addition, we develop a recursive implementation for LDA to reduce computational overhead. Experimental studies on 5 gene microarray problems show that the regularized linear discriminant analysis (RLDA) and its recursive implementation produce gene subsets with excellent classification performance.\",\"PeriodicalId\":395505,\"journal\":{\"name\":\"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"volume\":\"105 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2011.5948468\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2011.5948468","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Regularized linear discriminant analysis and its recursive implementation for gene subset selection
Although mostly used for pattern classification, linear discriminant analysis (LDA) may also be used for feature selection. When employed to select genes for microarray data, which has high dimensionality and small sample size, LDA encounters three problems, including singularity of scatter matrix, overfitting and prohibitive computational complexity. In this study, we propose a new regularization technique to address the singularity and overfitting problem. In addition, we develop a recursive implementation for LDA to reduce computational overhead. Experimental studies on 5 gene microarray problems show that the regularized linear discriminant analysis (RLDA) and its recursive implementation produce gene subsets with excellent classification performance.