利用无监督学习改进基因选择性能

International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003 Pub Date : 1900-01-01 DOI:10.1109/ICNNSP.2003.1279209

Mingyi Wang, Ping Wu, Shu-Quan Xia

{"title":"利用无监督学习改进基因选择性能","authors":"Mingyi Wang, Ping Wu, Shu-Quan Xia","doi":"10.1109/ICNNSP.2003.1279209","DOIUrl":null,"url":null,"abstract":"Selection of significant genes via expression profiles is an important problem in microarray experiments for diseases classification and prediction. Genes of interest are typically selected by a statistical significance test and the top ranked genes were used. A problem with this approach is that many of these genes are highly correlated. For classification purposes it required to have distinct but still highly informative genes. In this paper, we proposed an unsupervised feature selection algorithm to resolve this problem. The method retrieves groups of similar genes by measuring similarity between them whereby redundancy therein is removed. This does not need any search and therefore, is fast. Real biological data experiments have shown that this approach will significantly improve existing classifiers.","PeriodicalId":336216,"journal":{"name":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","volume":"109 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Improving performance of gene selection by unsupervised learning\",\"authors\":\"Mingyi Wang, Ping Wu, Shu-Quan Xia\",\"doi\":\"10.1109/ICNNSP.2003.1279209\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Selection of significant genes via expression profiles is an important problem in microarray experiments for diseases classification and prediction. Genes of interest are typically selected by a statistical significance test and the top ranked genes were used. A problem with this approach is that many of these genes are highly correlated. For classification purposes it required to have distinct but still highly informative genes. In this paper, we proposed an unsupervised feature selection algorithm to resolve this problem. The method retrieves groups of similar genes by measuring similarity between them whereby redundancy therein is removed. This does not need any search and therefore, is fast. Real biological data experiments have shown that this approach will significantly improve existing classifiers.\",\"PeriodicalId\":336216,\"journal\":{\"name\":\"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003\",\"volume\":\"109 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNNSP.2003.1279209\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNNSP.2003.1279209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

通过表达谱选择重要基因是微阵列实验中用于疾病分类和预测的重要问题。通常通过统计显著性检验选择感兴趣的基因，并使用排名靠前的基因。这种方法的一个问题是，许多这些基因是高度相关的。为了分类的目的，它需要有不同的，但仍然是高度信息的基因。在本文中，我们提出了一种无监督特征选择算法来解决这个问题。该方法通过测量它们之间的相似性来检索相似基因组，从而消除冗余。它不需要任何搜索，因此速度很快。真实的生物数据实验表明，该方法将显著改善现有的分类器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving performance of gene selection by unsupervised learning

Selection of significant genes via expression profiles is an important problem in microarray experiments for diseases classification and prediction. Genes of interest are typically selected by a statistical significance test and the top ranked genes were used. A problem with this approach is that many of these genes are highly correlated. For classification purposes it required to have distinct but still highly informative genes. In this paper, we proposed an unsupervised feature selection algorithm to resolve this problem. The method retrieves groups of similar genes by measuring similarity between them whereby redundancy therein is removed. This does not need any search and therefore, is fast. Real biological data experiments have shown that this approach will significantly improve existing classifiers.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Neural Networks and Signal Processing, 2003. Proceedings of the 2003

自引率

0.00%

发文量