Supervised gene clustering for extraction of discriminative features from microarray data

2010 Annual IEEE India Conference (INDICON) Pub Date : 2010-12-01 DOI:10.1109/INDCON.2010.5712629

C. Das, P. Maji, Samiran Chattopadhyay

引用次数: 1

Abstract

Among the large number of genes presented in microarray data, only a small fraction of them are effective for performing a certain diagnostic test. However, it is very difficult to identify these genes for disease diagnosis. In this regard, a new supervised gene clustering algorithm is proposed to cluster genes from microarray data. The proposed method directly incorporates the information of response variables in the grouping process for finding such groups of genes. Significant cluster representatives are then taken to form the reduced feature set that can be used to build the classifiers with very high classification accuracy. The effectiveness of the proposed method, along with a comparison with existing methods, is demonstrated on three microarray data sets based on predictive accuracy of the naive Bayes'classifier, the K-nearest neighbor rule, and the support vector machine.

查看原文本刊更多论文

监督基因聚类从微阵列数据中提取判别特征

在微阵列数据中呈现的大量基因中，只有一小部分对进行某种诊断测试有效。然而，鉴定这些基因用于疾病诊断是非常困难的。为此，提出了一种新的监督基因聚类算法，对芯片数据中的基因进行聚类。该方法直接将响应变量信息纳入到分组过程中，以寻找此类基因群。然后采用重要的聚类代表来形成可用于构建具有非常高分类精度的分类器的约简特征集。基于朴素贝叶斯分类器、k近邻规则和支持向量机的预测精度，在三种微阵列数据集上证明了该方法的有效性，并与现有方法进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 Annual IEEE India Conference (INDICON)

自引率

0.00%

发文量