改进的K-means算法在基因表达数据分析中的应用

2011 IEEE International Conference on Systems Biology (ISB) Pub Date : 2011-10-03 DOI:10.1109/ISB.2011.6033126

Qian Ren, X. Zhuo

{"title":"改进的K-means算法在基因表达数据分析中的应用","authors":"Qian Ren, X. Zhuo","doi":"10.1109/ISB.2011.6033126","DOIUrl":null,"url":null,"abstract":"K-means algorithm is one of the most classic partition algorithms in clustering algorithms. The result obtained by K-means algorithm varies with the choice of the initial clustering centers. Motivated by this, an improved K-means algorithm is proposed based on the Kruskal algorithm, which is famous in graph theory. The procedure of this algorithm is shown as follows: Firstly, the minimum spanning tree (MST) of the clustered objects is obtained by using Kruskal algorithm. Then K-1 edges are deleted based on weights in a descending order. At last, the average values of the objects contained by the k-connected graphs resulting from last two steps are regarded as the initial clustering centers to cluster. Make the improved K-means algorithm used in gene expression data analysis, simulation experiment shows that the improved K-means algorithm has a better clustering effect and higher efficiency than the traditional one.","PeriodicalId":355056,"journal":{"name":"2011 IEEE International Conference on Systems Biology (ISB)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Application of an improved K-means algorithm in gene expression data analysis\",\"authors\":\"Qian Ren, X. Zhuo\",\"doi\":\"10.1109/ISB.2011.6033126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"K-means algorithm is one of the most classic partition algorithms in clustering algorithms. The result obtained by K-means algorithm varies with the choice of the initial clustering centers. Motivated by this, an improved K-means algorithm is proposed based on the Kruskal algorithm, which is famous in graph theory. The procedure of this algorithm is shown as follows: Firstly, the minimum spanning tree (MST) of the clustered objects is obtained by using Kruskal algorithm. Then K-1 edges are deleted based on weights in a descending order. At last, the average values of the objects contained by the k-connected graphs resulting from last two steps are regarded as the initial clustering centers to cluster. Make the improved K-means algorithm used in gene expression data analysis, simulation experiment shows that the improved K-means algorithm has a better clustering effect and higher efficiency than the traditional one.\",\"PeriodicalId\":355056,\"journal\":{\"name\":\"2011 IEEE International Conference on Systems Biology (ISB)\",\"volume\":\"97 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Conference on Systems Biology (ISB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISB.2011.6033126\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Systems Biology (ISB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISB.2011.6033126","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

K-means算法是聚类算法中最经典的划分算法之一。K-means算法得到的结果随着初始聚类中心的选择而变化。受此启发，在图论中著名的Kruskal算法的基础上，提出了一种改进的K-means算法。该算法的实现过程如下:首先，利用Kruskal算法得到聚类对象的最小生成树(MST);然后根据权值降序删除K-1条边。最后，将前两步得到的k连通图所包含对象的平均值作为初始聚类中心进行聚类。将改进的K-means算法应用于基因表达数据分析，仿真实验表明，改进的K-means算法比传统算法具有更好的聚类效果和更高的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Application of an improved K-means algorithm in gene expression data analysis

K-means algorithm is one of the most classic partition algorithms in clustering algorithms. The result obtained by K-means algorithm varies with the choice of the initial clustering centers. Motivated by this, an improved K-means algorithm is proposed based on the Kruskal algorithm, which is famous in graph theory. The procedure of this algorithm is shown as follows: Firstly, the minimum spanning tree (MST) of the clustered objects is obtained by using Kruskal algorithm. Then K-1 edges are deleted based on weights in a descending order. At last, the average values of the objects contained by the k-connected graphs resulting from last two steps are regarded as the initial clustering centers to cluster. Make the improved K-means algorithm used in gene expression data analysis, simulation experiment shows that the improved K-means algorithm has a better clustering effect and higher efficiency than the traditional one.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2011 IEEE International Conference on Systems Biology (ISB)

自引率

0.00%

发文量