K-Means Clustering on 3rd order polynomial based normalization of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL)

2009 Third International Conference on Electrical Engineering Pub Date : 2009-04-09 DOI:10.1109/ICEE.2009.5173170

Ahmed M. Mehdi, Mohammad Shoaib Sehgal, A. Zayegh, R. Begg, Abdul Manan

{"title":"K-Means Clustering on 3rd order polynomial based normalization of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL)","authors":"Ahmed M. Mehdi, Mohammad Shoaib Sehgal, A. Zayegh, R. Begg, Abdul Manan","doi":"10.1109/ICEE.2009.5173170","DOIUrl":null,"url":null,"abstract":"Microarray expression data is one of the most widely used to find patterns in genetic expressions. The DNA Microarray Technique participates as one of the leading methods in Cancer Research. Due to the presence of immense noise, fewer numbers of samples and huge amount of genes, the useful genomic knowledge extraction from this technique is an important question in today's Biological Research. Scientists and Researchers are exploring efficient mathematical procedure to find realistic gene expressed knowledge. In this study K-Means Clustering technique is used on an efficient 3rd order polynomial based technique to normalize the genomic data of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL). AML was used as a model to generate the coefficients of the polynomial by considering non trending, decorellation and offset based techniques. The K nearest Neighbor technique is used to estimate the missing values of microarray data and avoid the impact of missing data on clustering algorithm. The data can be regenerated easily using 3rd order polynomial normalization based on model generated by AML. Top ranked genes in each cluster have been presented in this paper which helps in finding functionally coregulated genes in ALL and AML.","PeriodicalId":244218,"journal":{"name":"2009 Third International Conference on Electrical Engineering","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third International Conference on Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEE.2009.5173170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Microarray expression data is one of the most widely used to find patterns in genetic expressions. The DNA Microarray Technique participates as one of the leading methods in Cancer Research. Due to the presence of immense noise, fewer numbers of samples and huge amount of genes, the useful genomic knowledge extraction from this technique is an important question in today's Biological Research. Scientists and Researchers are exploring efficient mathematical procedure to find realistic gene expressed knowledge. In this study K-Means Clustering technique is used on an efficient 3rd order polynomial based technique to normalize the genomic data of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL). AML was used as a model to generate the coefficients of the polynomial by considering non trending, decorellation and offset based techniques. The K nearest Neighbor technique is used to estimate the missing values of microarray data and avoid the impact of missing data on clustering algorithm. The data can be regenerated easily using 3rd order polynomial normalization based on model generated by AML. Top ranked genes in each cluster have been presented in this paper which helps in finding functionally coregulated genes in ALL and AML.

查看原文本刊更多论文

基于三阶多项式的急性髓性白血病和急性淋巴细胞白血病K-Means聚类归一化

微阵列表达数据是发现基因表达模式最广泛使用的方法之一。DNA微阵列技术是癌症研究的主要方法之一。由于噪声大、样本数量少、基因量大，从该技术中提取有用的基因组知识是当今生物学研究中的一个重要问题。科学家和研究人员正在探索有效的数学程序来寻找现实的基因表达知识。本研究将k均值聚类技术应用于一种高效的基于三阶多项式的技术上，对急性髓性白血病(AML)和急性淋巴细胞白血病(ALL)的基因组数据进行归一化。采用AML作为模型，通过考虑非趋势、去相关和基于偏移的技术来生成多项式系数。采用K近邻技术估计微阵列数据的缺失值，避免缺失数据对聚类算法的影响。基于AML生成的模型，采用三阶多项式归一化方法，可以很容易地重新生成数据。本文给出了每个簇中排名靠前的基因，这有助于在ALL和AML中找到功能协同调节的基因。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 Third International Conference on Electrical Engineering

自引率

0.00%

发文量