K-Means Clustering on 3rd order polynomial based normalization of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL)

Ahmed M. Mehdi, Mohammad Shoaib Sehgal, A. Zayegh, R. Begg, Abdul Manan
{"title":"K-Means Clustering on 3rd order polynomial based normalization of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL)","authors":"Ahmed M. Mehdi, Mohammad Shoaib Sehgal, A. Zayegh, R. Begg, Abdul Manan","doi":"10.1109/ICEE.2009.5173170","DOIUrl":null,"url":null,"abstract":"Microarray expression data is one of the most widely used to find patterns in genetic expressions. The DNA Microarray Technique participates as one of the leading methods in Cancer Research. Due to the presence of immense noise, fewer numbers of samples and huge amount of genes, the useful genomic knowledge extraction from this technique is an important question in today's Biological Research. Scientists and Researchers are exploring efficient mathematical procedure to find realistic gene expressed knowledge. In this study K-Means Clustering technique is used on an efficient 3rd order polynomial based technique to normalize the genomic data of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL). AML was used as a model to generate the coefficients of the polynomial by considering non trending, decorellation and offset based techniques. The K nearest Neighbor technique is used to estimate the missing values of microarray data and avoid the impact of missing data on clustering algorithm. The data can be regenerated easily using 3rd order polynomial normalization based on model generated by AML. Top ranked genes in each cluster have been presented in this paper which helps in finding functionally coregulated genes in ALL and AML.","PeriodicalId":244218,"journal":{"name":"2009 Third International Conference on Electrical Engineering","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third International Conference on Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEE.2009.5173170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Microarray expression data is one of the most widely used to find patterns in genetic expressions. The DNA Microarray Technique participates as one of the leading methods in Cancer Research. Due to the presence of immense noise, fewer numbers of samples and huge amount of genes, the useful genomic knowledge extraction from this technique is an important question in today's Biological Research. Scientists and Researchers are exploring efficient mathematical procedure to find realistic gene expressed knowledge. In this study K-Means Clustering technique is used on an efficient 3rd order polynomial based technique to normalize the genomic data of Acute Myeloid Leukemia (AML) and Acute Lymphocyte Leukemia (ALL). AML was used as a model to generate the coefficients of the polynomial by considering non trending, decorellation and offset based techniques. The K nearest Neighbor technique is used to estimate the missing values of microarray data and avoid the impact of missing data on clustering algorithm. The data can be regenerated easily using 3rd order polynomial normalization based on model generated by AML. Top ranked genes in each cluster have been presented in this paper which helps in finding functionally coregulated genes in ALL and AML.
基于三阶多项式的急性髓性白血病和急性淋巴细胞白血病K-Means聚类归一化
微阵列表达数据是发现基因表达模式最广泛使用的方法之一。DNA微阵列技术是癌症研究的主要方法之一。由于噪声大、样本数量少、基因量大,从该技术中提取有用的基因组知识是当今生物学研究中的一个重要问题。科学家和研究人员正在探索有效的数学程序来寻找现实的基因表达知识。本研究将k均值聚类技术应用于一种高效的基于三阶多项式的技术上,对急性髓性白血病(AML)和急性淋巴细胞白血病(ALL)的基因组数据进行归一化。采用AML作为模型,通过考虑非趋势、去相关和基于偏移的技术来生成多项式系数。采用K近邻技术估计微阵列数据的缺失值,避免缺失数据对聚类算法的影响。基于AML生成的模型,采用三阶多项式归一化方法,可以很容易地重新生成数据。本文给出了每个簇中排名靠前的基因,这有助于在ALL和AML中找到功能协同调节的基因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信