Gene clustering with partition around mediods algorithm based on weighted and normalized mahalanobis distance

Nwayyin Najat Mohammed, A. Abdulazeez
{"title":"Gene clustering with partition around mediods algorithm based on weighted and normalized mahalanobis distance","authors":"Nwayyin Najat Mohammed, A. Abdulazeez","doi":"10.1109/ICIIBMS.2017.8279707","DOIUrl":null,"url":null,"abstract":"The partition around medoids (PAM) algorithm is a robust and flexible unsupervised learning algorithm that depends on the underlying distance and default distance metric is Euclidean distance. The PAM algorithm is more efficient than K-Mean since medoids assess a minimum distance from the other objects. In this study, we have integrated the Mahalanobis distance with PAM algorithm, since Mahalanobis distance has been defined in cluster analysis for different applications and is used to overcome the problem of scaling and correlation with Euclidean distance. However, the performance of PAM algorithm based on Mahalanobis distance was found to be inadequate when employed on selected microarray expression datasets, which were pre-processed prior to the analyses. We proposed an enhanced PAM algorithm based on the weighted and normalized Mahalanobis distance, and the results obtained using our proposed algorithm reveal an ultimate cluster solution for selected microarray datasets. The algorithms were evaluated using the Dunn's validity index.","PeriodicalId":122969,"journal":{"name":"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"21","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIIBMS.2017.8279707","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 21

Abstract

The partition around medoids (PAM) algorithm is a robust and flexible unsupervised learning algorithm that depends on the underlying distance and default distance metric is Euclidean distance. The PAM algorithm is more efficient than K-Mean since medoids assess a minimum distance from the other objects. In this study, we have integrated the Mahalanobis distance with PAM algorithm, since Mahalanobis distance has been defined in cluster analysis for different applications and is used to overcome the problem of scaling and correlation with Euclidean distance. However, the performance of PAM algorithm based on Mahalanobis distance was found to be inadequate when employed on selected microarray expression datasets, which were pre-processed prior to the analyses. We proposed an enhanced PAM algorithm based on the weighted and normalized Mahalanobis distance, and the results obtained using our proposed algorithm reveal an ultimate cluster solution for selected microarray datasets. The algorithms were evaluated using the Dunn's validity index.
基于加权归一化马氏距离的基因聚类算法
围绕介质的划分(PAM)算法是一种鲁棒且灵活的无监督学习算法,它依赖于底层距离,默认距离度量为欧几里德距离。PAM算法比K-Mean算法更有效,因为介质评估与其他对象的最小距离。在本研究中,我们将马氏距离与PAM算法相结合,因为马氏距离已经在不同应用的聚类分析中定义,并用于克服缩放和与欧几里得距离相关的问题。然而,基于马氏距离的PAM算法在选定的微阵列表达数据集上使用时,发现性能不足,这些数据集在分析之前经过预处理。我们提出了一种基于加权和归一化马氏距离的增强PAM算法,该算法的结果揭示了所选微阵列数据集的最终聚类解。采用邓恩有效性指数对算法进行评价。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信