{"title":"Incorporating Local Data and KL Membership Divergence into Hard C-Means Clustering for Fuzzy and Noise-Robust Data Segmentation","authors":"R. Gharieb","doi":"10.5772/INTECHOPEN.74514","DOIUrl":null,"url":null,"abstract":"Hard C-means (HCM) and fuzzy C-means (FCM) algorithms are among the most popular ones for data clustering including image data. The HCM algorithm offers each data entity with a cluster membership of 0 or 1. This implies that the entity will be assigned to only one cluster. On the contrary, the FCM algorithm provides an entity with a membership value between 0 and 1, which means that the entity may belong to all clusters but with different membership values. The main disadvantage of both HCM and FCM algorithms is that they cluster an entity based on only its self-features and do not incorporate the influence of the entity ’ s neighborhoods, which makes clustering prone to additive noise. In this chapter, Kullback-Leibler (KL) membership divergence is incorporated into the HCM for image data clustering. This HCM-KL-based clustering algorithm provides twofold advantage. The first one is that it offers a fuzzification approach to the HCM cluster- ing algorithm. The second one is that by incorporating a local spatial membership function into the HCM objective function, additive noise can be tolerated. Also spatial data is incorporated for more noise-robust clustering. pixels. Results of segmentation of synthetic, simulated medical and real-world images have shown that the proposed local membership KL divergence-based FCM (LMKLFCM) and the local data and membership KL divergence-based entropy FCM (LDMKLFCM) algorithms outperform several widely used FCM related algorithms. Moreover, the average runtimes of all algorithms have been measured via simulation. In all runs, all algorithms start from the same randomly generated initial conditions, as mentioned in the simulation section, and stopped at the same fixed point. The LDMKLFCM, LMKLFCM, standard FCM, MEFCM, and SFCM algorithms have provided average runtime of 1.5, 1.75, 1, 0.9 and 1 sec respectively. The simulation results have been done using Matlab R2013b under windows on a processor of Intel (R) core (TM) i3, CPU M370 2.4 GHZ, 4 GB RAM.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Recent Applications in Data Clustering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5772/INTECHOPEN.74514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 2
Abstract
Hard C-means (HCM) and fuzzy C-means (FCM) are among the most popular algorithms for data clustering, including image data. The HCM algorithm assigns each data entity a cluster membership of either 0 or 1, which implies that the entity belongs to exactly one cluster. In contrast, the FCM algorithm gives an entity a membership value between 0 and 1, so the entity may belong to all clusters, but with different membership values. The main disadvantage of both HCM and FCM is that they cluster an entity based only on its own features and do not incorporate the influence of the entity's neighborhood, which makes the clustering prone to additive noise. In this chapter, Kullback-Leibler (KL) membership divergence is incorporated into the HCM objective for image data clustering. This KL-based formulation provides a twofold advantage: first, it offers a fuzzification approach to the HCM clustering algorithm; second, by incorporating a local spatial membership function into the HCM objective function, additive noise can be tolerated. Local spatial data is also incorporated for more noise-robust clustering of image pixels. Segmentation results on synthetic, simulated medical, and real-world images show that the proposed local membership KL divergence-based FCM (LMKLFCM) and the local data and membership KL divergence-based entropy FCM (LDMKLFCM) algorithms outperform several widely used FCM-related algorithms. Moreover, the average runtimes of all algorithms were measured via simulation. In all runs, the algorithms started from the same randomly generated initial conditions, as described in the simulation section, and stopped at the same fixed point. The LDMKLFCM, LMKLFCM, standard FCM, MEFCM, and SFCM algorithms yielded average runtimes of 1.5, 1.75, 1, 0.9, and 1 s, respectively. The simulations were run in MATLAB R2013b under Windows on an Intel(R) Core(TM) i3 CPU M370 (2.4 GHz) with 4 GB of RAM.
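To make the membership-update idea concrete, below is a minimal NumPy sketch of an entropy/KL-regularized C-means iteration with a local spatial membership prior, in the spirit of the LMKLFCM algorithm summarized above. The assumed objective is the common KL/entropy-regularized form (sum of weighted squared distances plus a KL term between each pixel's memberships and the average memberships of its spatial neighbors); the regularization weight `lam`, the window size, and the function name `lmkl_cmeans` are illustrative assumptions, not the chapter's exact formulation.

```python
# Hedged sketch (not the chapter's exact algorithm): KL/entropy-regularized
# C-means with a local spatial membership prior. Assumed objective:
#   J = sum_{i,k} u_ik * ||x_i - v_k||^2 + lam * sum_{i,k} u_ik * log(u_ik / p_ik),
# where p_ik is the average membership of pixel i's spatial neighbors.
import numpy as np
from scipy.ndimage import uniform_filter

def lmkl_cmeans(image, n_clusters=3, lam=10.0, n_iter=50, window=3, seed=0):
    """Cluster a 2-D grayscale image with a KL membership regularizer.

    image  : 2-D float array of pixel intensities.
    lam    : weight of the KL/entropy term (assumed parameter).
    window : side length of the square neighborhood for the local prior.
    """
    rng = np.random.default_rng(seed)
    h, w = image.shape
    x = image.reshape(-1, 1)                      # pixels as 1-D features

    # random membership initialization; each row sums to 1
    u = rng.random((h * w, n_clusters))
    u /= u.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # cluster centers: membership-weighted means of the data
        v = (u.T @ x) / u.sum(axis=0)[:, None]    # shape (n_clusters, 1)

        # squared distance of every pixel to every center
        d2 = (x - v.T) ** 2                       # shape (h*w, n_clusters)

        # local membership prior: window average of each membership map
        p = np.stack(
            [uniform_filter(u[:, k].reshape(h, w), size=window).ravel()
             for k in range(n_clusters)], axis=1)
        p = np.clip(p, 1e-12, None)

        # closed-form membership update for the assumed objective:
        # u_ik proportional to p_ik * exp(-d_ik^2 / lam)
        u = p * np.exp(-d2 / lam)
        u /= u.sum(axis=1, keepdims=True)

    labels = u.argmax(axis=1).reshape(h, w)
    return labels, u, v
```

The exponential/normalization step is what fuzzifies the otherwise hard assignment, and the local prior `p` couples each pixel's memberships to those of its neighbors, which is the mechanism that lends noise robustness in this family of algorithms.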