{"title":"Clustering Algorithms for Incomplete Datasets","authors":"Loai Abdallah, I. Shimshoni","doi":"10.5772/INTECHOPEN.78272","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.78272","url":null,"abstract":"Many real-world dataset suffers from the problem of missing values. Several methods were developed to deal with this problem. Many of them filled the missing values within fixed value based on statistical computation. In this research, we developed a new ver- sions of the k-means and the mean shift clustering algorithms that deal with datasets with missing values without filling their values. We developed a new distance function that is able to compute distances over incomplete datasets. The distance was computed based only on the mean and variance of the data for each attribute. As a result, the runtime complexity of our computation was O 1 ð Þ . We experimented on six standard numerical datasets from different fields. On these datasets, we simulated missing values and com- pared the performance of the developed algorithms using our distance and the suggested mean computations to other three basic methods. Our experiments show that the devel- oped algorithms using our distance function outperform the existing k-means and mean shift using other methods for dealing with missing values.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"29 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132228623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Point Cloud Clustering Using Panoramic Layered Range Image","authors":"M. Nakagawa, Kounosuke Kataoka, Shouta Ouma","doi":"10.5772/INTECHOPEN.76407","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.76407","url":null,"abstract":"Point-cloud clustering is an essential technique for modeling massive point clouds acquired with a laser scanner. There are three clustering approaches in point-cloud clustering, namely model-based clustering, edge-based clustering, and region-based clus- tering. In geoinformatics, edge-based and region-based clustering are often applied for the modeling of buildings and roads. These approaches use low-resolution point-cloud data that consist of tens of points or several hundred points per m 2 , such as aerial laser scanning data and vehicle-borne mobile mapping system data. These approaches also focus on geometrical knowledge and restrictions. We focused on region-based point-cloud clustering to improve 3D visualization and modeling using massive point clouds. We proposed a point-cloud clustering methodology and point-cloud filtering on a mul tilayered panoramic range image. A point-based rendering approach was applied for the range image generation using a massive point cloud. Moreover, we conducted three experiments to verify our methodology.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121778916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Incorporating Local Data and KL Membership Divergence into Hard C-Means Clustering for Fuzzy and Noise-Robust Data Segmentation","authors":"R. Gharieb","doi":"10.5772/INTECHOPEN.74514","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.74514","url":null,"abstract":"Hard C-means (HCM) and fuzzy C-means (FCM) algorithms are among the most popular ones for data clustering including image data. The HCM algorithm offers each data entity with a cluster membership of 0 or 1. This implies that the entity will be assigned to only one cluster. On the contrary, the FCM algorithm provides an entity with a membership value between 0 and 1, which means that the entity may belong to all clusters but with different membership values. The main disadvantage of both HCM and FCM algorithms is that they cluster an entity based on only its self-features and do not incorporate the influence of the entity ’ s neighborhoods, which makes clustering prone to additive noise. In this chapter, Kullback-Leibler (KL) membership divergence is incorporated into the HCM for image data clustering. This HCM-KL-based clustering algorithm provides twofold advantage. The first one is that it offers a fuzzification approach to the HCM cluster- ing algorithm. The second one is that by incorporating a local spatial membership function into the HCM objective function, additive noise can be tolerated. Also spatial data is incorporated for more noise-robust clustering. pixels. Results of segmentation of synthetic, simulated medical and real-world images have shown that the proposed local membership KL divergence-based FCM (LMKLFCM) and the local data and membership KL divergence-based entropy FCM (LDMKLFCM) algorithms outperform several widely used FCM related algorithms. Moreover, the average runtimes of all algorithms have been measured via simulation. 
In all runs, all algorithms start from the same randomly generated initial conditions, as mentioned in the simulation section, and stopped at the same fixed point. The LDMKLFCM, LMKLFCM, standard FCM, MEFCM, and SFCM algorithms have provided average runtime of 1.5, 1.75, 1, 0.9 and 1 sec respectively. The simulation results have been done using Matlab R2013b under windows on a processor of Intel (R) core (TM) i3, CPU M370 2.4 GHZ, 4 GB RAM.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130058738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Class of Parametric Tree-Based Clustering Methods","authors":"F. Glover, Yang Wang","doi":"10.5772/INTECHOPEN.76406","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.76406","url":null,"abstract":"We introduce a class of tree-based clustering methods based on a single parameter W and show how to generate the full collection of cluster sets C(W), without duplication, by varying W according to conditions identified during the algorithm’s execution. The number of clusters within C(W) for a given W is determined automatically, using a graph representation in which cluster elements are represented by nodes and their pairwise con- nections are represented by edges. We identify features of the clusters produced which lead to special procedures to accelerate the computation. Finally, we introduce a related node-based variant of the algorithm based on a parameter Y which can be used to generate clusters with complementary features, and a method that combines both variants based on a parameter Z and a weight that determines the contribution of each variant.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133779284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Centroid-Based Lexical Clustering","authors":"Khaled Abdalgader","doi":"10.5772/INTECHOPEN.75433","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.75433","url":null,"abstract":"Conventional lexical-clustering algorithms treat text fragments as a mixed collection of words, with a semantic similarity between them calculated based on the term of how many the particular word occurs within the compared fragments. Whereas this technique is appropriate for clustering large-sized textual collections, it operates poorly when clustering small-sized texts such as sentences. This is due to compared sentences that may be linguistically similar despite having no words in common. This chapter presents a new version of the original k-means method for sentence-level text clustering that is relay on the idea of use of the related synonyms in order to construct the rich semantic vectors. These vectors represent a sentence using linguistic information resulting from a lexical database founded to determine the actual sense to a word, based on the context in which it occurs. Therefore, while traditional k-means method application is relay on calculating the distance between patterns, the new proposed version operates by calculating the semantic similarity between sentences. This allows it to capture a higher degree of semantic or linguistic information existing within the clustered sentences. 
Experimental results illustrate that the proposed version of clustering algorithm performs favorably against other well-known clustering algorithms on several standard datasets.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122253229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"emporal Clustering for Behavior Variation and Anomaly Detection from Data Acquired Through IoT in Smart Cities","authors":"V. Urosevic, Ana Kovačević, Firas Kaddachi, MilanVukicevic","doi":"10.5772/INTECHOPEN.75203","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.75203","url":null,"abstract":"In this chapter, we propose a methodology for behavior variation and anomaly detection from acquired sensory data, based on temporal clustering models. Data are collected from five prominent European smart cities, and Singapore, that aim to become fully “elderly-friendly,” with the development and deployment of ubiquitous systems for assessment and prediction of early risks of elderly Mild Cognitive Impairments (MCI) and frailty, and for supporting generation and delivery of optimal personalized preventive interventions that mitigate those risks, utilizing smart city datasets and IoT infrastructure. Low level data collected from IoT devices are preprocessed as sequences of activities, with temporal and causal variations in sequences classified as normal or anomalous behavior. The goals of proposed methodology are to (1) recognize significant behavioral variation patterns and (2) support early identification of pattern changes. Temporal clustering models are applied in detection and prediction of the following variation types: intra-activity (single activity, single citizen) and inter-activity (multi- ple-activities, single citizen). 
Identified behavioral variations and anomalies are further mapped to MCI/frailty onset behavior and risk factors, following the developed geriatric expert model.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132178529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Spectral Clustering via Sparse Representation","authors":"Xiaodong Feng","doi":"10.5772/INTECHOPEN.76586","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.76586","url":null,"abstract":"Clustering high-dimensional data has been a challenging problem in data mining and machining learning. Spectral clustering via sparse representation has been proposed for clustering high-dimensional data. A critical step in spectral clustering is to effectively construct a weight matrix by assessing the proximity between each pair of objects. While sparse representation proves its effectiveness for compressing high-dimensional signals, existing spectral clustering algorithms based on sparse representation use those sparse coefficients directly. We believe that the similarity measure exploiting more global information from the coefficient vectors will provide more truthful similarity among data objects. The intuition is that the sparse coefficient vectors corresponding to two similar objects are similar and those of two dissimilar objects are also dissimilar. In particular, we propose two approaches of weight matrix construction according to the similarity of the sparse coefficient vectors. 
Experimental results on several real-world high-dimensional data sets demonstrate that spectral clustering based on the proposed similarity matrices outperforms existing spectral clustering algorithms via sparse representation.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126280240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Partitional Clustering","authors":"Uğurhan Kutbay","doi":"10.5772/intechopen.75836","DOIUrl":"https://doi.org/10.5772/intechopen.75836","url":null,"abstract":"","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132553235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Performance Assessment of Unsupervised Clustering Algorithms Combined MDL Index","authors":"Hadeel K. Aljobouri, Hussain A. Jaber, Ilyas Çankaya","doi":"10.5772/INTECHOPEN.74506","DOIUrl":"https://doi.org/10.5772/INTECHOPEN.74506","url":null,"abstract":"Best clustering analysis should be resisting the presence of outliers and be less sensi- tive to initialization as well as the input sequence ordering. This chapter compares the performance among three of the unsupervised clustering algorithms: neural gas (NG), growing neural gas (GNG), and robust growing neural gas (RGNG). A complete expla-nation of NG and GNG algorithms is presented in the next comparison with RGNG. Another comparison due to the minimum description length (MDL) criterion between RGNG used MDL value as the clustering validity index versus GNG and NG combined with MDL. Statistical estimations are applied to explain the meaning of the output results when these algorithms are fed to the synthetic 2D dataset. The techniques introduced in this chapter are designed and implemented in a simple software package using a MATLAB-based graphical user interface (GUI) tool, which allows users to interact with the clustering techniques and output data easily.","PeriodicalId":236959,"journal":{"name":"Recent Applications in Data Clustering","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115619409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}