{"title":"A clustering algorithm based on elitist evolutionary approach","authors":"Lydia Boudjeloud-Assala, Ta Minh Thuy","doi":"10.1504/IJBIC.2016.10004315","DOIUrl":null,"url":null,"abstract":"The k-means algorithm is a popular clustering algorithm. However, while k-means is convenient to implement, it produces solutions that are locally optimal. It depends on the number of clusters k and initialisation seeds. We introduce a method that can be used directly as a clustering algorithm or as an initialisation of the k-means algorithm based on the cluster number optimisation. The problem is the number of parameters required to find an optimal solution. We propose to apply diversity of population maintained through different evolutionary sub-populations and to apply the elitist strategy to select only the best concurrent solution. We also propose a new mutation strategy according to the neighbourhood search. This cooperative strategy allows us to find the global optimal solution for clustering tasks and optimal cluster seeds. We conduct numerical experiments to evaluate the effectiveness of the proposed algorithms on multi-class datasets, overlapped datasets and large-size datasets.","PeriodicalId":49059,"journal":{"name":"International Journal of Bio-Inspired Computation","volume":"42 1","pages":"258-266"},"PeriodicalIF":1.7000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bio-Inspired Computation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1504/IJBIC.2016.10004315","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 2
Abstract
The k-means algorithm is a popular clustering algorithm. However, while k-means is convenient to implement, it produces solutions that are locally optimal. It depends on the number of clusters k and initialisation seeds. We introduce a method that can be used directly as a clustering algorithm or as an initialisation of the k-means algorithm based on the cluster number optimisation. The problem is the number of parameters required to find an optimal solution. We propose to apply diversity of population maintained through different evolutionary sub-populations and to apply the elitist strategy to select only the best concurrent solution. We also propose a new mutation strategy according to the neighbourhood search. This cooperative strategy allows us to find the global optimal solution for clustering tasks and optimal cluster seeds. We conduct numerical experiments to evaluate the effectiveness of the proposed algorithms on multi-class datasets, overlapped datasets and large-size datasets.
期刊介绍:
IJBIC discusses the new bio-inspired computation methodologies derived from the animal and plant world, such as new algorithms mimicking the wolf schooling, the plant survival process, etc.
Topics covered include:
-New bio-inspired methodologies coming from
creatures living in nature
artificial society-
physical/chemical phenomena-
New bio-inspired methodology analysis tools, e.g. rough sets, stochastic processes-
Brain-inspired methods: models and algorithms-
Bio-inspired computation with big data: algorithms and structures-
Applications associated with bio-inspired methodologies, e.g. bioinformatics.