Gödel Number based Clustering Algorithm with Decimal First Degree Cellular Automata

arXiv - CS - Formal Languages and Automata Theory Pub Date : 2024-05-08 DOI:arxiv-2405.04881

Vicky Vikrant, Narodia Parth P, Kamalika Bhattacharjee

引用次数: 0

Abstract

In this paper, a decimal first degree cellular automata (FDCA) based clustering algorithm is proposed where clusters are created based on reachability. Cyclic spaces are created and configurations which are in the same cycle are treated as the same cluster. Here, real-life data objects are encoded into decimal strings using G\"odel number based encoding. The benefits of the scheme is, it reduces the encoded string length while maintaining the features properties. Candidate CA rules are identified based on some theoretical criteria such as self-replication and information flow. An iterative algorithm is developed to generate the desired number of clusters over three stages. The results of the clustering are evaluated based on benchmark clustering metrics such as Silhouette score, Davis Bouldin, Calinski Harabasz and Dunn Index. In comparison with the existing state-of-the-art clustering algorithms, our proposed algorithm gives better performance.

查看原文本刊更多论文

基于哥德尔数的十进制一级细胞自动机聚类算法

本文提出了一种基于十进制一级细胞自动机（FDCA）的聚类算法，根据可达性创建聚类。循环空间被创建，处于同一循环中的配置被视为同一聚类。在这里，现实生活中的数据对象使用基于模型数的编码方式编码成十进制字符串。该方案的好处是，在保持特征属性的同时减少了编码字符串的长度。候选 CA 规则是根据自我复制和信息流等理论标准确定的。我们开发了一种迭代算法，分三个阶段生成所需的聚类数量。聚类结果根据基准聚类指标（如 Silhouette score、Davis Bouldin、CalinskiHarabasz 和 Dunn Index）进行评估。与现有的最先进的聚类算法相比，我们提出的算法性能更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Formal Languages and Automata Theory

自引率

0.00%

发文量