{"title":"Enhanced Image Clustering with Random-Bond Ising Models Using LDPC Graph Representations and Nishimori Temperature","authors":"V. S. Usatyuk, D. A. Sapozhnikov, S. I. Egorov","doi":"10.3103/S0027134924702102","DOIUrl":null,"url":null,"abstract":"<p>This paper addresses the challenge of improving clustering accuracy of image data, particularly focusing on feature representations extracted from convolutional deep neural networks (CNNs). Traditional spectral clustering methods often struggle with high dimension features tensors generated by CNNs like the VGG model. To overcome these limitations, this work proposes a novel approach that enhances spectral clustering by utilizing sparse graph representations (hyperbolic embedding) based on quasi-cyclic low-density parity check (QC-LDPC) and multiedge type (MET) QC-LDPC codes. These graphs are constructed using progressive edge growth (PEG), simulated annealing methods. The paper tackles the specific problem of effectively clustering high-dimensional, sparse image features by modeling their interactions with a random-bond Ising model (RBIM). The optimization process leverages Nishimori temperature estimation to assign weights to graph edges based on image features, leading to more accurate grouping of images into distinct clusters. This approach can be applied to various tasks, including classification. The proposed method not only improves clustering accuracy but also reduces the number of required parameters. It achieves a 17.39<span>\\(\\%\\)</span> improvement in accuracy (90.60<span>\\(\\%\\)</span>) compared to state-of-the-art Erdõs–Rényi graphs (73.21<span>\\(\\%\\)</span>), which lack the hardware-efficient structure of QC-LDPC graphs. By utilizing sparse feature parameters, an efficient MET QC-LDPC multigraph is created that outperforms conventional techniques such as mean-field approximation and Laplacian methods in graph clustering, binary classification. These findings highlight the potential of this approach for a wide range of applications, including image clustering, neural network pruning, data representation, and neuron activation pattern prediction.</p>","PeriodicalId":711,"journal":{"name":"Moscow University Physics Bulletin","volume":"79 2 supplement","pages":"S647 - S665"},"PeriodicalIF":0.4000,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Moscow University Physics Bulletin","FirstCategoryId":"101","ListUrlMain":"https://link.springer.com/article/10.3103/S0027134924702102","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper addresses the challenge of improving clustering accuracy of image data, particularly focusing on feature representations extracted from convolutional deep neural networks (CNNs). Traditional spectral clustering methods often struggle with high dimension features tensors generated by CNNs like the VGG model. To overcome these limitations, this work proposes a novel approach that enhances spectral clustering by utilizing sparse graph representations (hyperbolic embedding) based on quasi-cyclic low-density parity check (QC-LDPC) and multiedge type (MET) QC-LDPC codes. These graphs are constructed using progressive edge growth (PEG), simulated annealing methods. The paper tackles the specific problem of effectively clustering high-dimensional, sparse image features by modeling their interactions with a random-bond Ising model (RBIM). The optimization process leverages Nishimori temperature estimation to assign weights to graph edges based on image features, leading to more accurate grouping of images into distinct clusters. This approach can be applied to various tasks, including classification. The proposed method not only improves clustering accuracy but also reduces the number of required parameters. It achieves a 17.39\(\%\) improvement in accuracy (90.60\(\%\)) compared to state-of-the-art Erdõs–Rényi graphs (73.21\(\%\)), which lack the hardware-efficient structure of QC-LDPC graphs. By utilizing sparse feature parameters, an efficient MET QC-LDPC multigraph is created that outperforms conventional techniques such as mean-field approximation and Laplacian methods in graph clustering, binary classification. These findings highlight the potential of this approach for a wide range of applications, including image clustering, neural network pruning, data representation, and neuron activation pattern prediction.
期刊介绍:
Moscow University Physics Bulletin publishes original papers (reviews, articles, and brief communications) in the following fields of experimental and theoretical physics: theoretical and mathematical physics; physics of nuclei and elementary particles; radiophysics, electronics, acoustics; optics and spectroscopy; laser physics; condensed matter physics; chemical physics, physical kinetics, and plasma physics; biophysics and medical physics; astronomy, astrophysics, and cosmology; physics of the Earth’s, atmosphere, and hydrosphere.