Dimension Reduction of Multidimensional Structured and Unstructured Datasets through Ensemble Learning of Neural Embeddings

IF 6.8 Q1 AUTOMATION & CONTROL SYSTEMS

Advanced intelligent systems (Weinheim an der Bergstrasse, Germany) Pub Date : 2024-08-04 DOI:10.1002/aisy.202400178

Juan Carlos Alvarado-Pérez, Miguel Angel Garcia, Domenec Puig

{"title":"Dimension Reduction of Multidimensional Structured and Unstructured Datasets through Ensemble Learning of Neural Embeddings","authors":"Juan Carlos Alvarado-Pérez, Miguel Angel Garcia, Domenec Puig","doi":"10.1002/aisy.202400178","DOIUrl":null,"url":null,"abstract":"Dimension reduction aims to project a high-dimensional dataset into a low-dimensional space. It tries to preserve the topological relationships among the original data points and/or induce clusters. NetDRm, an online dimensionality reduction method based on neural ensemble learning that integrates different dimension reduction methods in a synergistic way, is introduced. NetDRm is designed for datasets of multidimensional points that can be either structured (e.g., images) or unstructured (e.g., point clouds, tabular data). It starts by training a collection of deep residual encoders that learn the embeddings induced by multiple dimension reduction methods applied to the input dataset. Subsequently, a dense neural network integrates the generated encoders by emphasizing topological preservation or cluster induction. Experiments conducted on widely used multidimensional datasets (point-cloud manifolds, image datasets, tabular record datasets) show that the proposed method yields better results in terms of topological preservation (<math>\n <semantics>\n <mrow>\n <msub>\n <mi>R</mi>\n <mrow>\n <mtext>NX</mtext>\n </mrow>\n </msub>\n </mrow>\n <annotation>$R_{\\text{NX}}$</annotation>\n </semantics></math> curves), cluster induction (V measure), and classification accuracy than the most relevant dimension reduction methods.","PeriodicalId":93858,"journal":{"name":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","volume":"6 11","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/aisy.202400178","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/aisy.202400178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Dimension reduction aims to project a high-dimensional dataset into a low-dimensional space. It tries to preserve the topological relationships among the original data points and/or induce clusters. NetDRm, an online dimensionality reduction method based on neural ensemble learning that integrates different dimension reduction methods in a synergistic way, is introduced. NetDRm is designed for datasets of multidimensional points that can be either structured (e.g., images) or unstructured (e.g., point clouds, tabular data). It starts by training a collection of deep residual encoders that learn the embeddings induced by multiple dimension reduction methods applied to the input dataset. Subsequently, a dense neural network integrates the generated encoders by emphasizing topological preservation or cluster induction. Experiments conducted on widely used multidimensional datasets (point-cloud manifolds, image datasets, tabular record datasets) show that the proposed method yields better results in terms of topological preservation ( $R_{NX}$ curves), cluster induction (V measure), and classification accuracy than the most relevant dimension reduction methods.

Abstract Image

查看原文本刊更多论文

通过神经嵌入的集合学习降低多维结构化和非结构化数据集的维度

降维旨在将高维数据集投射到低维空间中。它试图保留原始数据点之间的拓扑关系和/或诱导聚类。NetDRm 是一种基于神经集合学习的在线降维方法，它以协同的方式整合了不同的降维方法。NetDRm 专为结构化（如图像）或非结构化（如点云、表格数据）的多维点数据集而设计。它首先要训练一组深度残差编码器，学习应用于输入数据集的多种降维方法所引起的嵌入。随后，密集神经网络通过强调拓扑保存或聚类归纳来整合生成的编码器。在广泛使用的多维数据集（点云流形、图像数据集、表格记录数据集）上进行的实验表明，与最相关的降维方法相比，所提出的方法在拓扑保持（R NX $R_{text\{NX}}$ 曲线）、聚类诱导（V 测量）和分类准确性方面都能产生更好的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advanced intelligent systems (Weinheim an der Bergstrasse, Germany)

CiteScore

1.30

自引率

0.00%

发文量

审稿时长

4 weeks