A study on the role of latent variables in the encoder-decoder model using image datasets

IF 0.6 Q4 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

IEICE Nonlinear Theory and Its Applications Pub Date : 2023-01-01 DOI:10.1587/nolta.14.652

Saki Okamoto, Kenya Jin'no

引用次数: 1

Abstract

An encoder-decoder model consists of an encoder that encodes the input into a low-dimensional latent variable and a decoder that decodes the obtained latent variable to the same dimension as the input. The encoder-decoder model performs representation learning to automatically extract features of the data, but the model is a black box and it is not clear what features are extracted. We focused on whether including a skip connection between the encoder and decoder increased accuracy. It is generally believed that this skip connection plays a role in conveying high-resolution information. However, its actual role remains unclear. In this study, we focused on this concatenation. We experimentally clarified the role of the latent variables conveyed by this concatenation when the images given to the input and output were the same or different during training.

查看原文本刊更多论文

利用图像数据集研究隐变量在编码器-解码器模型中的作用

编码器-解码器模型包括将输入编码为低维潜在变量的编码器和将所获得的潜在变量解码为与输入相同维数的解码器。编码器-解码器模型通过表示学习来自动提取数据的特征，但该模型是一个黑盒子，不清楚提取了哪些特征。我们关注的是，在编码器和解码器之间加入跳跃式连接是否能提高精度。一般认为，这种跳跃连接在传递高分辨率信息方面起着作用。然而，它的实际作用仍不清楚。在这项研究中，我们关注的是这种连接。在训练过程中，当输入和输出的图像相同或不同时，我们通过实验澄清了这种连接所传达的潜在变量的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEICE Nonlinear Theory and Its Applications MATHEMATICS, INTERDISCIPLINARY APPLICATIONS-

自引率

20.00%

发文量