{"title":"A study on the role of latent variables in the encoder-decoder model using image datasets","authors":"Saki Okamoto, Kenya Jin'no","doi":"10.1587/nolta.14.652","DOIUrl":null,"url":null,"abstract":"An encoder-decoder model consists of an encoder that encodes the input into a low-dimensional latent variable and a decoder that decodes the obtained latent variable to the same dimension as the input. The encoder-decoder model performs representation learning to automatically extract features of the data, but the model is a black box and it is not clear what features are extracted. We focused on whether including a skip connection between the encoder and decoder increased accuracy. It is generally believed that this skip connection plays a role in conveying high-resolution information. However, its actual role remains unclear. In this study, we focused on this concatenation. We experimentally clarified the role of the latent variables conveyed by this concatenation when the images given to the input and output were the same or different during training.","PeriodicalId":54110,"journal":{"name":"IEICE Nonlinear Theory and Its Applications","volume":"127 1","pages":"0"},"PeriodicalIF":0.5000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Nonlinear Theory and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/nolta.14.652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1
Abstract
An encoder-decoder model consists of an encoder that encodes the input into a low-dimensional latent variable and a decoder that decodes the obtained latent variable to the same dimension as the input. The encoder-decoder model performs representation learning to automatically extract features of the data, but the model is a black box and it is not clear what features are extracted. We focused on whether including a skip connection between the encoder and decoder increased accuracy. It is generally believed that this skip connection plays a role in conveying high-resolution information. However, its actual role remains unclear. In this study, we focused on this concatenation. We experimentally clarified the role of the latent variables conveyed by this concatenation when the images given to the input and output were the same or different during training.