{"title":"An Empirical Study on Unsupervised Pre-training Approaches in Regression Problems","authors":"P. Saikia, R. Baruah","doi":"10.1109/SSCI.2018.8628674","DOIUrl":null,"url":null,"abstract":"Unsupervised pre-training allows for efficient training of deep architectures. It provides a good set of initialised weights to the deep architecture that can provide better generalisation of the data. In this paper, we aim to empirically analyse the effect of different unsupervised pre-training approaches for the task of regression on different datasets. We have considered two most common pre-training methods namely deep belief network and stacked autoencoder, and compared the results with the standard training algorithm without pretraining. The models with pretraining performed better than the model without pretraining in terms of error, convergence and the prediction of pattern. The results of the experiments also show the importance of hyperparameters tuning, specially learning rate, in providing a better prediction result. This study once again confirmed the effectiveness and potential of pretraining approach in nonlinear regression problem.","PeriodicalId":235735,"journal":{"name":"2018 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"144 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI.2018.8628674","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Unsupervised pre-training allows for efficient training of deep architectures. It provides the deep architecture with a good set of initial weights that can lead to better generalisation of the data. In this paper, we empirically analyse the effect of different unsupervised pre-training approaches on the task of regression across different datasets. We consider the two most common pre-training methods, namely the deep belief network and the stacked autoencoder, and compare their results with a standard training algorithm without pre-training. The models with pre-training performed better than the model without pre-training in terms of error, convergence, and pattern prediction. The experimental results also show the importance of hyperparameter tuning, especially of the learning rate, in producing better predictions. This study once again confirms the effectiveness and potential of the pre-training approach in nonlinear regression problems.
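The paper does not reproduce its training code here, but the stacked-autoencoder pipeline it compares against random initialisation follows the standard greedy layer-wise scheme: train one autoencoder per layer on the previous layer's codes, then stack the encoders, attach a regression head, and fine-tune end-to-end. Below is a minimal PyTorch sketch of that scheme; the layer sizes, activations, learning rates, and epoch counts are illustrative assumptions, not the settings used in the paper.

```python
# Sketch of greedy layer-wise stacked-autoencoder pre-training followed by
# supervised fine-tuning for regression. All hyperparameters are assumptions.
import torch
import torch.nn as nn


def pretrain_stacked_autoencoder(X, layer_sizes, epochs=50, lr=1e-3):
    """Greedily train one autoencoder per layer; return the trained encoders."""
    encoders = []
    inputs = X
    for hidden in layer_sizes:
        enc = nn.Linear(inputs.shape[1], hidden)
        dec = nn.Linear(hidden, inputs.shape[1])
        opt = torch.optim.Adam(list(enc.parameters()) + list(dec.parameters()), lr=lr)
        for _ in range(epochs):
            opt.zero_grad()
            recon = dec(torch.sigmoid(enc(inputs)))       # reconstruct the layer input
            loss = nn.functional.mse_loss(recon, inputs)  # unsupervised objective
            loss.backward()
            opt.step()
        encoders.append(enc)
        inputs = torch.sigmoid(enc(inputs)).detach()      # codes feed the next layer
    return encoders


def build_regressor(encoders, out_dim=1):
    """Stack the pre-trained encoders and add a linear regression head."""
    layers = []
    for enc in encoders:
        layers += [enc, nn.Sigmoid()]
    layers.append(nn.Linear(encoders[-1].out_features, out_dim))
    return nn.Sequential(*layers)


# Usage: pre-train on the (unlabelled) inputs, then fine-tune with MSE.
X = torch.randn(256, 20)  # toy inputs
y = torch.randn(256, 1)   # toy regression targets
model = build_regressor(pretrain_stacked_autoencoder(X, [64, 32]))
opt = torch.optim.Adam(model.parameters(), lr=1e-4)  # smaller fine-tuning rate
for _ in range(100):
    opt.zero_grad()
    loss = nn.functional.mse_loss(model(X), y)
    loss.backward()
    opt.step()
```

Deep belief network pre-training follows the same greedy layer-wise pattern, but each layer is a restricted Boltzmann machine trained by contrastive divergence rather than an autoencoder trained by reconstruction error.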