Optimization Matters: Guidelines to Improve Representation Learning with Deep Networks

Aline R. Becher, M. Ponti
{"title":"Optimization Matters: Guidelines to Improve Representation Learning with Deep Networks","authors":"Aline R. Becher, M. Ponti","doi":"10.5753/eniac.2021.18287","DOIUrl":null,"url":null,"abstract":"Training deep neural networks is a relevant problem with open questions related to convergence and quality of learned representations. Gradient-based optimization methods are used in practice, but cases of failure and success are still to be investigated. In this context, we set out to better understand the convergence properties of different optimization strategies, under different parameter options. Our results show that (i) feature embeddings are impacted by different optimization settings, (ii) suboptimal results are achieved by the use of default parameters, (iii) significant improvement is obtained by making educated choices of parameters, (iv) learning rate decay should always be considered. Such findings offer guidelines for training and deployment of deep networks.","PeriodicalId":318676,"journal":{"name":"Anais do XVIII Encontro Nacional de Inteligência Artificial e Computacional (ENIAC 2021)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anais do XVIII Encontro Nacional de Inteligência Artificial e Computacional (ENIAC 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/eniac.2021.18287","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Training deep neural networks remains a relevant problem, with open questions related to convergence and the quality of learned representations. Gradient-based optimization methods are used in practice, but their cases of failure and success are still to be investigated. In this context, we set out to better understand the convergence properties of different optimization strategies under different parameter choices. Our results show that (i) feature embeddings are impacted by different optimization settings, (ii) suboptimal results are obtained with default parameters, (iii) significant improvement is obtained by making educated parameter choices, and (iv) learning rate decay should always be considered. These findings offer guidelines for the training and deployment of deep networks.
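
The abstract does not report the paper's exact training configuration, so the sketch below only illustrates points (iii) and (iv) in general terms: setting optimizer hyperparameters explicitly and adding a learning rate decay schedule instead of relying on library defaults. The model, learning rate, momentum, weight decay, and schedule values are illustrative assumptions, not the authors' settings.

```python
# Minimal sketch (assumed values, not the paper's configuration):
# explicit SGD hyperparameters plus a learning rate decay schedule.
import torch
import torch.nn as nn

# Toy model and synthetic data stand in for a real task.
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
loader = torch.utils.data.DataLoader(
    torch.utils.data.TensorDataset(
        torch.randn(512, 784), torch.randint(0, 10, (512,))
    ),
    batch_size=64,
)

# Educated parameter choices instead of defaults (momentum, weight decay).
optimizer = torch.optim.SGD(
    model.parameters(), lr=0.1, momentum=0.9, weight_decay=5e-4
)

# Learning rate decay: cosine annealing over the full training run.
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=100)

for epoch in range(100):
    for inputs, targets in loader:
        optimizer.zero_grad()
        loss = nn.functional.cross_entropy(model(inputs), targets)
        loss.backward()
        optimizer.step()
    scheduler.step()  # decay the learning rate once per epoch
```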