{"title":"Internal node bagging: a layer-wise ensemble training method","authors":"Jinhong Li, Shun Yi","doi":"10.1145/3357254.3357268","DOIUrl":null,"url":null,"abstract":"When training neural networks, regularization methods are needed to avoid model overfitting. Dropout is a widely used regularization method, but its working principle is inconclusive and it does not work well for small models. This paper introduced a novel view to understand how dropout works as a layer-wise ensemble training method, that each feature in hidden layers is learned by multiple nodes, and next layer integrates the outputs of these nodes. Basing on the novel understanding of dropout, we proposed a new neural network training algorithm named internal node bagging, which explicitly forces a group of nodes to learn the same feature during training phase and combines these nodes into one node during testing phase. This means that more parameters can be used during training phase to improve the fitting ability of models while keeping model remains small during testing phase. After experimenting on three datasets, it is found that this algorithm can significantly improve the test performance of small models.","PeriodicalId":361892,"journal":{"name":"International Conference on Artificial Intelligence and Pattern Recognition","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3357254.3357268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
When training neural networks, regularization methods are needed to avoid overfitting. Dropout is a widely used regularization method, but its working principle is not fully understood and it does not work well for small models. This paper introduces a novel view of dropout as a layer-wise ensemble training method: each feature in a hidden layer is learned by multiple nodes, and the next layer integrates the outputs of these nodes. Based on this understanding of dropout, we propose a new neural network training algorithm named internal node bagging, which explicitly forces a group of nodes to learn the same feature during the training phase and combines these nodes into one node during the testing phase. This means that more parameters can be used during training to improve the fitting ability of the model while keeping the model small at test time. Experiments on three datasets show that this algorithm can significantly improve the test performance of small models.
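The abstract only sketches the mechanism, so the following is a minimal, hypothetical numpy illustration of the idea as described: during training, each logical output unit is backed by a small group of k internal nodes, a dropout-style mask selects which of them participate, and their outputs are averaged; at test time the group is collapsed into a single node so the deployed model stays small. The class name, the averaging rule for combining nodes, the keep probability, and the ReLU nonlinearity are all assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

class InternalNodeBaggingLayer:
    """Sketch of a dense layer trained with internal-node bagging.

    Each of the n_out output features is backed by k internal nodes during
    training; at test time the k nodes of a group are collapsed (here: by
    averaging their weights) into one node, so the deployed layer has only
    n_out units.  This is an illustrative sketch, not the paper's code.
    """

    def __init__(self, n_in, n_out, k, keep_prob=0.5, seed=0):
        self.rng = np.random.default_rng(seed)
        self.k = k
        self.keep_prob = keep_prob
        # One weight vector and bias per internal node: shape (n_out, k, n_in).
        self.W = self.rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_out, k, n_in))
        self.b = np.zeros((n_out, k))

    def forward_train(self, x):
        """Training pass: each group averages a random subset of its k nodes."""
        # Pre-activations of every internal node: (batch, n_out, k).
        z = np.einsum("bi,oki->bok", x, self.W) + self.b
        # Dropout-style mask over internal nodes; keep at least one node per group.
        mask = (self.rng.random(z.shape) < self.keep_prob).astype(z.dtype)
        mask[mask.sum(axis=-1) == 0] = 1.0
        # Group output = mean over the surviving internal nodes, then ReLU.
        group = (z * mask).sum(axis=-1) / mask.sum(axis=-1)
        return np.maximum(group, 0.0)

    def forward_test(self, x):
        """Test pass: collapse each group into one small node by averaging weights."""
        W_small = self.W.mean(axis=1)   # (n_out, n_in)
        b_small = self.b.mean(axis=1)   # (n_out,)
        return np.maximum(x @ W_small.T + b_small, 0.0)
```

For example, `InternalNodeBaggingLayer(784, 64, k=4)` (hypothetical sizes) would train with 4x the parameters of a 64-unit layer but deploy as an ordinary 64-unit layer, which matches the abstract's claim of extra training-time capacity with no test-time growth.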