{"title":"Deep, super-narrow neural network is a universal classifier","authors":"Lech Szymanski, B. McCane","doi":"10.1109/IJCNN.2012.6252513","DOIUrl":null,"url":null,"abstract":"Deep architecture models are known to be conducive to good generalisation for certain types of classification tasks. Existing unsupervised and semi-supervised training methods do not explain why and when deep internal representations will be effective. We investigate the fundamental principles of representation in deep architectures by devising a method for binary classification in multi-layer feed forward networks with limited breadth. We show that, given enough layers, a super-narrow neural network, with two neurons per layer, is capable of shattering any separable binary dataset. We also show that datasets that exhibit certain type of symmetries are better suited for deep representation and may require only few hidden layers to produce desired classification.","PeriodicalId":287844,"journal":{"name":"The 2012 International Joint Conference on Neural Networks (IJCNN)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2012 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2012.6252513","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 8
Abstract
Deep architecture models are known to be conducive to good generalisation for certain types of classification tasks. Existing unsupervised and semi-supervised training methods do not explain why or when deep internal representations will be effective. We investigate the fundamental principles of representation in deep architectures by devising a method for binary classification in multi-layer feed-forward networks of limited breadth. We show that, given enough layers, a super-narrow neural network, with only two neurons per layer, is capable of shattering any separable binary dataset. We also show that datasets exhibiting certain types of symmetry are better suited for deep representation and may require only a few hidden layers to produce the desired classification.
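To make the "super-narrow" architecture concrete, below is a minimal PyTorch sketch of a feed-forward binary classifier with two neurons in every hidden layer. The abstract does not specify how the paper sets the weights, so this sketch simply trains the network by gradient descent on a toy XOR dataset; the depth, activation choice, and training setup are illustrative assumptions, not the authors' construction.

```python
import torch
import torch.nn as nn

def super_narrow_net(depth: int, in_dim: int = 2) -> nn.Sequential:
    """Feed-forward binary classifier with `depth` hidden layers of
    two neurons each, matching the architecture the abstract describes.
    How weights are chosen here (gradient descent) is an assumption."""
    layers = [nn.Linear(in_dim, 2), nn.Tanh()]
    for _ in range(depth - 1):
        layers += [nn.Linear(2, 2), nn.Tanh()]
    layers += [nn.Linear(2, 1)]  # single logit for binary classification
    return nn.Sequential(*layers)

# Toy non-linearly-separable dataset (XOR corners); purely illustrative.
X = torch.tensor([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = torch.tensor([[0.], [1.], [1.], [0.]])

net = super_narrow_net(depth=8)
opt = torch.optim.Adam(net.parameters(), lr=0.05)
loss_fn = nn.BCEWithLogitsLoss()

for step in range(2000):
    opt.zero_grad()
    loss = loss_fn(net(X), y)
    loss.backward()
    opt.step()

# After training, thresholding the logits should recover the labels.
print((net(X) > 0).float().squeeze())
```

Note that each hidden layer maps the plane to the plane, so depth, not width, supplies the representational power; this is the property the paper's shattering result formalises.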