Learning Deep Architectures for AI

Found. Trends Mach. Learn. Pub Date : 1900-01-01 DOI:10.1561/2200000006

Yoshua Bengio

引用次数: 8239

Abstract

Theoretical results strongly suggest that in order to learn the kind of complicated functions that can represent high-level abstractions (e.g. in vision, language, and other AI-level tasks), one needs deep architectures. Deep architectures are composed of multiple levels of non-linear operations, such as in neural nets with many hidden layers or in complicated propositional formulae re-using many sub-formulae. Searching the parameter space of deep architectures is a difficult optimization task, but learning algorithms such as those for Deep Belief Networks have recently been proposed to tackle this problem with notable success, beating the state-of-the-art in certain areas. This paper discusses the motivations and principles regarding learning algorithms for deep architectures, in particular those exploiting as building blocks unsupervised learning of single-layer models such as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks.

查看原文本刊更多论文

学习人工智能的深度架构

理论结果强烈表明，为了学习能够表示高级抽象的复杂功能(例如，在视觉，语言和其他ai级别的任务中)，人们需要深度架构。深度架构是由多层非线性操作组成的，例如在具有许多隐藏层的神经网络中，或者在复杂的命题公式中重用许多子公式。搜索深度架构的参数空间是一项困难的优化任务，但最近提出的学习算法(如深度信念网络的学习算法)在解决这一问题方面取得了显著的成功，在某些领域超过了最先进的技术。本文讨论了关于深度架构学习算法的动机和原则，特别是那些利用单层模型(如受限玻尔兹曼机)的无监督学习作为构建块的算法，用于构建更深的模型(如深度信念网络)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Found. Trends Mach. Learn.

自引率

0.00%

发文量