Quantitative Gaussian approximation of randomly initialized deep neural networks

IF 2.9 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Pub Date : 2024-06-25 DOI:10.1007/s10994-024-06578-z

Andrea Basteri, Dario Trevisan

引用次数: 0

Abstract

Given any deep fully connected neural network, initialized with random Gaussian parameters, we bound from above the quadratic Wasserstein distance between its output distribution and a suitable Gaussian process. Our explicit inequalities indicate how the hidden and output layers sizes affect the Gaussian behaviour of the network and quantitatively recover the distributional convergence results in the wide limit, i.e., if all the hidden layers sizes become large.

Abstract Image

查看原文本刊更多论文

随机初始化深度神经网络的定量高斯逼近

给定任何以随机高斯参数初始化的深度全连接神经网络，我们从上面约束了其输出分布与合适的高斯过程之间的二次瓦瑟斯坦距离。我们的显式不等式指出了隐藏层和输出层的大小如何影响网络的高斯行为，并定量地恢复了广义极限的分布收敛结果，即如果所有隐藏层的大小都变得很大。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Machine Learning 工程技术-计算机：人工智能

CiteScore

11.00

自引率

2.70%

发文量

162

审稿时长

3 months

期刊介绍： Machine Learning serves as a global platform dedicated to computational approaches in learning. The journal reports substantial findings on diverse learning methods applied to various problems, offering support through empirical studies, theoretical analysis, or connections to psychological phenomena. It demonstrates the application of learning methods to solve significant problems and aims to enhance the conduct of machine learning research with a focus on verifiable and replicable evidence in published papers.