多层神经网络平均场极限的严格框架

Mathematical statistics and learning Pub Date : 2023-10-09 DOI:10.4171/msl/42

Phan-Minh Nguyen, Huy Tuan Pham

{"title":"多层神经网络平均场极限的严格框架","authors":"Phan-Minh Nguyen, Huy Tuan Pham","doi":"10.4171/msl/42","DOIUrl":null,"url":null,"abstract":"We develop a mathematically rigorous framework for multilayer neural networks in the mean field regime. As the network's widths increase, the network's learning trajectory is shown to be well captured by a meaningful and dynamically nonlinear limit (the \\textit{mean field} limit), which is characterized by a system of ODEs. Our framework applies to a broad range of network architectures, learning dynamics and network initializations. Central to the framework is the new idea of a \\textit{neuronal embedding}, which comprises of a non-evolving probability space that allows to embed neural networks of arbitrary widths. ","PeriodicalId":489458,"journal":{"name":"Mathematical statistics and learning","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"70","resultStr":"{\"title\":\"A rigorous framework for the mean field limit of multilayer neural networks\",\"authors\":\"Phan-Minh Nguyen, Huy Tuan Pham\",\"doi\":\"10.4171/msl/42\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We develop a mathematically rigorous framework for multilayer neural networks in the mean field regime. As the network's widths increase, the network's learning trajectory is shown to be well captured by a meaningful and dynamically nonlinear limit (the \\\\textit{mean field} limit), which is characterized by a system of ODEs. Our framework applies to a broad range of network architectures, learning dynamics and network initializations. Central to the framework is the new idea of a \\\\textit{neuronal embedding}, which comprises of a non-evolving probability space that allows to embed neural networks of arbitrary widths. \",\"PeriodicalId\":489458,\"journal\":{\"name\":\"Mathematical statistics and learning\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"70\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical statistics and learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.4171/msl/42\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical statistics and learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4171/msl/42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 70

摘要

我们为平均场状态下的多层神经网络开发了一个数学上严格的框架。随着网络宽度的增加，网络的学习轨迹被一个有意义的动态非线性极限(\textit{平均场}极限)很好地捕获，该极限以ode系统为特征。我们的框架适用于广泛的网络架构、学习动态和网络初始化。该框架的核心是\textit{神经元嵌入}的新思想，它由一个非进化的概率空间组成，允许嵌入任意宽度的神经网络。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A rigorous framework for the mean field limit of multilayer neural networks

We develop a mathematically rigorous framework for multilayer neural networks in the mean field regime. As the network's widths increase, the network's learning trajectory is shown to be well captured by a meaningful and dynamically nonlinear limit (the \textit{mean field} limit), which is characterized by a system of ODEs. Our framework applies to a broad range of network architectures, learning dynamics and network initializations. Central to the framework is the new idea of a \textit{neuronal embedding}, which comprises of a non-evolving probability space that allows to embed neural networks of arbitrary widths.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Mathematical statistics and learning

自引率

0.00%

发文量