{"title":"Continuous limits of residual neural networks in case of large input data","authors":"M. Herty, Anna Thünen, T. Trimborn, G. Visconti","doi":"10.2478/caim-2022-0008","DOIUrl":null,"url":null,"abstract":"Abstract Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean–field limit and show well–posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.","PeriodicalId":37903,"journal":{"name":"Communications in Applied and Industrial Mathematics","volume":"13 1","pages":"96 - 120"},"PeriodicalIF":0.3000,"publicationDate":"2021-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Applied and Industrial Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/caim-2022-0008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICS","Score":null,"Total":0}
引用次数: 1
Abstract
Abstract Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean–field limit and show well–posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.
期刊介绍:
Communications in Applied and Industrial Mathematics (CAIM) is one of the official journals of the Italian Society for Applied and Industrial Mathematics (SIMAI). Providing immediate open access to original, unpublished high quality contributions, CAIM is devoted to timely report on ongoing original research work, new interdisciplinary subjects, and new developments. The journal focuses on the applications of mathematics to the solution of problems in industry, technology, environment, cultural heritage, and natural sciences, with a special emphasis on new and interesting mathematical ideas relevant to these fields of application . Encouraging novel cross-disciplinary approaches to mathematical research, CAIM aims to provide an ideal platform for scientists who cooperate in different fields including pure and applied mathematics, computer science, engineering, physics, chemistry, biology, medicine and to link scientist with professionals active in industry, research centres, academia or in the public sector. Coverage includes research articles describing new analytical or numerical methods, descriptions of modelling approaches, simulations for more accurate predictions or experimental observations of complex phenomena, verification/validation of numerical and experimental methods; invited or submitted reviews and perspectives concerning mathematical techniques in relation to applications, and and fields in which new problems have arisen for which mathematical models and techniques are not yet available.