{"title":"Simple Neural Networks as Wavefront Slope Predictors: Training and Performance Issues","authors":"P. Gallant, G. Aitken","doi":"10.1364/adop.1996.athc.8","DOIUrl":null,"url":null,"abstract":"Artificial neural networks have gained significant popularity over the past several years in a wide variety of engineering applications. This popularity is due in part to the ability of a neural network that is trained using a supervised training rule such as error backpropagation to acquire a nonparametric representation of the mapping between a set of inputs and outputs without any specific knowledge of the application domain. Given a sufficient number of nonlinear terms, represented by a number of hidden-layer neurons, a multilayer neural network can model any mathematical function that is continuous and differentiable (Hecht-Nielsen, 1990). Difficulties can arise however when a network is trained with a limited amount of noisy “real” data and is then expected to operate as part of a system for a specific application. The network must acquire an internal representation, as stored in its weights, during the training phase that subsequently generalizes well to unseen data. In the case of a prediction application, generalization capability becomes the paramount design criteria. The generalization performance of a trained network is a strong function of several factors, including: the architecture and complexity of the network, the type of supervised training rule employed, and the manner in which data is preprocessed and presented to the network.","PeriodicalId":256393,"journal":{"name":"Adaptive Optics","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adaptive Optics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1364/adop.1996.athc.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial neural networks have gained significant popularity over the past several years in a wide variety of engineering applications. This popularity is due in part to the ability of a neural network that is trained using a supervised training rule such as error backpropagation to acquire a nonparametric representation of the mapping between a set of inputs and outputs without any specific knowledge of the application domain. Given a sufficient number of nonlinear terms, represented by a number of hidden-layer neurons, a multilayer neural network can model any mathematical function that is continuous and differentiable (Hecht-Nielsen, 1990). Difficulties can arise however when a network is trained with a limited amount of noisy “real” data and is then expected to operate as part of a system for a specific application. The network must acquire an internal representation, as stored in its weights, during the training phase that subsequently generalizes well to unseen data. In the case of a prediction application, generalization capability becomes the paramount design criteria. The generalization performance of a trained network is a strong function of several factors, including: the architecture and complexity of the network, the type of supervised training rule employed, and the manner in which data is preprocessed and presented to the network.