{"title":"构造和评估链式规则传播算法的一种简单方法","authors":"Russell L. Smith","doi":"10.1109/ANNES.1995.499434","DOIUrl":null,"url":null,"abstract":"This paper provides some insight into the gradient based training of adaptive dynamic systems such as recurrent neural networks or neural network based controllers. In the neural network literature, training algorithms for such systems are generally of two types: those which propagate derivative information forwards in time, and those which propagate it backwards. These two types of algorithm are derived and analyzed for a simple prototype system. It is shown that they are very closely related because they compute the same components of the gradient vector but in a different order. The well known computational properties of each algorithm are then explained using a simple matrix multiplication analogy. Extensions of the prototype to control systems are demonstrated.","PeriodicalId":123427,"journal":{"name":"Proceedings 1995 Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A simple method for constructing and evaluating chain-rule propagation algorithms\",\"authors\":\"Russell L. Smith\",\"doi\":\"10.1109/ANNES.1995.499434\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper provides some insight into the gradient based training of adaptive dynamic systems such as recurrent neural networks or neural network based controllers. In the neural network literature, training algorithms for such systems are generally of two types: those which propagate derivative information forwards in time, and those which propagate it backwards. These two types of algorithm are derived and analyzed for a simple prototype system. It is shown that they are very closely related because they compute the same components of the gradient vector but in a different order. The well known computational properties of each algorithm are then explained using a simple matrix multiplication analogy. Extensions of the prototype to control systems are demonstrated.\",\"PeriodicalId\":123427,\"journal\":{\"name\":\"Proceedings 1995 Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 1995 Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ANNES.1995.499434\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 1995 Second New Zealand International Two-Stream Conference on Artificial Neural Networks and Expert Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ANNES.1995.499434","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A simple method for constructing and evaluating chain-rule propagation algorithms
This paper provides some insight into the gradient based training of adaptive dynamic systems such as recurrent neural networks or neural network based controllers. In the neural network literature, training algorithms for such systems are generally of two types: those which propagate derivative information forwards in time, and those which propagate it backwards. These two types of algorithm are derived and analyzed for a simple prototype system. It is shown that they are very closely related because they compute the same components of the gradient vector but in a different order. The well known computational properties of each algorithm are then explained using a simple matrix multiplication analogy. Extensions of the prototype to control systems are demonstrated.