{"title":"强化学习神经网络在跟踪系统控制器中的应用","authors":"O. Grigore, O. Grigore","doi":"10.1109/ROMAN.2000.892472","DOIUrl":null,"url":null,"abstract":"This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network which is trained in real time using the reinforcement learning (RL) procedure. The advantage of this method is to overcome the difficulties implied by the direct solving method of the differential models which are necessary in a classical approach. Moreover, this new technique using a real-time training is better then the MLP network controller as well as the RBF network implementation which needs both of them in a preliminary training process, based on a set of input-output data that has to be a priory experimentally determined.","PeriodicalId":337709,"journal":{"name":"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reinforcement learning neural network used in a tracking system controller\",\"authors\":\"O. Grigore, O. Grigore\",\"doi\":\"10.1109/ROMAN.2000.892472\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network which is trained in real time using the reinforcement learning (RL) procedure. The advantage of this method is to overcome the difficulties implied by the direct solving method of the differential models which are necessary in a classical approach. Moreover, this new technique using a real-time training is better then the MLP network controller as well as the RBF network implementation which needs both of them in a preliminary training process, based on a set of input-output data that has to be a priory experimentally determined.\",\"PeriodicalId\":337709,\"journal\":{\"name\":\"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROMAN.2000.892472\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROMAN.2000.892472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement learning neural network used in a tracking system controller
This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network which is trained in real time using the reinforcement learning (RL) procedure. The advantage of this method is to overcome the difficulties implied by the direct solving method of the differential models which are necessary in a classical approach. Moreover, this new technique using a real-time training is better then the MLP network controller as well as the RBF network implementation which needs both of them in a preliminary training process, based on a set of input-output data that has to be a priory experimentally determined.