{"title":"Reinforcement learning neural network used in a tracking system controller","authors":"O. Grigore, O. Grigore","doi":"10.1109/ROMAN.2000.892472","DOIUrl":null,"url":null,"abstract":"This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network which is trained in real time using the reinforcement learning (RL) procedure. The advantage of this method is to overcome the difficulties implied by the direct solving method of the differential models which are necessary in a classical approach. Moreover, this new technique using a real-time training is better then the MLP network controller as well as the RBF network implementation which needs both of them in a preliminary training process, based on a set of input-output data that has to be a priory experimentally determined.","PeriodicalId":337709,"journal":{"name":"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROMAN.2000.892472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network which is trained in real time using the reinforcement learning (RL) procedure. The advantage of this method is to overcome the difficulties implied by the direct solving method of the differential models which are necessary in a classical approach. Moreover, this new technique using a real-time training is better then the MLP network controller as well as the RBF network implementation which needs both of them in a preliminary training process, based on a set of input-output data that has to be a priory experimentally determined.