Reinforcement learning neural network used in a tracking system controller

Proceedings 9th IEEE International Workshop on Robot and Human Interactive Communication. IEEE RO-MAN 2000 (Cat. No.00TH8499). Pub Date: 2000-09-27. DOI: 10.1109/ROMAN.2000.892472

O. Grigore, O. Grigore


Citations: 1

Abstract

This paper presents a method of designing a controller for nonlinear systems based on a recurrent neural network that is trained in real time using a reinforcement learning (RL) procedure. The advantage of this method is that it avoids the difficulty of directly solving the differential models required by a classical approach. Moreover, because it is trained in real time, the new technique is better than the MLP network controller and the RBF network implementation, both of which need a preliminary training process based on a set of input-output data that has to be determined experimentally a priori.
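The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical first-order nonlinear plant (dy/dt = -y^3 + u), a controller with a single recurrent tanh unit, and a scalar reward equal to the negative squared tracking error. For simplicity the weights are adapted by random weight perturbation between simulated episodes (a perturbed weight set is kept only if the reward improves), which stands in for the real-time RL update described in the paper. All names and constants are illustrative assumptions.

```python
import math
import random


def plant_step(y, u, dt=0.05):
    # Hypothetical nonlinear plant (an assumption): dy/dt = -y**3 + u,
    # integrated with one explicit Euler step.
    return y + dt * (-y ** 3 + u)


def run_episode(w, steps=200):
    """Simulate the closed loop; return the total reward
    (negative sum of squared tracking errors)."""
    w_err, w_rec, w_out = w
    y, h, reward = 0.0, 0.0, 0.0
    for k in range(steps):
        ref = math.sin(0.05 * k)                 # reference signal to track
        err = ref - y                            # tracking error
        h = math.tanh(w_err * err + w_rec * h)   # single recurrent hidden unit
        u = w_out * h                            # control action
        y = plant_step(y, u)
        reward -= err * err
    return reward


def train(iterations=300, sigma=0.3, seed=0):
    """Weight-perturbation search driven only by the scalar reward.
    No input-output training data set is collected in advance."""
    rng = random.Random(seed)
    w = [0.1, 0.0, 0.1]                          # arbitrary initial weights
    best = run_episode(w)
    history = [best]
    for _ in range(iterations):
        cand = [wi + rng.gauss(0.0, sigma) for wi in w]
        r = run_episode(cand)
        if r > best:                             # keep only improving perturbations
            w, best = cand, r
        history.append(best)
    return w, history


if __name__ == "__main__":
    weights, history = train()
    print("initial reward:", history[0])
    print("final reward:  ", history[-1])
    print("weights:       ", weights)
```

The point of the sketch is that the controller is shaped only by a reward computed from the tracking error during operation. This contrasts with the MLP and RBF controllers mentioned in the abstract, which need an experimentally determined input-output data set before training can begin.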

