{"title":"Application of reinforcement learning to improve control performance of plant","authors":"M. Shadi, Mahdi Sargolzaei","doi":"10.1109/CIMSA.2008.4595837","DOIUrl":null,"url":null,"abstract":"This paper is concerned with the development of an online Reinforcement Learning (RL) technique that significantly improves the control systems behavior. The reinforcement learner is based on Q-learning and the final controller is an artificial neural network whose weights are tuned by on line learning. In order to speed up the learning processes and prevent the plant from the instability, initially a PID is utilized as an augmented controller until the reinforcement learning becomes capable of keep the system stable and prevent the system from undesirable behavior. Example of use is presented and the effectiveness of the proposed approach is shown.","PeriodicalId":302812,"journal":{"name":"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Conference on Computational Intelligence for Measurement Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIMSA.2008.4595837","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper is concerned with the development of an online Reinforcement Learning (RL) technique that significantly improves the control systems behavior. The reinforcement learner is based on Q-learning and the final controller is an artificial neural network whose weights are tuned by on line learning. In order to speed up the learning processes and prevent the plant from the instability, initially a PID is utilized as an augmented controller until the reinforcement learning becomes capable of keep the system stable and prevent the system from undesirable behavior. Example of use is presented and the effectiveness of the proposed approach is shown.