Thanh Tung Bui, Thanh Trung Cao, Trong Hieu Nguyen, D. Le, Huy Hoang Dao, P. Dao
{"title":"基于扰动观测器的桥式起重机系统强化学习","authors":"Thanh Tung Bui, Thanh Trung Cao, Trong Hieu Nguyen, D. Le, Huy Hoang Dao, P. Dao","doi":"10.1109/ICCAIS56082.2022.9990215","DOIUrl":null,"url":null,"abstract":"In this work, a disturbance-observer based reinforcement learning control scheme is presented for the overhead crane system. First, the approximate/adaptive dynamic programming (ADP) method is applied to obtain the solution of a discounted optimal control problem. Here, we use only one neural network as a critic network. The weights of this network are updated iteratively using a novel updating rule law. A disturbance-observer is then designed to compensate the effect of the unknown input disturbance, therefore improve the robustness of the system. The convergence of each module as well as the stability of the whole closed-loop system is guaranteed by proving rigorously. Finally, numerical simulations are given to illustrate the effectiveness of the proposed method.","PeriodicalId":273404,"journal":{"name":"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Disturbance-Observer based Reinforcement Learning for Overhead Crane Systems\",\"authors\":\"Thanh Tung Bui, Thanh Trung Cao, Trong Hieu Nguyen, D. Le, Huy Hoang Dao, P. Dao\",\"doi\":\"10.1109/ICCAIS56082.2022.9990215\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, a disturbance-observer based reinforcement learning control scheme is presented for the overhead crane system. First, the approximate/adaptive dynamic programming (ADP) method is applied to obtain the solution of a discounted optimal control problem. Here, we use only one neural network as a critic network. The weights of this network are updated iteratively using a novel updating rule law. A disturbance-observer is then designed to compensate the effect of the unknown input disturbance, therefore improve the robustness of the system. The convergence of each module as well as the stability of the whole closed-loop system is guaranteed by proving rigorously. Finally, numerical simulations are given to illustrate the effectiveness of the proposed method.\",\"PeriodicalId\":273404,\"journal\":{\"name\":\"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAIS56082.2022.9990215\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 11th International Conference on Control, Automation and Information Sciences (ICCAIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAIS56082.2022.9990215","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Disturbance-Observer based Reinforcement Learning for Overhead Crane Systems
In this work, a disturbance-observer based reinforcement learning control scheme is presented for the overhead crane system. First, the approximate/adaptive dynamic programming (ADP) method is applied to obtain the solution of a discounted optimal control problem. Here, we use only one neural network as a critic network. The weights of this network are updated iteratively using a novel updating rule law. A disturbance-observer is then designed to compensate the effect of the unknown input disturbance, therefore improve the robustness of the system. The convergence of each module as well as the stability of the whole closed-loop system is guaranteed by proving rigorously. Finally, numerical simulations are given to illustrate the effectiveness of the proposed method.