{"title":"基于不确定性和干扰估计的增强学习鲁棒性改进","authors":"Jinsuk Choi, H. Park, Jongchan Baek, Soohee Han","doi":"10.23919/ICCAS55662.2022.10003692","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to improve the robustness of RLs based on model-free uncertainty and disturbance estimator (RL-based UDE). In the real environment, instead of using optimal trajectory and control techniques to perform complex tasks, it learns through RL and supplements robustness by using uncertainty and disturbance estimator (UDE). From UDE, the robotics system can be improved the stability by appropriately canceling the uncertainty and disturbance without efforts to obtain model information; hence the UDE can compensate for the performance degradation of RL when system is non-stationary. In addition, the performance can be improved by reducing the sensor noise from low-pass filter of UDE. It is shown through an experiment that the proposed RL-based UDE provides robustness.","PeriodicalId":129856,"journal":{"name":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improved Robustness of Reinforcement Learning Based on Uncertainty and Disturbance Estimator\",\"authors\":\"Jinsuk Choi, H. Park, Jongchan Baek, Soohee Han\",\"doi\":\"10.23919/ICCAS55662.2022.10003692\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a method to improve the robustness of RLs based on model-free uncertainty and disturbance estimator (RL-based UDE). In the real environment, instead of using optimal trajectory and control techniques to perform complex tasks, it learns through RL and supplements robustness by using uncertainty and disturbance estimator (UDE). From UDE, the robotics system can be improved the stability by appropriately canceling the uncertainty and disturbance without efforts to obtain model information; hence the UDE can compensate for the performance degradation of RL when system is non-stationary. In addition, the performance can be improved by reducing the sensor noise from low-pass filter of UDE. It is shown through an experiment that the proposed RL-based UDE provides robustness.\",\"PeriodicalId\":129856,\"journal\":{\"name\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 22nd International Conference on Control, Automation and Systems (ICCAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ICCAS55662.2022.10003692\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 22nd International Conference on Control, Automation and Systems (ICCAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ICCAS55662.2022.10003692","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Robustness of Reinforcement Learning Based on Uncertainty and Disturbance Estimator
This paper proposes a method to improve the robustness of RLs based on model-free uncertainty and disturbance estimator (RL-based UDE). In the real environment, instead of using optimal trajectory and control techniques to perform complex tasks, it learns through RL and supplements robustness by using uncertainty and disturbance estimator (UDE). From UDE, the robotics system can be improved the stability by appropriately canceling the uncertainty and disturbance without efforts to obtain model information; hence the UDE can compensate for the performance degradation of RL when system is non-stationary. In addition, the performance can be improved by reducing the sensor noise from low-pass filter of UDE. It is shown through an experiment that the proposed RL-based UDE provides robustness.