{"title":"实现方面的q -学习控制器的一类动态过程","authors":"J. Musial, K. Stebel, Jacek Czeczot","doi":"10.1109/MMAR55195.2022.9874270","DOIUrl":null,"url":null,"abstract":"This paper presents a new approach to the general-purpose self-improving controller based on Q-learning control strategies. The previous approach was based on a three-dimensional Q-matrix, significantly slowing down the learning process and limiting the ability of practical implementation in industrial practice. The proposed new algorithm solves this problem by reducing the size of the matrix and the number of tuning parameters of the method without sacrificing its accuracy and learning capabilities. The algorithm is validated by simulation on the two second-order dynamics models and the results show significant improvement compared to the previous version of the developed method.","PeriodicalId":169528,"journal":{"name":"2022 26th International Conference on Methods and Models in Automation and Robotics (MMAR)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Implementation aspects of Q-learning controller for a class of dynamical processes\",\"authors\":\"J. Musial, K. Stebel, Jacek Czeczot\",\"doi\":\"10.1109/MMAR55195.2022.9874270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new approach to the general-purpose self-improving controller based on Q-learning control strategies. The previous approach was based on a three-dimensional Q-matrix, significantly slowing down the learning process and limiting the ability of practical implementation in industrial practice. The proposed new algorithm solves this problem by reducing the size of the matrix and the number of tuning parameters of the method without sacrificing its accuracy and learning capabilities. The algorithm is validated by simulation on the two second-order dynamics models and the results show significant improvement compared to the previous version of the developed method.\",\"PeriodicalId\":169528,\"journal\":{\"name\":\"2022 26th International Conference on Methods and Models in Automation and Robotics (MMAR)\",\"volume\":\"115 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 26th International Conference on Methods and Models in Automation and Robotics (MMAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MMAR55195.2022.9874270\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 26th International Conference on Methods and Models in Automation and Robotics (MMAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMAR55195.2022.9874270","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Implementation aspects of Q-learning controller for a class of dynamical processes
This paper presents a new approach to the general-purpose self-improving controller based on Q-learning control strategies. The previous approach was based on a three-dimensional Q-matrix, significantly slowing down the learning process and limiting the ability of practical implementation in industrial practice. The proposed new algorithm solves this problem by reducing the size of the matrix and the number of tuning parameters of the method without sacrificing its accuracy and learning capabilities. The algorithm is validated by simulation on the two second-order dynamics models and the results show significant improvement compared to the previous version of the developed method.