{"title":"Application Research of Model-Free Reinforcement Learning under the Condition of Conditional Transfer Function with Coupling Factors","authors":"Xiaoya Yang, Youtian Guo, Rui Wang, Xiaohui Hu","doi":"10.1145/3430199.3430210","DOIUrl":null,"url":null,"abstract":"Dynamic systems are ubiquitous in nature. The analysis of the stability and performance of dynamic systems has been a research hotspot in control science and operations research for a long time. In this paper, we construct and analyze an actual sequential decision-making problem of dynamic system. The Model-Free reinforcement learning algorithms are used to optimize this problem. The problem is analyzed in detail through adaptive control theory and information theory, also the extreme performance of the algorithm is pointed out. In this paper, we select three classic Model-Free reinforcement learning algorithms, DQN, DQN-PER, and PPO, to compare and analyze their performance on the timing series decision problem we construct.","PeriodicalId":371055,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3430199.3430210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Dynamic systems are ubiquitous in nature. The analysis of the stability and performance of dynamic systems has been a research hotspot in control science and operations research for a long time. In this paper, we construct and analyze an actual sequential decision-making problem of dynamic system. The Model-Free reinforcement learning algorithms are used to optimize this problem. The problem is analyzed in detail through adaptive control theory and information theory, also the extreme performance of the algorithm is pointed out. In this paper, we select three classic Model-Free reinforcement learning algorithms, DQN, DQN-PER, and PPO, to compare and analyze their performance on the timing series decision problem we construct.