{"title":"利用强化学习方法对逆向流动过程进行自适应温度控制","authors":"A. Binid , I. Aksikas , M.A. Mabrok , N. Meskin","doi":"10.1016/j.jprocont.2024.103259","DOIUrl":null,"url":null,"abstract":"<div><p>This work focuses on the design of an optimal adaptive control system for temperature regulation in a catalytic flow reversal reactor (CFRR), utilizing a reinforcement learning (RL) approach. First, a policy iteration algorithm is introduced to learn the optimal solution of the associated linear-quadratic control problem online. It should be mentioned that this approach is not reliant on the internal dynamics of the CFRR system, which is a complex process and is most effectively modeled using Partial Differential Equations (PDEs). The convergence of the iteration algorithm is established, assuming the initial policy is stabilizing. Additionally, a second algorithm is presented to enhance the implementability of the reinforcement learning algorithm from a practical perspective. Numerical simulations are carried out to illustrate the efficacy of the proposed algorithm.</p></div>","PeriodicalId":50079,"journal":{"name":"Journal of Process Control","volume":"140 ","pages":"Article 103259"},"PeriodicalIF":3.3000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0959152424000994/pdfft?md5=702c8908d85111642c81f7faccf8348f&pid=1-s2.0-S0959152424000994-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Adaptive temperature control of a reverse flow process by using reinforcement learning approach\",\"authors\":\"A. Binid , I. Aksikas , M.A. Mabrok , N. Meskin\",\"doi\":\"10.1016/j.jprocont.2024.103259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This work focuses on the design of an optimal adaptive control system for temperature regulation in a catalytic flow reversal reactor (CFRR), utilizing a reinforcement learning (RL) approach. First, a policy iteration algorithm is introduced to learn the optimal solution of the associated linear-quadratic control problem online. It should be mentioned that this approach is not reliant on the internal dynamics of the CFRR system, which is a complex process and is most effectively modeled using Partial Differential Equations (PDEs). The convergence of the iteration algorithm is established, assuming the initial policy is stabilizing. Additionally, a second algorithm is presented to enhance the implementability of the reinforcement learning algorithm from a practical perspective. Numerical simulations are carried out to illustrate the efficacy of the proposed algorithm.</p></div>\",\"PeriodicalId\":50079,\"journal\":{\"name\":\"Journal of Process Control\",\"volume\":\"140 \",\"pages\":\"Article 103259\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0959152424000994/pdfft?md5=702c8908d85111642c81f7faccf8348f&pid=1-s2.0-S0959152424000994-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Process Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0959152424000994\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Process Control","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959152424000994","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Adaptive temperature control of a reverse flow process by using reinforcement learning approach
This work focuses on the design of an optimal adaptive control system for temperature regulation in a catalytic flow reversal reactor (CFRR), utilizing a reinforcement learning (RL) approach. First, a policy iteration algorithm is introduced to learn the optimal solution of the associated linear-quadratic control problem online. It should be mentioned that this approach is not reliant on the internal dynamics of the CFRR system, which is a complex process and is most effectively modeled using Partial Differential Equations (PDEs). The convergence of the iteration algorithm is established, assuming the initial policy is stabilizing. Additionally, a second algorithm is presented to enhance the implementability of the reinforcement learning algorithm from a practical perspective. Numerical simulations are carried out to illustrate the efficacy of the proposed algorithm.
期刊介绍:
This international journal covers the application of control theory, operations research, computer science and engineering principles to the solution of process control problems. In addition to the traditional chemical processing and manufacturing applications, the scope of process control problems involves a wide range of applications that includes energy processes, nano-technology, systems biology, bio-medical engineering, pharmaceutical processing technology, energy storage and conversion, smart grid, and data analytics among others.
Papers on the theory in these areas will also be accepted provided the theoretical contribution is aimed at the application and the development of process control techniques.
Topics covered include:
• Control applications• Process monitoring• Plant-wide control• Process control systems• Control techniques and algorithms• Process modelling and simulation• Design methods
Advanced design methods exclude well established and widely studied traditional design techniques such as PID tuning and its many variants. Applications in fields such as control of automotive engines, machinery and robotics are not deemed suitable unless a clear motivation for the relevance to process control is provided.