{"title":"具有输入输出约束的非线性系统的实用强化学习控制设计","authors":"Hesam Hassanpour , Brandon Corbett , Prashant Mhaskar","doi":"10.1016/j.compchemeng.2025.109248","DOIUrl":null,"url":null,"abstract":"<div><div>In this work, a practically implementable reinforcement learning (RL)-based controller is designed to handle process input and output constraints. In a typical RL problem, an RL agent is employed to learn an optimal control policy through interactions with the environment. This is unimplementable in practical situations due to the excessive exploration needed by the RL-based controller and exacerbated by the possible violation of the input and output constraints. We previously proposed an implementable RL controller that can circumvent random exploration needs by leveraging existing model predictive control (MPC) to pre-train/warm start the RL agent. The pre-trained agent is subsequently employed in real-time to engage with the process to improve its performance by gaining more knowledge about the nonlinear behavior of the system. This work generalizes our previous method to handle constraints on the outputs and the rate of change of the inputs by modifying the reward function. The effectiveness of the proposed algorithm is illustrated through simulations conducted for control of a pH neutralization process. The findings indicate that the proposed RL method enhances closed-loop performance in comparison to the nominal MPC while satisfying all input and output constraints.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"201 ","pages":"Article 109248"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A practical reinforcement learning control design for nonlinear systems with input and output constraints\",\"authors\":\"Hesam Hassanpour , Brandon Corbett , Prashant Mhaskar\",\"doi\":\"10.1016/j.compchemeng.2025.109248\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this work, a practically implementable reinforcement learning (RL)-based controller is designed to handle process input and output constraints. In a typical RL problem, an RL agent is employed to learn an optimal control policy through interactions with the environment. This is unimplementable in practical situations due to the excessive exploration needed by the RL-based controller and exacerbated by the possible violation of the input and output constraints. We previously proposed an implementable RL controller that can circumvent random exploration needs by leveraging existing model predictive control (MPC) to pre-train/warm start the RL agent. The pre-trained agent is subsequently employed in real-time to engage with the process to improve its performance by gaining more knowledge about the nonlinear behavior of the system. This work generalizes our previous method to handle constraints on the outputs and the rate of change of the inputs by modifying the reward function. The effectiveness of the proposed algorithm is illustrated through simulations conducted for control of a pH neutralization process. The findings indicate that the proposed RL method enhances closed-loop performance in comparison to the nominal MPC while satisfying all input and output constraints.</div></div>\",\"PeriodicalId\":286,\"journal\":{\"name\":\"Computers & Chemical Engineering\",\"volume\":\"201 \",\"pages\":\"Article 109248\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2025-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Chemical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0098135425002522\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425002522","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A practical reinforcement learning control design for nonlinear systems with input and output constraints
In this work, a practically implementable reinforcement learning (RL)-based controller is designed to handle process input and output constraints. In a typical RL problem, an RL agent is employed to learn an optimal control policy through interactions with the environment. This is unimplementable in practical situations due to the excessive exploration needed by the RL-based controller and exacerbated by the possible violation of the input and output constraints. We previously proposed an implementable RL controller that can circumvent random exploration needs by leveraging existing model predictive control (MPC) to pre-train/warm start the RL agent. The pre-trained agent is subsequently employed in real-time to engage with the process to improve its performance by gaining more knowledge about the nonlinear behavior of the system. This work generalizes our previous method to handle constraints on the outputs and the rate of change of the inputs by modifying the reward function. The effectiveness of the proposed algorithm is illustrated through simulations conducted for control of a pH neutralization process. The findings indicate that the proposed RL method enhances closed-loop performance in comparison to the nominal MPC while satisfying all input and output constraints.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.