{"title":"Safe reinforcement learning for optimization of batch processes with uncertainties: A Bayesian predictive exploration approach","authors":"Jingsheng Qin , Lingjian Ye , Xiaofeng Yuan","doi":"10.1016/j.compchemeng.2025.109391","DOIUrl":null,"url":null,"abstract":"<div><div>Optimization of batch processes is a challenging task due to their complex non-linear dynamics and various uncertainties. Recently, Reinforcement learning (RL) has been recognized as a promising alternative to solving this challenging problem. In this paper, we present a new safe RL method which is referred to as the Bayesian Predictive Exploration Approach. Firstly, the Bayesian neural networks (BNN) are introduced with variational mixture posteriors to represent the value function distributions, such that uncertainties can be more efficiently characterized. For the sake of safe explorations, we evaluate the profits and safety-risks by exploring multiple future decisions. The decisions are optimized to maximize the expected profit while avoiding constraint violations in the face of stochastic uncertainties. Both the expectations and variances of rewards and safety-risks are taken into considerations within the learning process. Finally, the effectiveness of the proposed approach is illustrated on two batch process examples.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"204 ","pages":"Article 109391"},"PeriodicalIF":3.9000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425003941","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Optimization of batch processes is a challenging task due to their complex non-linear dynamics and various uncertainties. Recently, Reinforcement learning (RL) has been recognized as a promising alternative to solving this challenging problem. In this paper, we present a new safe RL method which is referred to as the Bayesian Predictive Exploration Approach. Firstly, the Bayesian neural networks (BNN) are introduced with variational mixture posteriors to represent the value function distributions, such that uncertainties can be more efficiently characterized. For the sake of safe explorations, we evaluate the profits and safety-risks by exploring multiple future decisions. The decisions are optimized to maximize the expected profit while avoiding constraint violations in the face of stochastic uncertainties. Both the expectations and variances of rewards and safety-risks are taken into considerations within the learning process. Finally, the effectiveness of the proposed approach is illustrated on two batch process examples.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.