Offline constrained reinforcement learning for batch-to-batch optimization of cobalt oxalate synthesis process

IF 3.7 3区 工程技术 Q2 ENGINEERING, CHEMICAL
{"title":"Offline constrained reinforcement learning for batch-to-batch optimization of cobalt oxalate synthesis process","authors":"","doi":"10.1016/j.cherd.2024.08.013","DOIUrl":null,"url":null,"abstract":"<div><p>The cobalt oxalate synthesis, a batch process, plays a crucial role in the refinement of cobalt metal. The mean particle size of cobalt oxalate is a critical indicator that reflects product quality. However, excessive ammonium oxalate solution flow can heighten waste disposal costs in the production process. To address these issues, we propose a novel offline reinforcement learning (RL) algorithm that guarantees compliance with constraints in the cobalt oxalate synthesis process, utilizing exclusively static datasets. This method employs cost critic networks to assess costs, transforming the constrained optimization problem into an unconstrained one by introducing Lagrangian multipliers. We use exponential moving average (EMA) to optimize the update of proportional integral derivative (PID) control multipliers, reduce overshoot and oscillation in the control process, and thus improve the overall stability of the system. Furthermore, to optimize algorithm performance, a deep residual network (DResNet) is integrated into the policy network. Experimental results indicate that the algorithm’s optimization policy performs significantly better under constraints.</p></div>","PeriodicalId":10019,"journal":{"name":"Chemical Engineering Research & Design","volume":null,"pages":null},"PeriodicalIF":3.7000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Engineering Research & Design","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026387622400491X","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The cobalt oxalate synthesis, a batch process, plays a crucial role in the refinement of cobalt metal. The mean particle size of cobalt oxalate is a critical indicator that reflects product quality. However, excessive ammonium oxalate solution flow can heighten waste disposal costs in the production process. To address these issues, we propose a novel offline reinforcement learning (RL) algorithm that guarantees compliance with constraints in the cobalt oxalate synthesis process, utilizing exclusively static datasets. This method employs cost critic networks to assess costs, transforming the constrained optimization problem into an unconstrained one by introducing Lagrangian multipliers. We use exponential moving average (EMA) to optimize the update of proportional integral derivative (PID) control multipliers, reduce overshoot and oscillation in the control process, and thus improve the overall stability of the system. Furthermore, to optimize algorithm performance, a deep residual network (DResNet) is integrated into the policy network. Experimental results indicate that the algorithm’s optimization policy performs significantly better under constraints.

Abstract Image

用于草酸钴合成工艺批次间优化的离线约束强化学习
草酸钴合成是一种间歇式工艺,在金属钴的精炼过程中起着至关重要的作用。草酸钴的平均粒度是反映产品质量的关键指标。然而,草酸铵溶液流量过大会增加生产过程中的废物处理成本。为解决这些问题,我们提出了一种新颖的离线强化学习(RL)算法,该算法完全利用静态数据集,可确保草酸钴合成过程中符合约束条件。该方法采用成本批判网络来评估成本,通过引入拉格朗日乘数将约束优化问题转化为无约束优化问题。我们使用指数移动平均(EMA)来优化比例积分导数(PID)控制乘数的更新,减少控制过程中的过冲和振荡,从而提高系统的整体稳定性。此外,为了优化算法性能,还在策略网络中集成了深度残差网络(DResNet)。实验结果表明,该算法的优化策略在约束条件下表现明显更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Chemical Engineering Research & Design
Chemical Engineering Research & Design 工程技术-工程:化工
CiteScore
6.10
自引率
7.70%
发文量
623
审稿时长
42 days
期刊介绍: ChERD aims to be the principal international journal for publication of high quality, original papers in chemical engineering. Papers showing how research results can be used in chemical engineering design, and accounts of experimental or theoretical research work bringing new perspectives to established principles, highlighting unsolved problems or indicating directions for future research, are particularly welcome. Contributions that deal with new developments in plant or processes and that can be given quantitative expression are encouraged. The journal is especially interested in papers that extend the boundaries of traditional chemical engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信