{"title":"基于离线强化学习的乙烯裂解炉进料策略","authors":"Haojun Zhong, Zhenlei Wang, Yuzhe Hao","doi":"10.1016/j.compchemeng.2024.108864","DOIUrl":null,"url":null,"abstract":"<div><p>The feeding process of the ethylene cracking furnace necessitates the synchronized adjustment of multiple controlled factors. The process mainly relies on operators to do it manually, which is burdensome and may lead to significant variations in coil out temperature (COT) due to the differing expertise of operators. This paper proposes a method for learning the feeding strategy of the ethylene cracking furnace using offline reinforcement learning. The agent learns and optimizes the operating strategy directly from datasets, eliminating the need for sophisticated process simulator modeling. In addition, the advantage function is incorporated into the Twin Delayed Deep Deterministic Behavioral Cloning (TD3BC) algorithm, which enables the agent to acquire more effective operational experience. The proposed method is initially evaluated using benchmark datasets. Further, the proposed method is validated through comparative experiments on a feeding process validation model, demonstrating superior rewards and outperforming manual operating experience as well as other offline reinforcement learning methods.</p></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"192 ","pages":"Article 108864"},"PeriodicalIF":3.9000,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098135424002825/pdfft?md5=85eb0196a921c27b8bd237adcbca4c1d&pid=1-s2.0-S0098135424002825-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Offline reinforcement learning based feeding strategy of ethylene cracking furnace\",\"authors\":\"Haojun Zhong, Zhenlei Wang, Yuzhe Hao\",\"doi\":\"10.1016/j.compchemeng.2024.108864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The feeding process of the ethylene cracking furnace necessitates the synchronized adjustment of multiple controlled factors. The process mainly relies on operators to do it manually, which is burdensome and may lead to significant variations in coil out temperature (COT) due to the differing expertise of operators. This paper proposes a method for learning the feeding strategy of the ethylene cracking furnace using offline reinforcement learning. The agent learns and optimizes the operating strategy directly from datasets, eliminating the need for sophisticated process simulator modeling. In addition, the advantage function is incorporated into the Twin Delayed Deep Deterministic Behavioral Cloning (TD3BC) algorithm, which enables the agent to acquire more effective operational experience. The proposed method is initially evaluated using benchmark datasets. Further, the proposed method is validated through comparative experiments on a feeding process validation model, demonstrating superior rewards and outperforming manual operating experience as well as other offline reinforcement learning methods.</p></div>\",\"PeriodicalId\":286,\"journal\":{\"name\":\"Computers & Chemical Engineering\",\"volume\":\"192 \",\"pages\":\"Article 108864\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S0098135424002825/pdfft?md5=85eb0196a921c27b8bd237adcbca4c1d&pid=1-s2.0-S0098135424002825-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Chemical Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0098135424002825\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135424002825","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Offline reinforcement learning based feeding strategy of ethylene cracking furnace
The feeding process of the ethylene cracking furnace necessitates the synchronized adjustment of multiple controlled factors. The process mainly relies on operators to do it manually, which is burdensome and may lead to significant variations in coil out temperature (COT) due to the differing expertise of operators. This paper proposes a method for learning the feeding strategy of the ethylene cracking furnace using offline reinforcement learning. The agent learns and optimizes the operating strategy directly from datasets, eliminating the need for sophisticated process simulator modeling. In addition, the advantage function is incorporated into the Twin Delayed Deep Deterministic Behavioral Cloning (TD3BC) algorithm, which enables the agent to acquire more effective operational experience. The proposed method is initially evaluated using benchmark datasets. Further, the proposed method is validated through comparative experiments on a feeding process validation model, demonstrating superior rewards and outperforming manual operating experience as well as other offline reinforcement learning methods.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.