Model-free adjustment of reducing agent for SCR device under label deficiency: Regulation-oriented stage-wise reward deep Q-learning with transfer-learned state

IF 6.9 2区 环境科学与生态学 Q1 ENGINEERING, CHEMICAL
Han Jiang, Shucai Zhang, Jingru Liu, Xin Peng, Weimin Zhong
{"title":"Model-free adjustment of reducing agent for SCR device under label deficiency: Regulation-oriented stage-wise reward deep Q-learning with transfer-learned state","authors":"Han Jiang, Shucai Zhang, Jingru Liu, Xin Peng, Weimin Zhong","doi":"10.1016/j.psep.2024.12.126","DOIUrl":null,"url":null,"abstract":"Data-driven methods of nitrogen oxides (<ce:italic>NO</ce:italic><ce:inf loc=\"post\"><ce:italic>X</ce:italic></ce:inf>) soft-sensing and selective catalytic reduction (SCR) operation for fluid catalytic cracking (FCC) process have two terms of issues. Firstly, labeled data might be deficient to train a prediction model due to the lack of monitoring devices. Secondly, the operational data can not be directly used to learn a reinforcement learning model. To address these issues, a latent temporal feature adaptation transfer learning and long-short reward deep q-learning network (LTFATL-LSRDQN) is proposed. It transfers the knowledge from another similar FCC process to realize the soft-sensing of <ce:italic>NO</ce:italic><ce:inf loc=\"post\"><ce:italic>X</ce:italic></ce:inf>. Maximum mean discrepancy loss is introduced to the objective function of autoencoder (AE) to unify the probability distribution of transformed latent features. The operation of the treatment device is abstracted to a Markov decision process. A long- and short-term reward mechanism is introduced to DQN to constrain the selection of action. The effectiveness of LTFATL-LSRDQN is verified with the data from industrial FCC processes. The introduction of domain adaptation successfully aligns the latent features, and achieves higher soft-sensing accuracy than some state-of-the-art methods. Using the results from LTFATL as inputs, LSRDQN accomplishes more enduringly continual operation of SCR device on the premise of the regulations and constrained actions.","PeriodicalId":20743,"journal":{"name":"Process Safety and Environmental Protection","volume":"29 1","pages":""},"PeriodicalIF":6.9000,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Process Safety and Environmental Protection","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.psep.2024.12.126","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Data-driven methods of nitrogen oxides (NOX) soft-sensing and selective catalytic reduction (SCR) operation for fluid catalytic cracking (FCC) process have two terms of issues. Firstly, labeled data might be deficient to train a prediction model due to the lack of monitoring devices. Secondly, the operational data can not be directly used to learn a reinforcement learning model. To address these issues, a latent temporal feature adaptation transfer learning and long-short reward deep q-learning network (LTFATL-LSRDQN) is proposed. It transfers the knowledge from another similar FCC process to realize the soft-sensing of NOX. Maximum mean discrepancy loss is introduced to the objective function of autoencoder (AE) to unify the probability distribution of transformed latent features. The operation of the treatment device is abstracted to a Markov decision process. A long- and short-term reward mechanism is introduced to DQN to constrain the selection of action. The effectiveness of LTFATL-LSRDQN is verified with the data from industrial FCC processes. The introduction of domain adaptation successfully aligns the latent features, and achieves higher soft-sensing accuracy than some state-of-the-art methods. Using the results from LTFATL as inputs, LSRDQN accomplishes more enduringly continual operation of SCR device on the premise of the regulations and constrained actions.
标签缺失下SCR装置还原剂的无模型调节:基于迁移学习状态的基于调节的分阶段奖励深度q学习
流体催化裂化(FCC)过程中氮氧化物(NOX)软测量和选择性催化还原(SCR)操作的数据驱动方法存在两个方面的问题。首先,由于缺乏监测设备,标记的数据可能不足以训练预测模型。其次,操作数据不能直接用于学习强化学习模型。为了解决这些问题,提出了一种潜在时间特征适应迁移学习和长-短奖励深度q-学习网络(LTFATL-LSRDQN)。它借鉴了其他类似FCC工艺的知识,实现了NOX的软测量。在自编码器(AE)的目标函数中引入最大平均差异损失,统一了变换后的潜在特征的概率分布。将处理装置的运行抽象为一个马尔可夫决策过程。在DQN中引入了长期和短期奖励机制来约束行为的选择。LTFATL-LSRDQN的有效性得到了工业催化裂化过程数据的验证。领域自适应的引入成功地对齐了潜在特征,并获得了比现有方法更高的软测量精度。LSRDQN利用LTFATL的结果作为输入,在规定和约束动作的前提下,实现可控硅器件更持久的连续运行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Process Safety and Environmental Protection
Process Safety and Environmental Protection 环境科学-工程:化工
CiteScore
11.40
自引率
15.40%
发文量
929
审稿时长
8.0 months
期刊介绍: The Process Safety and Environmental Protection (PSEP) journal is a leading international publication that focuses on the publication of high-quality, original research papers in the field of engineering, specifically those related to the safety of industrial processes and environmental protection. The journal encourages submissions that present new developments in safety and environmental aspects, particularly those that show how research findings can be applied in process engineering design and practice. PSEP is particularly interested in research that brings fresh perspectives to established engineering principles, identifies unsolved problems, or suggests directions for future research. The journal also values contributions that push the boundaries of traditional engineering and welcomes multidisciplinary papers. PSEP's articles are abstracted and indexed by a range of databases and services, which helps to ensure that the journal's research is accessible and recognized in the academic and professional communities. These databases include ANTE, Chemical Abstracts, Chemical Hazards in Industry, Current Contents, Elsevier Engineering Information database, Pascal Francis, Web of Science, Scopus, Engineering Information Database EnCompass LIT (Elsevier), and INSPEC. This wide coverage facilitates the dissemination of the journal's content to a global audience interested in process safety and environmental engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信