Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay⁎

Q3 Engineering
Jiaqi Hu , Jie Qi , Jing Zhang
{"title":"Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay⁎","authors":"Jiaqi Hu ,&nbsp;Jie Qi ,&nbsp;Jing Zhang","doi":"10.1016/j.ifacol.2025.08.085","DOIUrl":null,"url":null,"abstract":"<div><div>Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we design a integrated control strategy combining PDE backstepping and deep reinforcement learning (RL) for an unstable first-order hyperbolic PDE with spatially-varying delays. This method eliminates extra constraint on the delay function required for the backstepping design. We embed a DeepONet, trained to learn the backstepping controller, into a soft actor-critic (SAC) framework as a feature extractor for both the actor and critic networks. Simulation results demonstrate that the proposed algorithm outperforms standard SAC in reducing steady-state error and surpasses the backstepping controller in mitigating overshoot.</div></div>","PeriodicalId":37894,"journal":{"name":"IFAC-PapersOnLine","volume":"59 8","pages":"Pages 167-172"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IFAC-PapersOnLine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S240589632500669X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0

Abstract

Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we design a integrated control strategy combining PDE backstepping and deep reinforcement learning (RL) for an unstable first-order hyperbolic PDE with spatially-varying delays. This method eliminates extra constraint on the delay function required for the backstepping design. We embed a DeepONet, trained to learn the backstepping controller, into a soft actor-critic (SAC) framework as a feature extractor for both the actor and critic networks. Simulation results demonstrate that the proposed algorithm outperforms standard SAC in reducing steady-state error and surpasses the backstepping controller in mitigating overshoot.
基于神经算子的一阶状态时滞微分方程的强化学习控制[j]
受延迟影响的分布参数系统的控制是一项具有挑战性的任务,特别是当延迟依赖于空间变量时。将分析控制理论与基于学习的控制结合在一个统一的控制方案中的想法正变得越来越有前途和优势。针对一类具有空间变化时滞的不稳定一阶双曲PDE,设计了一种结合PDE反演和深度强化学习(RL)的集成控制策略。该方法消除了反推设计对延迟函数的额外约束。我们将一个DeepONet嵌入到一个软演员-评论家(SAC)框架中,作为演员和评论家网络的特征提取器。仿真结果表明,该算法在减小稳态误差方面优于标准SAC,在减小超调方面优于后退控制器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
IFAC-PapersOnLine
IFAC-PapersOnLine Engineering-Control and Systems Engineering
CiteScore
1.70
自引率
0.00%
发文量
1122
期刊介绍: All papers from IFAC meetings are published, in partnership with Elsevier, the IFAC Publisher, in theIFAC-PapersOnLine proceedings series hosted at the ScienceDirect web service. This series includes papers previously published in the IFAC website.The main features of the IFAC-PapersOnLine series are: -Online archive including papers from IFAC Symposia, Congresses, Conferences, and most Workshops. -All papers accepted at the meeting are published in PDF format - searchable and citable. -All papers published on the web site can be cited using the IFAC PapersOnLine ISSN and the individual paper DOI (Digital Object Identifier). The site is Open Access in nature - no charge is made to individuals for reading or downloading. Copyright of all papers belongs to IFAC and must be referenced if derivative journal papers are produced from the conference papers. All papers published in IFAC-PapersOnLine have undergone a peer review selection process according to the IFAC rules.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信