Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay⁎

Q3 Engineering

IFAC-PapersOnLine Pub Date : 2025-01-01 DOI:10.1016/j.ifacol.2025.08.085

Jiaqi Hu , Jie Qi , Jing Zhang

引用次数: 0

Abstract

Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we design a integrated control strategy combining PDE backstepping and deep reinforcement learning (RL) for an unstable first-order hyperbolic PDE with spatially-varying delays. This method eliminates extra constraint on the delay function required for the backstepping design. We embed a DeepONet, trained to learn the backstepping controller, into a soft actor-critic (SAC) framework as a feature extractor for both the actor and critic networks. Simulation results demonstrate that the proposed algorithm outperforms standard SAC in reducing steady-state error and surpasses the backstepping controller in mitigating overshoot.

查看原文本刊更多论文

基于神经算子的一阶状态时滞微分方程的强化学习控制[j]

受延迟影响的分布参数系统的控制是一项具有挑战性的任务，特别是当延迟依赖于空间变量时。将分析控制理论与基于学习的控制结合在一个统一的控制方案中的想法正变得越来越有前途和优势。针对一类具有空间变化时滞的不稳定一阶双曲PDE，设计了一种结合PDE反演和深度强化学习（RL）的集成控制策略。该方法消除了反推设计对延迟函数的额外约束。我们将一个DeepONet嵌入到一个软演员-评论家（SAC）框架中，作为演员和评论家网络的特征提取器。仿真结果表明，该算法在减小稳态误差方面优于标准SAC，在减小超调方面优于后退控制器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IFAC-PapersOnLine Engineering-Control and Systems Engineering

CiteScore

1.70

自引率

0.00%

发文量

1122

期刊介绍： All papers from IFAC meetings are published, in partnership with Elsevier, the IFAC Publisher, in theIFAC-PapersOnLine proceedings series hosted at the ScienceDirect web service. This series includes papers previously published in the IFAC website.The main features of the IFAC-PapersOnLine series are: -Online archive including papers from IFAC Symposia, Congresses, Conferences, and most Workshops. -All papers accepted at the meeting are published in PDF format - searchable and citable. -All papers published on the web site can be cited using the IFAC PapersOnLine ISSN and the individual paper DOI (Digital Object Identifier). The site is Open Access in nature - no charge is made to individuals for reading or downloading. Copyright of all papers belongs to IFAC and must be referenced if derivative journal papers are produced from the conference papers. All papers published in IFAC-PapersOnLine have undergone a peer review selection process according to the IFAC rules.