{"title":"Neural Operator based Reinforcement Learning for Control of first-order PDEs with Spatially-Varying State Delay⁎","authors":"Jiaqi Hu , Jie Qi , Jing Zhang","doi":"10.1016/j.ifacol.2025.08.085","DOIUrl":null,"url":null,"abstract":"<div><div>Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we design a integrated control strategy combining PDE backstepping and deep reinforcement learning (RL) for an unstable first-order hyperbolic PDE with spatially-varying delays. This method eliminates extra constraint on the delay function required for the backstepping design. We embed a DeepONet, trained to learn the backstepping controller, into a soft actor-critic (SAC) framework as a feature extractor for both the actor and critic networks. Simulation results demonstrate that the proposed algorithm outperforms standard SAC in reducing steady-state error and surpasses the backstepping controller in mitigating overshoot.</div></div>","PeriodicalId":37894,"journal":{"name":"IFAC-PapersOnLine","volume":"59 8","pages":"Pages 167-172"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IFAC-PapersOnLine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S240589632500669X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
Control of distributed parameter systems affected by delays is a challenging task, particularly when the delays depend on spatial variables. The idea of integrating analytical control theory with learning-based control within a unified control scheme is becoming increasingly promising and advantageous. In this paper, we design a integrated control strategy combining PDE backstepping and deep reinforcement learning (RL) for an unstable first-order hyperbolic PDE with spatially-varying delays. This method eliminates extra constraint on the delay function required for the backstepping design. We embed a DeepONet, trained to learn the backstepping controller, into a soft actor-critic (SAC) framework as a feature extractor for both the actor and critic networks. Simulation results demonstrate that the proposed algorithm outperforms standard SAC in reducing steady-state error and surpasses the backstepping controller in mitigating overshoot.
期刊介绍:
All papers from IFAC meetings are published, in partnership with Elsevier, the IFAC Publisher, in theIFAC-PapersOnLine proceedings series hosted at the ScienceDirect web service. This series includes papers previously published in the IFAC website.The main features of the IFAC-PapersOnLine series are: -Online archive including papers from IFAC Symposia, Congresses, Conferences, and most Workshops. -All papers accepted at the meeting are published in PDF format - searchable and citable. -All papers published on the web site can be cited using the IFAC PapersOnLine ISSN and the individual paper DOI (Digital Object Identifier). The site is Open Access in nature - no charge is made to individuals for reading or downloading. Copyright of all papers belongs to IFAC and must be referenced if derivative journal papers are produced from the conference papers. All papers published in IFAC-PapersOnLine have undergone a peer review selection process according to the IFAC rules.