{"title":"Online adaptive surrogates for the value function of high-dimensional nonlinear optimal control problems⁎","authors":"Tobias Ehring , Bernard Haasdonk","doi":"10.1016/j.ifacol.2025.03.057","DOIUrl":null,"url":null,"abstract":"<div><div>We introduce a strategy that generates an adaptive surrogate of the value function of high-dimensional nonlinear optimal control problems. It exploits the relevant operating domain online on which the resulting surrogate satisfies the Hamilton–Jacobi–Bellman (HJB) equation up to a given threshold. The approximate value function is based on Hermite kernel regression, where the data stems from open-loop control of reduced-order optimal control problems. As a measure of accuracy, the full-order HJB residual, known as the Bellman error, is used to determine whether the current Hermite kernel surrogate is sufficient or further training is required. In addition, the reduced-order model can also be improved using the full-order data if the same HJB-based error indicator suggests that the current reduced system is not accurate enough. Numerical experiments support the effectiveness of the new scheme.</div></div>","PeriodicalId":37894,"journal":{"name":"IFAC-PapersOnLine","volume":"59 1","pages":"Pages 331-336"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IFAC-PapersOnLine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405896325002745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 0
Abstract
We introduce a strategy that generates an adaptive surrogate of the value function of high-dimensional nonlinear optimal control problems. It exploits the relevant operating domain online on which the resulting surrogate satisfies the Hamilton–Jacobi–Bellman (HJB) equation up to a given threshold. The approximate value function is based on Hermite kernel regression, where the data stems from open-loop control of reduced-order optimal control problems. As a measure of accuracy, the full-order HJB residual, known as the Bellman error, is used to determine whether the current Hermite kernel surrogate is sufficient or further training is required. In addition, the reduced-order model can also be improved using the full-order data if the same HJB-based error indicator suggests that the current reduced system is not accurate enough. Numerical experiments support the effectiveness of the new scheme.
期刊介绍:
All papers from IFAC meetings are published, in partnership with Elsevier, the IFAC Publisher, in theIFAC-PapersOnLine proceedings series hosted at the ScienceDirect web service. This series includes papers previously published in the IFAC website.The main features of the IFAC-PapersOnLine series are: -Online archive including papers from IFAC Symposia, Congresses, Conferences, and most Workshops. -All papers accepted at the meeting are published in PDF format - searchable and citable. -All papers published on the web site can be cited using the IFAC PapersOnLine ISSN and the individual paper DOI (Digital Object Identifier). The site is Open Access in nature - no charge is made to individuals for reading or downloading. Copyright of all papers belongs to IFAC and must be referenced if derivative journal papers are produced from the conference papers. All papers published in IFAC-PapersOnLine have undergone a peer review selection process according to the IFAC rules.