{"title":"Learning MPC: System stability and convergent identification under bounded modelling error","authors":"Kacper Grzędziński, P. Trodden","doi":"10.1109/ANZCC.2018.8606603","DOIUrl":null,"url":null,"abstract":"The dual-control problem of simultaneously regulating a plant and identifying its dynamics is addressed in a linear model predictive control (MPC) framework. We propose and study an approach where two optimal control problems are solved in series, online: the first is a standard regulating linear quadratic MPC problem, which achieves exponential nominal stability; the second is an ancillary optimization problem that promotes persistency of excitation, in order to facilitate online learning of the system dynamics. Maximization of the minimum eigenvalue of predicted information matrix increments, in a recursive least squares (RLS) scheme, is used. Our main result, achieved by employing inherent robustness from exponentially stabilizing MPC, establishes a bound on the permitted magnitude of excitation applied by the secondary optimization; in terms of the system model error. Satisfaction of this bound guarantees stability of the closed-loop uncertain system, despite the exciting perturbations.","PeriodicalId":358801,"journal":{"name":"2018 Australian & New Zealand Control Conference (ANZCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Australian & New Zealand Control Conference (ANZCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ANZCC.2018.8606603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The dual-control problem of simultaneously regulating a plant and identifying its dynamics is addressed in a linear model predictive control (MPC) framework. We propose and study an approach where two optimal control problems are solved in series, online: the first is a standard regulating linear quadratic MPC problem, which achieves exponential nominal stability; the second is an ancillary optimization problem that promotes persistency of excitation, in order to facilitate online learning of the system dynamics. Maximization of the minimum eigenvalue of predicted information matrix increments, in a recursive least squares (RLS) scheme, is used. Our main result, achieved by employing inherent robustness from exponentially stabilizing MPC, establishes a bound on the permitted magnitude of excitation applied by the secondary optimization; in terms of the system model error. Satisfaction of this bound guarantees stability of the closed-loop uncertain system, despite the exciting perturbations.