{"title":"基于近似动态规划的自适应最优观测器设计","authors":"J. Na, G. Herrmann, K. Vamvoudakis","doi":"10.23919/ACC.2017.7963454","DOIUrl":null,"url":null,"abstract":"This paper presents an optimal observer design framework using a recently emerging method, approximate dynamic programming (ADP), to minimize a predefined cost function. We first exploit the duality between the linear optimal observer and the linear quadratic tracking (LQT) control. We show that the optimal observer design can be formulated as an optimal control problem subject to a specific cost function, and thus the solution can be obtained by solving an algebraic Riccati equation (ARE). For nonlinear systems, we further introduce an optimal observer design formulation and suggest a modified policy iteration method. Finally, to solve the problem online we propose a framework based on ADP and specifically on an approximator structure. Namely, a critic approximator is used to estimate the optimal value function, and a newly developed tuning law is proposed to find the parameters online. The stability and the performance are guaranteed with rigorous proofs. Numerical simulations are given to validate the theoretical studies.","PeriodicalId":422926,"journal":{"name":"2017 American Control Conference (ACC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Adaptive optimal observer design via approximate dynamic programming\",\"authors\":\"J. Na, G. Herrmann, K. Vamvoudakis\",\"doi\":\"10.23919/ACC.2017.7963454\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an optimal observer design framework using a recently emerging method, approximate dynamic programming (ADP), to minimize a predefined cost function. We first exploit the duality between the linear optimal observer and the linear quadratic tracking (LQT) control. We show that the optimal observer design can be formulated as an optimal control problem subject to a specific cost function, and thus the solution can be obtained by solving an algebraic Riccati equation (ARE). For nonlinear systems, we further introduce an optimal observer design formulation and suggest a modified policy iteration method. Finally, to solve the problem online we propose a framework based on ADP and specifically on an approximator structure. Namely, a critic approximator is used to estimate the optimal value function, and a newly developed tuning law is proposed to find the parameters online. The stability and the performance are guaranteed with rigorous proofs. Numerical simulations are given to validate the theoretical studies.\",\"PeriodicalId\":422926,\"journal\":{\"name\":\"2017 American Control Conference (ACC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 American Control Conference (ACC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/ACC.2017.7963454\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 American Control Conference (ACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/ACC.2017.7963454","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive optimal observer design via approximate dynamic programming
This paper presents an optimal observer design framework using a recently emerging method, approximate dynamic programming (ADP), to minimize a predefined cost function. We first exploit the duality between the linear optimal observer and the linear quadratic tracking (LQT) control. We show that the optimal observer design can be formulated as an optimal control problem subject to a specific cost function, and thus the solution can be obtained by solving an algebraic Riccati equation (ARE). For nonlinear systems, we further introduce an optimal observer design formulation and suggest a modified policy iteration method. Finally, to solve the problem online we propose a framework based on ADP and specifically on an approximator structure. Namely, a critic approximator is used to estimate the optimal value function, and a newly developed tuning law is proposed to find the parameters online. The stability and the performance are guaranteed with rigorous proofs. Numerical simulations are given to validate the theoretical studies.