{"title":"基于q函数的未知扰动连续双输入系统自适应鲁棒控制","authors":"Yongfeng Lv, Zhengyu Cui, Minlin Wang","doi":"10.1109/DDCLS58216.2023.10166557","DOIUrl":null,"url":null,"abstract":"Considering overshoot and chatter of the multi-input system with unknown interference, this paper studies the adaptive robust optimal controls of continuous-time two-input systems with an approximate dynamic programming (ADP) based Q-function scheme. A complex Hamilton-Jacobi-Issacs (HJI) equation is obtained with the two-input system and the zero-game theory, where a value function is constructed. Solving the HJI equation is a challenging task. Thus, an ADP-based Q-function with a neural network is constructed to learn the saddle point of the HJI equation. Simultaneously, an integral reinforcement signal of the critic networks is introduced such that the system drift and input dynamics in the HJI equation are relaxed when studying the saddle-point intractable solution. Then, the adaptive robust optimal actor and worst disturbance are approximated with another three networks. Finally, an F-16 aircraft plant is used to verify the proposed ADP-based Q-function.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Adaptive robust control of the continuous-time two-input systems with unknown disturbance based on Q-function\",\"authors\":\"Yongfeng Lv, Zhengyu Cui, Minlin Wang\",\"doi\":\"10.1109/DDCLS58216.2023.10166557\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Considering overshoot and chatter of the multi-input system with unknown interference, this paper studies the adaptive robust optimal controls of continuous-time two-input systems with an approximate dynamic programming (ADP) based Q-function scheme. A complex Hamilton-Jacobi-Issacs (HJI) equation is obtained with the two-input system and the zero-game theory, where a value function is constructed. Solving the HJI equation is a challenging task. Thus, an ADP-based Q-function with a neural network is constructed to learn the saddle point of the HJI equation. Simultaneously, an integral reinforcement signal of the critic networks is introduced such that the system drift and input dynamics in the HJI equation are relaxed when studying the saddle-point intractable solution. Then, the adaptive robust optimal actor and worst disturbance are approximated with another three networks. Finally, an F-16 aircraft plant is used to verify the proposed ADP-based Q-function.\",\"PeriodicalId\":415532,\"journal\":{\"name\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DDCLS58216.2023.10166557\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DDCLS58216.2023.10166557","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive robust control of the continuous-time two-input systems with unknown disturbance based on Q-function
Considering overshoot and chatter of the multi-input system with unknown interference, this paper studies the adaptive robust optimal controls of continuous-time two-input systems with an approximate dynamic programming (ADP) based Q-function scheme. A complex Hamilton-Jacobi-Issacs (HJI) equation is obtained with the two-input system and the zero-game theory, where a value function is constructed. Solving the HJI equation is a challenging task. Thus, an ADP-based Q-function with a neural network is constructed to learn the saddle point of the HJI equation. Simultaneously, an integral reinforcement signal of the critic networks is introduced such that the system drift and input dynamics in the HJI equation are relaxed when studying the saddle-point intractable solution. Then, the adaptive robust optimal actor and worst disturbance are approximated with another three networks. Finally, an F-16 aircraft plant is used to verify the proposed ADP-based Q-function.