{"title":"X33车辆模型的强化学习与非线性控制","authors":"B. Costa, Francisco L. Parente, J. M. Lemos","doi":"10.1109/ICCAD55197.2022.9853874","DOIUrl":null,"url":null,"abstract":"This paper explores the application of nonlinear control and reinforcement learning to control a model of X33 reentry vehicle. The control problem is formulated considering the gliding phase of the X33 spacecraft model. During this phase, no thrust is applied and wind disturbances may change the path of the spacecraft from the reference path. Several difficulties were present when using the reinforcement learning controller. The starting of the controller, the convergence of the controller gains and their relation to the excitation noise, and the available time to learn.","PeriodicalId":436377,"journal":{"name":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reinforcement Learning and Nonlinear Control of a X33 Vehicle Model ⋆\",\"authors\":\"B. Costa, Francisco L. Parente, J. M. Lemos\",\"doi\":\"10.1109/ICCAD55197.2022.9853874\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper explores the application of nonlinear control and reinforcement learning to control a model of X33 reentry vehicle. The control problem is formulated considering the gliding phase of the X33 spacecraft model. During this phase, no thrust is applied and wind disturbances may change the path of the spacecraft from the reference path. Several difficulties were present when using the reinforcement learning controller. The starting of the controller, the convergence of the controller gains and their relation to the excitation noise, and the available time to learn.\",\"PeriodicalId\":436377,\"journal\":{\"name\":\"2022 International Conference on Control, Automation and Diagnosis (ICCAD)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Control, Automation and Diagnosis (ICCAD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCAD55197.2022.9853874\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Control, Automation and Diagnosis (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAD55197.2022.9853874","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning and Nonlinear Control of a X33 Vehicle Model ⋆
This paper explores the application of nonlinear control and reinforcement learning to control a model of X33 reentry vehicle. The control problem is formulated considering the gliding phase of the X33 spacecraft model. During this phase, no thrust is applied and wind disturbances may change the path of the spacecraft from the reference path. Several difficulties were present when using the reinforcement learning controller. The starting of the controller, the convergence of the controller gains and their relation to the excitation noise, and the available time to learn.