Luy Tan Nguyen, N. T. Dang, Dang Quang Minh, Tran Hong Vinh
{"title":"Machine learning based-distributed optimal control algorithm for multiple nonlinear agents with input constraints","authors":"Luy Tan Nguyen, N. T. Dang, Dang Quang Minh, Tran Hong Vinh","doi":"10.1109/NICS.2018.8606873","DOIUrl":null,"url":null,"abstract":"This paper utilizes the machine learning theory to propose an algorithm for solving the distributed optimal control of multiple nonlinear agents with saturating actuators. Unlike the existing algorithm based on an critic/actor/disturber framework with three neural networks (NNs) to approximate Hamilton-Jacobi-Isaac solution for each nonlinear agent, the algorithm in the paper is proposed with only one NN. It is shown that when the algorithm is executed online, the NN weight approximation errors and states are uniformly ultimately bounded (UUB) as well as the NN weights and optimal control policies are guaranteed to be converged to the approximately optimal values concurrently, and nonquadratic cost functions with constrained-inputs are minimized. To show the effectiveness of the proposed algorithm, simulations for multiple controlled Van der Pol oscillators are carried out and compared.","PeriodicalId":137666,"journal":{"name":"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS.2018.8606873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper utilizes the machine learning theory to propose an algorithm for solving the distributed optimal control of multiple nonlinear agents with saturating actuators. Unlike the existing algorithm based on an critic/actor/disturber framework with three neural networks (NNs) to approximate Hamilton-Jacobi-Isaac solution for each nonlinear agent, the algorithm in the paper is proposed with only one NN. It is shown that when the algorithm is executed online, the NN weight approximation errors and states are uniformly ultimately bounded (UUB) as well as the NN weights and optimal control policies are guaranteed to be converged to the approximately optimal values concurrently, and nonquadratic cost functions with constrained-inputs are minimized. To show the effectiveness of the proposed algorithm, simulations for multiple controlled Van der Pol oscillators are carried out and compared.