Luy Tan Nguyen, N. T. Dang, Dang Quang Minh, Tran Hong Vinh
{"title":"基于机器学习的具有输入约束的多非线性智能体分布式最优控制算法","authors":"Luy Tan Nguyen, N. T. Dang, Dang Quang Minh, Tran Hong Vinh","doi":"10.1109/NICS.2018.8606873","DOIUrl":null,"url":null,"abstract":"This paper utilizes the machine learning theory to propose an algorithm for solving the distributed optimal control of multiple nonlinear agents with saturating actuators. Unlike the existing algorithm based on an critic/actor/disturber framework with three neural networks (NNs) to approximate Hamilton-Jacobi-Isaac solution for each nonlinear agent, the algorithm in the paper is proposed with only one NN. It is shown that when the algorithm is executed online, the NN weight approximation errors and states are uniformly ultimately bounded (UUB) as well as the NN weights and optimal control policies are guaranteed to be converged to the approximately optimal values concurrently, and nonquadratic cost functions with constrained-inputs are minimized. To show the effectiveness of the proposed algorithm, simulations for multiple controlled Van der Pol oscillators are carried out and compared.","PeriodicalId":137666,"journal":{"name":"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Machine learning based-distributed optimal control algorithm for multiple nonlinear agents with input constraints\",\"authors\":\"Luy Tan Nguyen, N. T. Dang, Dang Quang Minh, Tran Hong Vinh\",\"doi\":\"10.1109/NICS.2018.8606873\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper utilizes the machine learning theory to propose an algorithm for solving the distributed optimal control of multiple nonlinear agents with saturating actuators. Unlike the existing algorithm based on an critic/actor/disturber framework with three neural networks (NNs) to approximate Hamilton-Jacobi-Isaac solution for each nonlinear agent, the algorithm in the paper is proposed with only one NN. It is shown that when the algorithm is executed online, the NN weight approximation errors and states are uniformly ultimately bounded (UUB) as well as the NN weights and optimal control policies are guaranteed to be converged to the approximately optimal values concurrently, and nonquadratic cost functions with constrained-inputs are minimized. To show the effectiveness of the proposed algorithm, simulations for multiple controlled Van der Pol oscillators are carried out and compared.\",\"PeriodicalId\":137666,\"journal\":{\"name\":\"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NICS.2018.8606873\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th NAFOSTED Conference on Information and Computer Science (NICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NICS.2018.8606873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine learning based-distributed optimal control algorithm for multiple nonlinear agents with input constraints
This paper utilizes the machine learning theory to propose an algorithm for solving the distributed optimal control of multiple nonlinear agents with saturating actuators. Unlike the existing algorithm based on an critic/actor/disturber framework with three neural networks (NNs) to approximate Hamilton-Jacobi-Isaac solution for each nonlinear agent, the algorithm in the paper is proposed with only one NN. It is shown that when the algorithm is executed online, the NN weight approximation errors and states are uniformly ultimately bounded (UUB) as well as the NN weights and optimal control policies are guaranteed to be converged to the approximately optimal values concurrently, and nonquadratic cost functions with constrained-inputs are minimized. To show the effectiveness of the proposed algorithm, simulations for multiple controlled Van der Pol oscillators are carried out and compared.