Adaptive dynamic programming with stable value iteration algorithm for discrete-time nonlinear systems

The 2012 International Joint Conference on Neural Networks (IJCNN) Pub Date : 2012-06-10 DOI:10.1109/IJCNN.2012.6252512

Qinglai Wei, Derong Liu

引用次数: 12

Abstract

In this paper, a new stable value iteration adaptive dynamic programming (ADP) algorithm, named “θ-ADP” algorithm, is proposed for solving the optimal control problems of infinite horizon discrete-time nonlinear systems. By introducing a parameter θ in the iterative ADP algorithm, it is proved that any of iterative control obtained in the proposed algorithm can stabilize the nonlinear system which overcomes the disadvantage of traditional value iteration algorithms. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative θ-ADP algorithm. Finally, a simulation example is given to illustrate the performance of the proposed method.

查看原文本刊更多论文

离散非线性系统的稳定值迭代自适应动态规划算法

针对无限水平离散非线性系统的最优控制问题，提出了一种新的稳定值迭代自适应动态规划(ADP)算法——θ-ADP算法。通过在迭代ADP算法中引入参数θ，证明了该算法所得到的任何迭代控制都能稳定非线性系统，克服了传统数值迭代算法的缺点。利用神经网络分别逼近性能指标函数和计算最优控制策略，便于迭代θ-ADP算法的实现。最后，通过仿真实例验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The 2012 International Joint Conference on Neural Networks (IJCNN)

自引率

0.00%

发文量