Online Value Iteration for Discrete-Time Nonlinear Optimal Regulation with Stability Guarantee

2022 4th International Conference on Control and Robotics (ICCR) Pub Date : 2022-12-02 DOI:10.1109/ICCR55715.2022.10053685

Yuan Wang, Ding Wang, Junlong Wu, Mingming Zhao

引用次数: 0

Abstract

In this paper, the intelligent and online value iteration (VI) algorithms are developed to solve the optimal control problem for nonlinear discrete-time systems. First, the intelligent VI algorithm combines the advantages of traditional VI initialized by the zero cost function and stabilizing VI initialized by the admissible control policy. The traditional VI is easy to implement and can provide the initial admissible control policy for the stabilizing VI. Meanwhile, stabilizing VI can guarantee all control policies are admissible. Second, based on the concept of the attraction domain, an online value iteration algorithm is proposed to regulate the closed-loop system by using immature control policies rather than the fixed optimal control policy. It ensures that the state trajectory converges to the origin of the attraction domain. Finally, simulations are carried out and the results show the effectiveness of the two new VI algorithms.

查看原文本刊更多论文

具有稳定性保证的离散非线性最优调节的在线值迭代

本文提出了求解非线性离散系统最优控制问题的智能在线值迭代算法。首先，该智能VI算法结合了传统的零代价函数初始化VI和允许控制策略初始化稳定VI的优点。传统VI易于实现，可以为稳定VI提供初始允许的控制策略，同时稳定VI可以保证所有控制策略都是允许的。其次，基于吸引域的概念，提出了一种在线值迭代算法，利用不成熟的控制策略而不是固定的最优控制策略来调节闭环系统。它保证了状态轨迹收敛于吸引域的原点。最后进行了仿真，结果表明了两种新的VI算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 4th International Conference on Control and Robotics (ICCR)

自引率

0.00%

发文量