A Model-Free Optimal Control Method With Fixed Terminal States and Delay

arXiv - EE - Systems and Control Pub Date : 2024-09-16 DOI:arxiv-2409.10722

Mi Zhou, Erik Verriest, Chaouki Abdallah

引用次数: 0

Abstract

Model-free algorithms are brought into the control system's research with the emergence of reinforcement learning algorithms. However, there are two practical challenges of reinforcement learning-based methods. First, learning by interacting with the environment is highly complex. Second, constraints on the states (boundary conditions) require additional care since the state trajectory is implicitly defined from the inputs and system dynamics. To address these problems, this paper proposes a new model-free algorithm based on basis functions, gradient estimation, and the Lagrange method. The favorable performance of the proposed algorithm is shown using several examples under state-dependent switches and time delays.

查看原文本刊更多论文

具有固定终端状态和延迟的无模型最优控制方法

随着强化学习算法的出现，无模型算法被带入控制系统研究领域。然而，基于强化学习的方法存在两个实际挑战。首先，通过与环境交互进行学习非常复杂。其次，由于状态轨迹是根据输入和系统动态隐式定义的，因此对状态的约束（边界条件）需要额外的关注。为了解决这些问题，本文提出了一种基于基函数、梯度估计和拉格朗日法的新型无模型算法。本文通过几个例子展示了所提算法的优越性能，这些例子都不依赖于开关和时间延迟。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - EE - Systems and Control

自引率

0.00%

发文量