Adaptive dynamic programming-based optimal pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance

IF 6.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2025-05-24 DOI:10.1016/j.neucom.2025.130483

Bo Li , Ziqi Yang , Hui Liu , Bing Xiao

{"title":"Adaptive dynamic programming-based optimal pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance","authors":"Bo Li , Ziqi Yang , Hui Liu , Bing Xiao","doi":"10.1016/j.neucom.2025.130483","DOIUrl":null,"url":null,"abstract":"<div><div>This article investigates the challenging problem of optimized intelligent pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance. Firstly, a novel penalty function is developed to achieve obstacle avoidance in the designed cost function. Subsequently, a critic-only structure is employed to substitute the conventional actor-critic structure for addressing the approximate solutions of Hamilton–Jacobi–Isaacs equations. Meanwhile, two new critic neural networks are constructed to learn the optimal cost functions and control policies for the pursuit–evasion control systems. More specifically, the developed weight update laws not only enable the critic neural network weights to be updated online, but also alleviate the persistent excitation condition with a simple structure. In particular, the obstacle avoidance function is incorporated into the construction of the approximate cost function learned by the proposed critic neural networks. In addition, the proposed pursuit–evasion control systems can be guaranteed to achieve uniformly ultimately bounded stability by utilizing Lyapunov methodology. Finally, the effectiveness of the developed pursuit–evasion control scheme is fully illustrated through two simulation cases.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"645 ","pages":"Article 130483"},"PeriodicalIF":6.5000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225011555","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

This article investigates the challenging problem of optimized intelligent pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance. Firstly, a novel penalty function is developed to achieve obstacle avoidance in the designed cost function. Subsequently, a critic-only structure is employed to substitute the conventional actor-critic structure for addressing the approximate solutions of Hamilton–Jacobi–Isaacs equations. Meanwhile, two new critic neural networks are constructed to learn the optimal cost functions and control policies for the pursuit–evasion control systems. More specifically, the developed weight update laws not only enable the critic neural network weights to be updated online, but also alleviate the persistent excitation condition with a simple structure. In particular, the obstacle avoidance function is incorporated into the construction of the approximate cost function learned by the proposed critic neural networks. In addition, the proposed pursuit–evasion control systems can be guaranteed to achieve uniformly ultimately bounded stability by utilizing Lyapunov methodology. Finally, the effectiveness of the developed pursuit–evasion control scheme is fully illustrated through two simulation cases.

查看原文本刊更多论文

基于自适应动态规划的四旋翼无人机避障最优追避控制

研究了具有避障功能的四旋翼无人机智能追避控制优化问题。首先，在设计的成本函数中，提出了一种新的惩罚函数来实现避障；在此基础上，采用纯临界结构代替传统的行动者-临界结构来求解Hamilton-Jacobi-Isaacs方程的近似解。同时，构造了两个新的批评神经网络来学习追逃控制系统的最优成本函数和控制策略。更具体地说，所建立的权值更新规律不仅能使神经网络的临界权值在线更新，而且结构简单，缓解了神经网络的持续激励条件。特别地，避障函数被纳入到近似代价函数的构造中，由所提出的批评神经网络学习。此外，利用李雅普诺夫方法可以保证所提出的追避控制系统达到一致最终有界稳定。最后，通过两个仿真实例充分说明了所提出的追逃控制方案的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.