Adaptive dynamic programming-based optimal pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance

IF 6.5 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Bo Li , Ziqi Yang , Hui Liu , Bing Xiao
{"title":"Adaptive dynamic programming-based optimal pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance","authors":"Bo Li ,&nbsp;Ziqi Yang ,&nbsp;Hui Liu ,&nbsp;Bing Xiao","doi":"10.1016/j.neucom.2025.130483","DOIUrl":null,"url":null,"abstract":"<div><div>This article investigates the challenging problem of optimized intelligent pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance. Firstly, a novel penalty function is developed to achieve obstacle avoidance in the designed cost function. Subsequently, a critic-only structure is employed to substitute the conventional actor-critic structure for addressing the approximate solutions of Hamilton–Jacobi–Isaacs equations. Meanwhile, two new critic neural networks are constructed to learn the optimal cost functions and control policies for the pursuit–evasion control systems. More specifically, the developed weight update laws not only enable the critic neural network weights to be updated online, but also alleviate the persistent excitation condition with a simple structure. In particular, the obstacle avoidance function is incorporated into the construction of the approximate cost function learned by the proposed critic neural networks. In addition, the proposed pursuit–evasion control systems can be guaranteed to achieve uniformly ultimately bounded stability by utilizing Lyapunov methodology. Finally, the effectiveness of the developed pursuit–evasion control scheme is fully illustrated through two simulation cases.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"645 ","pages":"Article 130483"},"PeriodicalIF":6.5000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225011555","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

This article investigates the challenging problem of optimized intelligent pursuit–evasion control for quadrotor unmanned aerial vehicles with obstacle avoidance. Firstly, a novel penalty function is developed to achieve obstacle avoidance in the designed cost function. Subsequently, a critic-only structure is employed to substitute the conventional actor-critic structure for addressing the approximate solutions of Hamilton–Jacobi–Isaacs equations. Meanwhile, two new critic neural networks are constructed to learn the optimal cost functions and control policies for the pursuit–evasion control systems. More specifically, the developed weight update laws not only enable the critic neural network weights to be updated online, but also alleviate the persistent excitation condition with a simple structure. In particular, the obstacle avoidance function is incorporated into the construction of the approximate cost function learned by the proposed critic neural networks. In addition, the proposed pursuit–evasion control systems can be guaranteed to achieve uniformly ultimately bounded stability by utilizing Lyapunov methodology. Finally, the effectiveness of the developed pursuit–evasion control scheme is fully illustrated through two simulation cases.
基于自适应动态规划的四旋翼无人机避障最优追避控制
研究了具有避障功能的四旋翼无人机智能追避控制优化问题。首先,在设计的成本函数中,提出了一种新的惩罚函数来实现避障;在此基础上,采用纯临界结构代替传统的行动者-临界结构来求解Hamilton-Jacobi-Isaacs方程的近似解。同时,构造了两个新的批评神经网络来学习追逃控制系统的最优成本函数和控制策略。更具体地说,所建立的权值更新规律不仅能使神经网络的临界权值在线更新,而且结构简单,缓解了神经网络的持续激励条件。特别地,避障函数被纳入到近似代价函数的构造中,由所提出的批评神经网络学习。此外,利用李雅普诺夫方法可以保证所提出的追避控制系统达到一致最终有界稳定。最后,通过两个仿真实例充分说明了所提出的追逃控制方案的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neurocomputing
Neurocomputing 工程技术-计算机:人工智能
CiteScore
13.10
自引率
10.00%
发文量
1382
审稿时长
70 days
期刊介绍: Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信