{"title":"Prescribed intelligent elliptical pursuing by UAVs: A reinforcement learning policy","authors":"Yi Xia , Xingling Shao , Tianyun Ding , Jun Liu","doi":"10.1016/j.eswa.2024.123547","DOIUrl":null,"url":null,"abstract":"<div><p>This paper proposes a prescribed intelligent elliptical pursuing algorithm driven by deep reinforcement learning (DRL) for Unmanned Aerial Vehicles (UAVs), aiming to achieve an optimal performance without violating appointed-time constraints. Specifically, an unknown system dynamics estimator (USDE) with merely one filtering coefficient is employed to counteract uncertainties effectively. Subsequently, an elliptical enclosing controller is devised for UAVs to achieve asymptotic convergence towards the target ellipse orbit. Particularly, a proximal policy optimization (PPO)-based optimal learning term is contrived to achieve control optimality related to consumption and efficiency, where a refined reward function is established to characterize the appointed-time constraints. The salient merit is that an efficient learning paradigm is formulated to fulfill a near optimal appointed-time enclosing free from prescribed performance control. Finally, a plenty of simulations substantiate the feasibility of developed algorithm.</p></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"249 ","pages":"Article 123547"},"PeriodicalIF":7.5000,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424004123","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a prescribed intelligent elliptical pursuing algorithm driven by deep reinforcement learning (DRL) for Unmanned Aerial Vehicles (UAVs), aiming to achieve an optimal performance without violating appointed-time constraints. Specifically, an unknown system dynamics estimator (USDE) with merely one filtering coefficient is employed to counteract uncertainties effectively. Subsequently, an elliptical enclosing controller is devised for UAVs to achieve asymptotic convergence towards the target ellipse orbit. Particularly, a proximal policy optimization (PPO)-based optimal learning term is contrived to achieve control optimality related to consumption and efficiency, where a refined reward function is established to characterize the appointed-time constraints. The salient merit is that an efficient learning paradigm is formulated to fulfill a near optimal appointed-time enclosing free from prescribed performance control. Finally, a plenty of simulations substantiate the feasibility of developed algorithm.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.