{"title":"基于强化学习的三维 Lyapunov 导向矢量场避开拦截卫星的新方法","authors":"Yunfei Zhang, Honglun Wang, Menghua Zhang, Yiheng Liu, Jianfa Wu","doi":"10.1007/s10846-024-02151-x","DOIUrl":null,"url":null,"abstract":"<p>This paper proposes a new 3D Lyapunov guidance vector field(3D-LGV) avoidance strategy based on reinforcement learning for the satellite evasion and interception problem. Combining it with the interfered fluid dynamical system (IFDS) enables the satellite to evade and smoothly enter orbit according to the state of the intercepting satellite in real time. 3D-LGV provides an initial flow field approaching an elliptical orbit, while IFDS provides a perturbed flow field based on the intercepting satellite position. The combined potential field of the initial flow field and the disturbed flow field is the planned velocity direction of the satellite. As a decision-making layer, the proximal policy optimization (PPO) dynamically adjusts the perturbed flow field in the IFDS to increase the avoidance success rate in different scenarios. The experimental results show that, compared with the particle swarm optimization with rolling horizon control algorithm, the algorithm proposed in this paper has a shorter decision time and a higher avoidance success rate. At the same time, Monte Carlo simulation shows that the evasion success rate of the proposed algorithm reaches 98%.</p>","PeriodicalId":54794,"journal":{"name":"Journal of Intelligent & Robotic Systems","volume":"14 1","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Method of 3D Lyapunov Guidance Vector Field to Avoid Intercepting Satellite Based on Reinforcement Learning\",\"authors\":\"Yunfei Zhang, Honglun Wang, Menghua Zhang, Yiheng Liu, Jianfa Wu\",\"doi\":\"10.1007/s10846-024-02151-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper proposes a new 3D Lyapunov guidance vector field(3D-LGV) avoidance strategy based on reinforcement learning for the satellite evasion and interception problem. Combining it with the interfered fluid dynamical system (IFDS) enables the satellite to evade and smoothly enter orbit according to the state of the intercepting satellite in real time. 3D-LGV provides an initial flow field approaching an elliptical orbit, while IFDS provides a perturbed flow field based on the intercepting satellite position. The combined potential field of the initial flow field and the disturbed flow field is the planned velocity direction of the satellite. As a decision-making layer, the proximal policy optimization (PPO) dynamically adjusts the perturbed flow field in the IFDS to increase the avoidance success rate in different scenarios. The experimental results show that, compared with the particle swarm optimization with rolling horizon control algorithm, the algorithm proposed in this paper has a shorter decision time and a higher avoidance success rate. At the same time, Monte Carlo simulation shows that the evasion success rate of the proposed algorithm reaches 98%.</p>\",\"PeriodicalId\":54794,\"journal\":{\"name\":\"Journal of Intelligent & Robotic Systems\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent & Robotic Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s10846-024-02151-x\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Robotic Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10846-024-02151-x","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
A Novel Method of 3D Lyapunov Guidance Vector Field to Avoid Intercepting Satellite Based on Reinforcement Learning
This paper proposes a new 3D Lyapunov guidance vector field(3D-LGV) avoidance strategy based on reinforcement learning for the satellite evasion and interception problem. Combining it with the interfered fluid dynamical system (IFDS) enables the satellite to evade and smoothly enter orbit according to the state of the intercepting satellite in real time. 3D-LGV provides an initial flow field approaching an elliptical orbit, while IFDS provides a perturbed flow field based on the intercepting satellite position. The combined potential field of the initial flow field and the disturbed flow field is the planned velocity direction of the satellite. As a decision-making layer, the proximal policy optimization (PPO) dynamically adjusts the perturbed flow field in the IFDS to increase the avoidance success rate in different scenarios. The experimental results show that, compared with the particle swarm optimization with rolling horizon control algorithm, the algorithm proposed in this paper has a shorter decision time and a higher avoidance success rate. At the same time, Monte Carlo simulation shows that the evasion success rate of the proposed algorithm reaches 98%.
期刊介绍:
The Journal of Intelligent and Robotic Systems bridges the gap between theory and practice in all areas of intelligent systems and robotics. It publishes original, peer reviewed contributions from initial concept and theory to prototyping to final product development and commercialization.
On the theoretical side, the journal features papers focusing on intelligent systems engineering, distributed intelligence systems, multi-level systems, intelligent control, multi-robot systems, cooperation and coordination of unmanned vehicle systems, etc.
On the application side, the journal emphasizes autonomous systems, industrial robotic systems, multi-robot systems, aerial vehicles, mobile robot platforms, underwater robots, sensors, sensor-fusion, and sensor-based control. Readers will also find papers on real applications of intelligent and robotic systems (e.g., mechatronics, manufacturing, biomedical, underwater, humanoid, mobile/legged robot and space applications, etc.).