{"title":"An optimal guidance law based on deep reinforcement learning for compensating the lag in time-varying slope path following","authors":"Zibo Wang, Qidan Zhu, Tianrui Zhao, Lipeng Wang","doi":"10.1016/j.ast.2025.110948","DOIUrl":null,"url":null,"abstract":"<div><div>During the carrier landing process, the carrier motion induces real-time variations in the desired slope path. Considering the inherent lag in aircraft position adjustments, this paper proposes an optimal guidance law based on deep reinforcement learning (DRL) to compensate for the lag. First, the carrier landing process is modeled as a Finite Markov Decision Process (FMDP), and a comprehensive DRL framework is developed. Second, a novel Soft Actor-Critic (LA-SAC) method enhanced with the Long Short-Term Memory (LSTM) network and the attention mechanism (AM) is introduced. The method extracts the deck motion features with the LSTM network and adjusts the weights of different state data with AM to improve learning efficiency. Additionally, a distributed neural network is designed to integrate deck motion prediction and compensation, avoiding the complexity of parameter tuning in conventional methods. LA-SAC leverages full-dimensional data to train the network and derive an optimal guidance law. Finally, the superiority of the proposed method has been verified in a semi-physical simulation platform. Compared to DRL baselines, LA-SAC achieves faster convergence and derives a superior guidance policy. Compared to conventional methods, the proposed method provides a more significant lead margin to reduce landing errors. Furthermore, the ablation experiments confirmed the effectiveness of the LSTM network and AM modules, and the real-time analysis validated the practicality of the LA-SAC algorithm in actual implementation.</div></div>","PeriodicalId":50955,"journal":{"name":"Aerospace Science and Technology","volume":"168 ","pages":"Article 110948"},"PeriodicalIF":5.8000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1270963825010120","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0
Abstract
During the carrier landing process, the carrier motion induces real-time variations in the desired slope path. Considering the inherent lag in aircraft position adjustments, this paper proposes an optimal guidance law based on deep reinforcement learning (DRL) to compensate for the lag. First, the carrier landing process is modeled as a Finite Markov Decision Process (FMDP), and a comprehensive DRL framework is developed. Second, a novel Soft Actor-Critic (LA-SAC) method enhanced with the Long Short-Term Memory (LSTM) network and the attention mechanism (AM) is introduced. The method extracts the deck motion features with the LSTM network and adjusts the weights of different state data with AM to improve learning efficiency. Additionally, a distributed neural network is designed to integrate deck motion prediction and compensation, avoiding the complexity of parameter tuning in conventional methods. LA-SAC leverages full-dimensional data to train the network and derive an optimal guidance law. Finally, the superiority of the proposed method has been verified in a semi-physical simulation platform. Compared to DRL baselines, LA-SAC achieves faster convergence and derives a superior guidance policy. Compared to conventional methods, the proposed method provides a more significant lead margin to reduce landing errors. Furthermore, the ablation experiments confirmed the effectiveness of the LSTM network and AM modules, and the real-time analysis validated the practicality of the LA-SAC algorithm in actual implementation.
期刊介绍:
Aerospace Science and Technology publishes articles of outstanding scientific quality. Each article is reviewed by two referees. The journal welcomes papers from a wide range of countries. This journal publishes original papers, review articles and short communications related to all fields of aerospace research, fundamental and applied, potential applications of which are clearly related to:
• The design and the manufacture of aircraft, helicopters, missiles, launchers and satellites
• The control of their environment
• The study of various systems they are involved in, as supports or as targets.
Authors are invited to submit papers on new advances in the following topics to aerospace applications:
• Fluid dynamics
• Energetics and propulsion
• Materials and structures
• Flight mechanics
• Navigation, guidance and control
• Acoustics
• Optics
• Electromagnetism and radar
• Signal and image processing
• Information processing
• Data fusion
• Decision aid
• Human behaviour
• Robotics and intelligent systems
• Complex system engineering.
Etc.