Hongji Zhuang , Wenlong Lu , Qiang Shen , Shufan Wu , Vladimir Yu. Razoumny , Yury N. Razoumny
{"title":"Off-policy reinforcement learning control for space manipulators based on object detection via convolutional neural networks","authors":"Hongji Zhuang , Wenlong Lu , Qiang Shen , Shufan Wu , Vladimir Yu. Razoumny , Yury N. Razoumny","doi":"10.1016/j.ast.2025.110914","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes a vision-based control framework that integrates convolutional neural network-based object detection with off-policy reinforcement learning to address the engineering demands of autonomy, robustness, and high control performance in space manipulator operations, as well as to fill gaps in existing vision-based control research. A two-loop architecture comprising a detection loop and a control loop is constructed, with a combined-variable approach employed to simplify the complex image-space dynamics of the space manipulator. On the vision side, a state-of-the-art single-stage object detection network is enhanced with a depth regression module to provide real-time distance feedback. On the control side, an off-policy reinforcement learning algorithm is adopted to achieve model-free optimal control. The proposed integrated vision-based control strategy is validated through both verification and comparative simulations, demonstrating superior autonomy, robustness, and control performance, as well as advantages over the other representative vision-based control method.</div></div>","PeriodicalId":50955,"journal":{"name":"Aerospace Science and Technology","volume":"168 ","pages":"Article 110914"},"PeriodicalIF":5.8000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Science and Technology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1270963825009782","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, AEROSPACE","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a vision-based control framework that integrates convolutional neural network-based object detection with off-policy reinforcement learning to address the engineering demands of autonomy, robustness, and high control performance in space manipulator operations, as well as to fill gaps in existing vision-based control research. A two-loop architecture comprising a detection loop and a control loop is constructed, with a combined-variable approach employed to simplify the complex image-space dynamics of the space manipulator. On the vision side, a state-of-the-art single-stage object detection network is enhanced with a depth regression module to provide real-time distance feedback. On the control side, an off-policy reinforcement learning algorithm is adopted to achieve model-free optimal control. The proposed integrated vision-based control strategy is validated through both verification and comparative simulations, demonstrating superior autonomy, robustness, and control performance, as well as advantages over the other representative vision-based control method.
期刊介绍:
Aerospace Science and Technology publishes articles of outstanding scientific quality. Each article is reviewed by two referees. The journal welcomes papers from a wide range of countries. This journal publishes original papers, review articles and short communications related to all fields of aerospace research, fundamental and applied, potential applications of which are clearly related to:
• The design and the manufacture of aircraft, helicopters, missiles, launchers and satellites
• The control of their environment
• The study of various systems they are involved in, as supports or as targets.
Authors are invited to submit papers on new advances in the following topics to aerospace applications:
• Fluid dynamics
• Energetics and propulsion
• Materials and structures
• Flight mechanics
• Navigation, guidance and control
• Acoustics
• Optics
• Electromagnetism and radar
• Signal and image processing
• Information processing
• Data fusion
• Decision aid
• Human behaviour
• Robotics and intelligent systems
• Complex system engineering.
Etc.