A spacecraft attitude manoeuvre planning algorithm based on improved policy gradient reinforcement learning

IF 1.9; Zone 4 (Engineering and Technology); Q2 (ENGINEERING, MARINE)

Journal of Navigation Pub Date : 2021-12-14 DOI:10.1017/S0373463321000813

Bing Hua, Shenggang Sun, Yunhua Wu, Zhiming Chen

Citations: 1

A spacecraft attitude manoeuvre planning algorithm based on improved policy gradient reinforcement learning

Abstract

To solve the problem of spacecraft attitude manoeuvre planning under dynamic multiple mandatory pointing constraints and prohibited pointing constraints, a systematic attitude manoeuvre planning approach is proposed that is based on improved policy gradient reinforcement learning. This paper presents a succinct model of dynamic multiple constraints that is similar to a real situation faced by an in-orbit spacecraft. By introducing return baseline and adaptive policy exploration methods, the proposed method overcomes issues such as large variances and slow convergence rates. Concurrently, the required computation time of the proposed method is markedly reduced. Using the proposed method, the near-optimal path of the attitude manoeuvre can be determined, making the method suitable for the control of micro spacecraft. Simulation results demonstrate that the planning results fully satisfy all constraints, including six prohibited pointing constraints and two mandatory pointing constraints. The spacecraft also maintains high orientation accuracy to the Earth and Sun during all attitude manoeuvres.
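The abstract credits a return baseline with reducing the variance of the policy gradient. The sketch below illustrates that general idea on a toy one-step problem; it is not the authors' algorithm. The Gaussian policy, the quadratic reward, the constant reward offset, the batch-mean baseline, and every name in the code are assumptions made for illustration only.

```python
import random
import statistics


def grad_samples(mu, sigma, target, offset, use_baseline, n, seed):
    """Per-sample score-function estimates of d E[r] / d mu.

    Hypothetical toy setup: the action a ~ N(mu, sigma) and the reward is
    r = offset - (a - target)^2. None of this comes from the paper.
    """
    rng = random.Random(seed)
    actions = [rng.gauss(mu, sigma) for _ in range(n)]
    rewards = [offset - (a - target) ** 2 for a in actions]
    # Baseline: the batch-mean return (an assumed choice). Using 0.0 gives
    # the plain estimator without a baseline.
    baseline = statistics.mean(rewards) if use_baseline else 0.0
    # d/dmu log N(a; mu, sigma) = (a - mu) / sigma^2
    return [
        (r - baseline) * (a - mu) / sigma ** 2
        for a, r in zip(actions, rewards)
    ]


# Same seed, so both estimators see exactly the same sampled actions.
plain = grad_samples(0.0, 1.0, 2.0, 50.0, use_baseline=False, n=5000, seed=0)
based = grad_samples(0.0, 1.0, 2.0, 50.0, use_baseline=True, n=5000, seed=0)

# Analytically, d E[r] / d mu = -2 * (mu - target) = 4 for these settings.
print("mean gradient, no baseline:  ", statistics.mean(plain))
print("mean gradient, with baseline:", statistics.mean(based))
print("variance, no baseline:  ", statistics.variance(plain))
print("variance, with baseline:", statistics.variance(based))
```

Both estimators target the same gradient, but subtracting the baseline removes the large constant part of the return from every sample, so the per-sample estimates scatter far less. A batch-mean baseline adds a small bias of order 1/n, which is negligible at this sample size.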

Source journal

Journal of Navigation (Engineering and Technology - Engineering: Marine)

CiteScore

6.10

Self-citation rate

4.20%

Publication volume

Review time

4.6 months

About the journal: The Journal of Navigation contains original papers on the science of navigation by man and animals over land and sea and through air and space, including a selection of papers presented at meetings of the Institute and other organisations associated with navigation. Papers cover every aspect of navigation, from the highly technical to the descriptive and historical. Subjects include electronics, astronomy, mathematics, cartography, command and control, psychology and zoology, operational research, risk analysis, theoretical physics, operation in hostile environments, instrumentation, ergonomics, financial planning and law. The journal also publishes selected papers and reports from the Institute’s special interest groups. Contributions come from all parts of the world.