基于改进策略梯度强化学习的航天器姿态机动规划算法

IF 1.9 4区 工程技术 Q2 ENGINEERING, MARINE
Bing Hua, Shenggang Sun, Yunhua Wu, Zhiming Chen
{"title":"基于改进策略梯度强化学习的航天器姿态机动规划算法","authors":"Bing Hua, Shenggang Sun, Yunhua Wu, Zhiming Chen","doi":"10.1017/S0373463321000813","DOIUrl":null,"url":null,"abstract":"Abstract To solve the problem of spacecraft attitude manoeuvre planning under dynamic multiple mandatory pointing constraints and prohibited pointing constraints, a systematic attitude manoeuvre planning approach is proposed that is based on improved policy gradient reinforcement learning. This paper presents a succinct model of dynamic multiple constraints that is similar to a real situation faced by an in-orbit spacecraft. By introducing return baseline and adaptive policy exploration methods, the proposed method overcomes issues such as large variances and slow convergence rates. Concurrently, the required computation time of the proposed method is markedly reduced. Using the proposed method, the near optimal path of the attitude manoeuvre can be determined, making the method suitable for the control of micro spacecraft. Simulation results demonstrate that the planning results fully satisfy all constraints, including six prohibited pointing constraints and two mandatory pointing constraints. The spacecraft also maintains high orientation accuracy to the Earth and Sun during all attitude manoeuvres.","PeriodicalId":50120,"journal":{"name":"Journal of Navigation","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A spacecraft attitude manoeuvre planning algorithm based on improved policy gradient reinforcement learning\",\"authors\":\"Bing Hua, Shenggang Sun, Yunhua Wu, Zhiming Chen\",\"doi\":\"10.1017/S0373463321000813\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract To solve the problem of spacecraft attitude manoeuvre planning under dynamic multiple mandatory pointing constraints and prohibited pointing constraints, a systematic attitude manoeuvre planning approach is proposed that is based on improved policy gradient reinforcement learning. This paper presents a succinct model of dynamic multiple constraints that is similar to a real situation faced by an in-orbit spacecraft. By introducing return baseline and adaptive policy exploration methods, the proposed method overcomes issues such as large variances and slow convergence rates. Concurrently, the required computation time of the proposed method is markedly reduced. Using the proposed method, the near optimal path of the attitude manoeuvre can be determined, making the method suitable for the control of micro spacecraft. Simulation results demonstrate that the planning results fully satisfy all constraints, including six prohibited pointing constraints and two mandatory pointing constraints. The spacecraft also maintains high orientation accuracy to the Earth and Sun during all attitude manoeuvres.\",\"PeriodicalId\":50120,\"journal\":{\"name\":\"Journal of Navigation\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2021-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Navigation\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1017/S0373463321000813\",\"RegionNum\":4,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MARINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Navigation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1017/S0373463321000813","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MARINE","Score":null,"Total":0}
引用次数: 1

摘要

摘要为了解决动态多重强制指向约束和禁止指向约束下的航天器姿态机动规划问题,提出了一种基于改进策略梯度强化学习的系统姿态机动规划方法。本文提出了一个简洁的动态多重约束模型,该模型类似于在轨航天器所面临的实际情况。通过引入回归基线和自适应策略探索方法,该方法克服了方差大和收敛速度慢等问题。同时,所提出的方法所需的计算时间显著减少。利用该方法可以确定姿态机动的近似最优路径,使该方法适用于微型航天器的控制。仿真结果表明,规划结果完全满足所有约束,包括六个禁止指向约束和两个强制指向约束。在所有姿态操纵过程中,航天器还保持着对地球和太阳的高定向精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A spacecraft attitude manoeuvre planning algorithm based on improved policy gradient reinforcement learning
Abstract To solve the problem of spacecraft attitude manoeuvre planning under dynamic multiple mandatory pointing constraints and prohibited pointing constraints, a systematic attitude manoeuvre planning approach is proposed that is based on improved policy gradient reinforcement learning. This paper presents a succinct model of dynamic multiple constraints that is similar to a real situation faced by an in-orbit spacecraft. By introducing return baseline and adaptive policy exploration methods, the proposed method overcomes issues such as large variances and slow convergence rates. Concurrently, the required computation time of the proposed method is markedly reduced. Using the proposed method, the near optimal path of the attitude manoeuvre can be determined, making the method suitable for the control of micro spacecraft. Simulation results demonstrate that the planning results fully satisfy all constraints, including six prohibited pointing constraints and two mandatory pointing constraints. The spacecraft also maintains high orientation accuracy to the Earth and Sun during all attitude manoeuvres.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Navigation
Journal of Navigation 工程技术-工程:海洋
CiteScore
6.10
自引率
4.20%
发文量
59
审稿时长
4.6 months
期刊介绍: The Journal of Navigation contains original papers on the science of navigation by man and animals over land and sea and through air and space, including a selection of papers presented at meetings of the Institute and other organisations associated with navigation. Papers cover every aspect of navigation, from the highly technical to the descriptive and historical. Subjects include electronics, astronomy, mathematics, cartography, command and control, psychology and zoology, operational research, risk analysis, theoretical physics, operation in hostile environments, instrumentation, ergonomics, financial planning and law. The journal also publishes selected papers and reports from the Institute’s special interest groups. Contributions come from all parts of the world.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信