基于对比预训练强化学习的高效燃料优化多脉冲轨道转移

IF 2.8 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS
He Ren, Haichao Gui, Rui Zhong
{"title":"基于对比预训练强化学习的高效燃料优化多脉冲轨道转移","authors":"He Ren,&nbsp;Haichao Gui,&nbsp;Rui Zhong","doi":"10.1016/j.asr.2025.02.049","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"75 10","pages":"Pages 7377-7396"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning\",\"authors\":\"He Ren,&nbsp;Haichao Gui,&nbsp;Rui Zhong\",\"doi\":\"10.1016/j.asr.2025.02.049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>\",\"PeriodicalId\":50850,\"journal\":{\"name\":\"Advances in Space Research\",\"volume\":\"75 10\",\"pages\":\"Pages 7377-7396\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Space Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S027311772500184X\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S027311772500184X","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

非共面轨道间的多脉冲传递对在轨服务航天器具有重要意义。研究了包含跟踪器和目标的多脉冲轨道转移的复杂优化问题。跟踪器受到脉冲大小和时间的限制,而目标可能受到不确定的干扰,导致其偏离标称轨道。这一问题的复杂性给数值方法带来了巨大的计算负担,给航天器实时自主规划轨迹转移带来了挑战。为了减轻这一负担,我们提出了一种鲁棒、快速和自主的优化算法,可以快速规划转移轨迹。即使终端条件突然改变,我们的算法也可以根据观测状态快速调整轨迹,而无需完全重新规划。该算法包括智能轨迹生成器和朗伯特转移算法。智能生成器基于一种称为对比预训练强化学习(CPRL)的强化学习(RL)方法,该方法模仿人类的学习习惯,避免了训练阶段长时间范围和稀疏奖励的时间信用分配。当追逐者到达允许范围时,由脉冲约束和二次曲线的几何关系决定,算法采用朗伯特转移完成任务。与传统的遗传算法和粒子群算法相比,我们的方法在计算速度上有了显著的提高。即使有偏差,平均任务成功率仍保持在96.8%。数值模拟结果表明,该算法处理数据速度快,可以在线部署,能够实时处理各种任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning
Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advances in Space Research
Advances in Space Research 地学天文-地球科学综合
CiteScore
5.20
自引率
11.50%
发文量
800
审稿时长
5.8 months
期刊介绍: The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc. NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR). All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信