基于对比预训练强化学习的高效燃料优化多脉冲轨道转移

IF 2.8 3区地球科学 Q2 ASTRONOMY & ASTROPHYSICS

Advances in Space Research Pub Date : 2025-02-27 DOI:10.1016/j.asr.2025.02.049

He Ren, Haichao Gui, Rui Zhong

{"title":"基于对比预训练强化学习的高效燃料优化多脉冲轨道转移","authors":"He Ren, Haichao Gui, Rui Zhong","doi":"10.1016/j.asr.2025.02.049","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"75 10","pages":"Pages 7377-7396"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning\",\"authors\":\"He Ren, Haichao Gui, Rui Zhong\",\"doi\":\"10.1016/j.asr.2025.02.049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>\",\"PeriodicalId\":50850,\"journal\":{\"name\":\"Advances in Space Research\",\"volume\":\"75 10\",\"pages\":\"Pages 7377-7396\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Space Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S027311772500184X\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S027311772500184X","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}

引用次数: 0

摘要

非共面轨道间的多脉冲传递对在轨服务航天器具有重要意义。研究了包含跟踪器和目标的多脉冲轨道转移的复杂优化问题。跟踪器受到脉冲大小和时间的限制，而目标可能受到不确定的干扰，导致其偏离标称轨道。这一问题的复杂性给数值方法带来了巨大的计算负担，给航天器实时自主规划轨迹转移带来了挑战。为了减轻这一负担，我们提出了一种鲁棒、快速和自主的优化算法，可以快速规划转移轨迹。即使终端条件突然改变，我们的算法也可以根据观测状态快速调整轨迹，而无需完全重新规划。该算法包括智能轨迹生成器和朗伯特转移算法。智能生成器基于一种称为对比预训练强化学习（CPRL）的强化学习（RL）方法，该方法模仿人类的学习习惯，避免了训练阶段长时间范围和稀疏奖励的时间信用分配。当追逐者到达允许范围时，由脉冲约束和二次曲线的几何关系决定，算法采用朗伯特转移完成任务。与传统的遗传算法和粒子群算法相比，我们的方法在计算速度上有了显著的提高。即使有偏差，平均任务成功率仍保持在96.8%。数值模拟结果表明，该算法处理数据速度快，可以在线部署，能够实时处理各种任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning

Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advances in Space Research 地学天文-地球科学综合

CiteScore

5.20

自引率

11.50%

发文量

800

审稿时长

5.8 months

期刊介绍： The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc. NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR). All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.