{"title":"基于对比预训练强化学习的高效燃料优化多脉冲轨道转移","authors":"He Ren, Haichao Gui, Rui Zhong","doi":"10.1016/j.asr.2025.02.049","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"75 10","pages":"Pages 7377-7396"},"PeriodicalIF":2.8000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning\",\"authors\":\"He Ren, Haichao Gui, Rui Zhong\",\"doi\":\"10.1016/j.asr.2025.02.049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.</div></div>\",\"PeriodicalId\":50850,\"journal\":{\"name\":\"Advances in Space Research\",\"volume\":\"75 10\",\"pages\":\"Pages 7377-7396\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Space Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S027311772500184X\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ASTRONOMY & ASTROPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S027311772500184X","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
Efficient fuel-optimal multi-impulse orbital transfer via contrastive pre-trained reinforcement learning
Multi-impulse transfers between noncoplanar orbits are significant for on-orbit service spacecraft. This paper investigates the complex optimization problem of multi-impulse orbital transfer involving a chaser and a target. The chaser is subject to constraints on impulse magnitude and time, while the target may experience uncertain disturbances, causing it to deviate from the nominal orbit. The complexity of this problem imposes a significant computational burden on numerical methods, making it challenging for spacecraft to autonomously plan trajectory transfers in real time. To mitigate this burden, we propose a robust, fast, and autonomous algorithm for the optimization challenge, which can rapid plan transfer trajectories. Even if the terminal conditions suddenly change, our algorithm can quickly adjust the trajectory based on observed states without the need to completely re-plan. The algorithm comprises an intelligent trajectory generator and a Lambert transfer algorithm. The intelligent generator is based on a reinforcement learning (RL) method called contrastive-pre-trained Reinforcement Learning (CPRL), which emulates human learning habits to avoid the temporal credit assignment with long time horizons and sparse rewards during the training phase. When the chaser reaches an admissible range, determined by the impulse constraints and geometric relations of the conic curve, the algorithm adopts the Lambert transfer to complete the mission. Compared to traditional genetic and particle swarm algorithms, our method achieves a significant improvement in computational speed. Even with deviations, the average mission success rate remains at 96.8%. Numerical simulations confirm that our algorithm processes data quickly, can be deployed online, and is capable of handling various tasks in real time.
期刊介绍:
The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc.
NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR).
All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.