Xiaoyu Chen , Tian Tian , Guangming Dai , Maocai Wang , Zhiming Song , Lining Xing
{"title":"Deep reinforcement learning-based resource allocation method for multi-satellite scheduling","authors":"Xiaoyu Chen , Tian Tian , Guangming Dai , Maocai Wang , Zhiming Song , Lining Xing","doi":"10.1016/j.cor.2025.107088","DOIUrl":null,"url":null,"abstract":"<div><div>Agile Earth observation satellites (AEOSs) scheduling represents a complex domain within combinatorial optimization, crucial for the regular operations and mission success of in-orbit satellites. In order to timelessly tackle the allocation of complex resources and corresponding time windows, a satellite resource adaptive allocation method, named SRADA-DRL, is proposed in this paper. By combining deep reinforcement learning (DRL) with rule-based heuristics, the SRADA-DRL is designed to optimize the allocation of satellite resources in dynamic environments. Concerning maximizing the total rewards of allocated missions, a mathematical model and a corresponding Markov decision model are constructed within the scheduling process. After analyzing the spatial–temporal distribution features of all resources and missions, the time-dependent missions are first decomposed into meta-missions corresponding to satellite resources, and a meta-mission is then selected to generate an allocation sequence in each stage. On this basis, the execution times for all missions are assigned in the single-satellite scheduling process. In which, the DRL updates the gradient information contingent upon the rewards garnered from the allocation sequence. In addition, the classical scheduling scenarios of varying scales are also conducted. Experimental results demonstrate the effectiveness and efficiency of the proposed SRADA-DRL method in addressing the AEOSs scheduling.</div></div>","PeriodicalId":10542,"journal":{"name":"Computers & Operations Research","volume":"181 ","pages":"Article 107088"},"PeriodicalIF":4.1000,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Operations Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0305054825001169","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Agile Earth observation satellites (AEOSs) scheduling represents a complex domain within combinatorial optimization, crucial for the regular operations and mission success of in-orbit satellites. In order to timelessly tackle the allocation of complex resources and corresponding time windows, a satellite resource adaptive allocation method, named SRADA-DRL, is proposed in this paper. By combining deep reinforcement learning (DRL) with rule-based heuristics, the SRADA-DRL is designed to optimize the allocation of satellite resources in dynamic environments. Concerning maximizing the total rewards of allocated missions, a mathematical model and a corresponding Markov decision model are constructed within the scheduling process. After analyzing the spatial–temporal distribution features of all resources and missions, the time-dependent missions are first decomposed into meta-missions corresponding to satellite resources, and a meta-mission is then selected to generate an allocation sequence in each stage. On this basis, the execution times for all missions are assigned in the single-satellite scheduling process. In which, the DRL updates the gradient information contingent upon the rewards garnered from the allocation sequence. In addition, the classical scheduling scenarios of varying scales are also conducted. Experimental results demonstrate the effectiveness and efficiency of the proposed SRADA-DRL method in addressing the AEOSs scheduling.
期刊介绍:
Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.