{"title":"基于图指针网络的分层课程强化学习方法求解穿梭油轮调度问题","authors":"Xiaoyong Gao;Yixu Yang;Diao Peng;Shanghe Li;Chaodong Tan;Feifei Li;Tao Chen","doi":"10.23919/CSMS.2024.0017","DOIUrl":null,"url":null,"abstract":"Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.","PeriodicalId":65786,"journal":{"name":"复杂系统建模与仿真(英文)","volume":"4 4","pages":"339-352"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10820942","citationCount":"0","resultStr":"{\"title\":\"Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem\",\"authors\":\"Xiaoyong Gao;Yixu Yang;Diao Peng;Shanghe Li;Chaodong Tan;Feifei Li;Tao Chen\",\"doi\":\"10.23919/CSMS.2024.0017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.\",\"PeriodicalId\":65786,\"journal\":{\"name\":\"复杂系统建模与仿真(英文)\",\"volume\":\"4 4\",\"pages\":\"339-352\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10820942\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"复杂系统建模与仿真(英文)\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10820942/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"复杂系统建模与仿真(英文)","FirstCategoryId":"1089","ListUrlMain":"https://ieeexplore.ieee.org/document/10820942/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem
Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.