基于图指针网络的分层课程强化学习方法求解穿梭油轮调度问题

复杂系统建模与仿真(英文) Pub Date : 2024-12-01 DOI:10.23919/CSMS.2024.0017

Xiaoyong Gao;Yixu Yang;Diao Peng;Shanghe Li;Chaodong Tan;Feifei Li;Tao Chen

{"title":"基于图指针网络的分层课程强化学习方法求解穿梭油轮调度问题","authors":"Xiaoyong Gao;Yixu Yang;Diao Peng;Shanghe Li;Chaodong Tan;Feifei Li;Tao Chen","doi":"10.23919/CSMS.2024.0017","DOIUrl":null,"url":null,"abstract":"Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.","PeriodicalId":65786,"journal":{"name":"复杂系统建模与仿真(英文)","volume":"4 4","pages":"339-352"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10820942","citationCount":"0","resultStr":"{\"title\":\"Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem\",\"authors\":\"Xiaoyong Gao;Yixu Yang;Diao Peng;Shanghe Li;Chaodong Tan;Feifei Li;Tao Chen\",\"doi\":\"10.23919/CSMS.2024.0017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.\",\"PeriodicalId\":65786,\"journal\":{\"name\":\"复杂系统建模与仿真(英文)\",\"volume\":\"4 4\",\"pages\":\"339-352\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10820942\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"复杂系统建模与仿真(英文)\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10820942/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"复杂系统建模与仿真(英文)","FirstCategoryId":"1089","ListUrlMain":"https://ieeexplore.ieee.org/document/10820942/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

穿梭油轮调度是海上油气运输过程中的一项重要任务，涉及作业时间窗口的履行、运输计划的优化和库存的合理管理。然而，传统的方法如混合整数线性规划（MILP）或元启发式算法在长时间的运行中往往失败。针对穿梭油轮调度问题，提出了一种基于图指针网络（GPN）的分层课程强化学习（HCRL）方法。训练模型将STSP划分为航次和作业阶段，并依次生成路线和库存管理决策。为了解决阶段间的耦合问题，提出了一种异步训练策略。对比实验表明，与启发式算法相比，HCRL算法的平均行程缩短了12%。其他实验验证了它对未见实例的通用性和对更大实例的可伸缩性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Graph Pointer Network Based Hierarchical Curriculum Reinforcement Learning Method Solving Shuttle Tankers Scheduling Problem

Shuttle tankers scheduling is an important task in offshore oil and gas transportation process, which involves operating time window fulfillment, optimal transportation planning, and proper inventory management. However, conventional approaches like Mixed Integer Linear Programming (MILP) or meta heuristic algorithms often fail in long running time. In this paper, a Graph Pointer Network (GPN) based Hierarchical Curriculum Reinforcement Learning (HCRL) method is proposed to solve Shuttle Tankers Scheduling Problem (STSP). The model is trained to divide STSP into voyage and operation stages and generate routing and inventory management decisions sequentially. An asynchronous training strategy is developed to address the coupling between stages. Comparison experiments demonstrate that the proposed HCRL method achieves 12% shorter tour lengths on average compared to heuristic algorithms. Additional experiments validate its generalizability to unseen instances and scalability to larger instances.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

复杂系统建模与仿真(英文)

CiteScore

7.80

自引率

0.00%

发文量