A multi-agent reinforcement learning model incorporating historical operational data for pump station operational scheduling

Chao Wang, Yaofei Zhang, Sherong Zhang, Xiaohua Wang, Zhiyong Zhao

Computers & Operations Research, Volume 184, Article 107221. DOI: 10.1016/j.cor.2025.107221. Available online: 2025-07-26.
Citations: 0
Abstract
The development of operational scheduling for pump units is a critical focus in pump station management. This study introduces a pump station operation scheduling model based on multi-agent reinforcement learning (MARL) that integrates historical operational data, addressing the inefficiency, poor generalization, and operational complexity encountered with traditional evolutionary algorithms. Using a graph neural network, the model combines historical data with prior knowledge of pump station performance curves to establish a performance computation model for multi-unit operating combinations. The scheduling of multi-unit operations at a pump station is formulated as a parallel decision-making problem, incorporating rules aimed at reducing cost, improving efficiency, and meeting water delivery volume requirements while minimizing operational complexity. The MARL model accounts for variability in initial operating conditions to enhance its generalization capability. The study compares the performance of several reinforcement learning models against evolutionary algorithms. Results indicate that the trained MARL model adapts effectively to dynamic water delivery conditions and exhibits strong generalization. Compared with actual operational scheduling, it achieves savings of over 419,000 units in operational costs and over 390,000 kWh in energy consumption. Furthermore, compared with evolutionary algorithms, the decision-making solutions generated by the reinforcement learning model align more closely with operational logic and are more efficient, achieving a planning speed increase of over 70 times.
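The abstract does not disclose the paper's implementation, so as a loose illustration only, the sketch below frames multi-unit pump scheduling as a parallel decision problem with one agent per pump, trained by independent tabular Q-learning on a shared reward (a stand-in technique, not the authors' GNN-based MARL model). All names, pump rates, costs, and the delivery target are hypothetical; the shared reward mirrors the abstract's goals of meeting the delivery volume while cutting energy cost.

```python
import random

class PumpStationEnv:
    """Toy pump station: each of n_pumps agents independently chooses
    off (0) or on (1). Delivered flow is the sum of running pumps' rates;
    the shared reward penalizes deviation from the delivery target first,
    then energy cost. Rates/costs/target are illustrative placeholders."""
    def __init__(self, rates=(10, 10, 15), costs=(2, 2, 3), target=20):
        self.n_pumps = len(rates)
        self.rates = rates
        self.costs = costs
        self.target = target

    def step(self, actions):
        flow = sum(a * r for a, r in zip(actions, self.rates))
        cost = sum(a * c for a, c in zip(actions, self.costs))
        # Meet the water delivery target, then minimize operating cost.
        return -abs(flow - self.target) - 0.1 * cost

def train(env, episodes=5000, eps=0.2, lr=0.5, seed=0):
    """Independent Q-learning: each pump agent keeps its own two-entry
    Q-table (off/on) and all agents receive the same joint reward."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n_pumps)]
    for _ in range(episodes):
        # Epsilon-greedy parallel action selection, one choice per agent.
        actions = [rng.randrange(2) if rng.random() < eps
                   else int(qi[1] > qi[0]) for qi in q]
        reward = env.step(actions)
        for i, a in enumerate(actions):
            q[i][a] += lr * (reward - q[i][a])
    # Greedy joint policy: each agent picks its higher-valued action.
    return [int(qi[1] > qi[0]) for qi in q]

if __name__ == "__main__":
    env = PumpStationEnv()
    policy = train(env)
    print("on/off per pump:", policy, "reward:", env.step(policy))
```

With these toy numbers, running the two 10-unit pumps exactly meets the target of 20 at the lowest cost, so the greedy joint policy tends to settle there. The real model's state space (initial operating conditions, performance curves learned via a graph neural network) would be far richer than this stateless single-step setting.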
Journal Overview
Operations research and computers meet in a large number of scientific fields, many of which are of vital current concern to our troubled society. These include, among others, ecology, transportation, safety, reliability, urban planning, economics, inventory control, investment strategy and logistics (including reverse logistics). Computers & Operations Research provides an international forum for the application of computers and operations research techniques to problems in these and related fields.