Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2024-11-25 DOI:10.1016/j.eswa.2024.125792

Yao Jing , Bin Guo , Nuo Li , Yasan Ding , Yan Liu , Zhiwen Yu

{"title":"Scalable order dispatching through Federated Multi-Agent Deep Reinforcement Learning","authors":"Yao Jing , Bin Guo , Nuo Li , Yasan Ding , Yan Liu , Zhiwen Yu","doi":"10.1016/j.eswa.2024.125792","DOIUrl":null,"url":null,"abstract":"<div><div>Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"264 ","pages":"Article 125792"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026599","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Efficient order dispatching is crucial for online ride-hailing systems, directly influencing user experience and platform revenue. Traditional methods often focus on maximizing immediate revenue through local observations of individual vehicles, ignoring the long-term potential benefits, the dynamic nature of dispatching systems, and the importance of collaboration among distributed vehicles. This typically results in suboptimal performance. To address these issues, we propose FedMARL4OD, a novel Federated Multi-Agent Deep Reinforcement Learning framework designed to optimize order dispatching. This framework integrates local learning via Multi-Agent Reinforcement Learning (MARL) for individual vehicles and global learning via Federated Multi-Agent Reinforcement Learning (FedMARL) across all vehicles. Specifically, we introduce an innovative reward mechanism in local learning that considers both the current revenue of each order and the supply–demand dynamics of the system related to potential future revenue, thereby improving dispatching performance. Moreover, we introduce a scalable model aggregation method in global learning that explicitly models interactions among distributed vehicles to facilitate collaborative learning. By progressively integrating local and global insights through average parameter aggregation, this method not only reduces communication overhead and enhances the learning efficiency of agents, but also ensures system scalability and maintains data privacy. Extensive real-world simulations demonstrate that FedMARL4OD outperforms baseline methods, achieving a 9.17% increase in Accumulated Driver Income (ADI) and a 7.75% improvement in Order Response Rate (ORR). The ADI improvements demonstrate the framework’s effectiveness in boosting revenue, while the enhanced ORR indicates a quicker fulfillment of users’ requests, improving user experience.

查看原文本刊更多论文

通过联合多代理深度强化学习实现可扩展订单调度

高效的订单调度对在线叫车系统至关重要，直接影响用户体验和平台收入。传统方法通常侧重于通过对单个车辆的局部观察来实现即时收益最大化，而忽视了长期潜在收益、调度系统的动态性质以及分布式车辆间协作的重要性。这通常会导致性能不达标。为了解决这些问题，我们提出了 FedMARL4OD，这是一个新颖的联合多代理深度强化学习框架，旨在优化订单调度。该框架通过多代理强化学习（MARL）对单个车辆进行局部学习，并通过联邦多代理强化学习（FedMARL）对所有车辆进行全局学习。具体来说，我们在局部学习中引入了一种创新奖励机制，既考虑每个订单的当前收入，又考虑与潜在未来收入相关的系统供需动态，从而提高调度性能。此外，我们还在全局学习中引入了一种可扩展的模型聚合方法，该方法对分布式车辆之间的互动进行了明确建模，以促进协作学习。通过平均参数聚合逐步整合本地和全局洞察力，该方法不仅降低了通信开销，提高了代理的学习效率，还确保了系统的可扩展性并维护了数据隐私。大量实际模拟证明，FedMARL4OD 的性能优于基线方法，累计驱动收入（ADI）提高了 9.17%，订单响应率（ORR）提高了 7.75%。ADI 的提高证明了该框架在提高收入方面的有效性，而 ORR 的提高则表明用户的请求得到了更快的满足，从而改善了用户体验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.