Ambulance Dispatch via Deep Reinforcement Learning

Proceedings of the 28th International Conference on Advances in Geographic Information Systems Pub Date : 2020-11-03 DOI:10.1145/3397536.3422204

Kunpeng Liu, Xiaolin Li, C. Zou, Haibo Huang, Yanjie Fu

引用次数: 10

Abstract

In this paper, we solve the ambulance dispatch problem with a reinforcement learning oriented strategy. The ambulance dispatch problem is defined as deciding which ambulance to pick up which patient. Traditional studies on ambulance dispatch mainly focus on predefined protocols and are verified on simple simulation data, which are not flexible enough when facing the dynamically changing real-world cases. In this paper, we propose an efficient ambulance dispatch method based on the reinforcement learning framework, i.e., Multi-Agent Q-Network with Experience Replay(MAQR). Specifically, we firstly reformulate the ambulance dispatch problem with a multi-agent reinforcement learning framework, and then design the state, action, and reward function correspondingly for the framework. Thirdly, we design a simulator that controls ambulance status, generates patient requests and interacts with ambulances. Finally, we design extensive experiments to demonstrate the superiority of the proposed method.

查看原文本刊更多论文

通过深度强化学习调度救护车

在本文中，我们用一种面向强化学习的策略来解决救护车调度问题。救护车调度问题被定义为决定哪辆救护车接哪个病人。传统的救护车调度研究主要集中在预定义的协议上，并在简单的仿真数据上进行验证，在面对动态变化的现实情况时不够灵活。在本文中，我们提出了一种基于强化学习框架的高效救护车调度方法，即多agent Q-Network with Experience Replay(MAQR)。具体来说，我们首先用一个多智能体强化学习框架重新表述救护车调度问题，然后为该框架设计相应的状态函数、动作函数和奖励函数。第三，我们设计了一个模拟器来控制救护车状态，生成病人的请求，并与救护车进行交互。最后，我们设计了大量的实验来证明所提出方法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 28th International Conference on Advances in Geographic Information Systems

自引率

0.00%

发文量