A joint travel mode and departure time choice model in dynamic multimodal transportation networks based on deep reinforcement learning

Multimodal Transportation | Pub Date: 2024-05-11 | DOI: 10.1016/j.multra.2024.100137

Ziyuan Gu, Yukai Wang, Wei Ma, Zhiyuan Liu


Citations: 0

Abstract

Deciding on travel choices in dynamic multimodal transportation networks is non-trivial. In this paper, we tackle this problem by proposing a new joint travel mode and departure time choice (JTMDTC) model based on deep reinforcement learning (DRL). The objective of the model is to maximize individuals' travel utilities across multiple days. This is accomplished by establishing a problem-specific Markov decision process to characterize the multi-day JTMDTC and developing a customized Deep Q-Network as the solution scheme. To make the approach applicable to many individuals with travel decision-making requests, a clustering method is integrated with DRL to obtain representative individuals for model training, resulting in an elegant and computationally efficient approach. Extensive numerical experiments based on multimodal microscopic traffic simulation are conducted on a real-world network of Suzhou, China, to demonstrate the effectiveness of the proposed approach. The results indicate that the proposed approach makes (near-)optimal joint choices for different individuals in complex traffic environments, consistently yields higher travel utilities than the alternatives, and is robust to changes in model parameters.
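The multi-day decision process described in the abstract can be illustrated with a small sketch. This is not the authors' model: it swaps the customized Deep Q-Network for plain tabular Q-learning, and it leaves out the clustering step and the traffic simulation entirely. The mode set, departure slots, utility function, five-day horizon, and hyperparameters below are all hypothetical choices made only to show how a joint (mode, departure time) action and a day-indexed state might fit together.

```python
import random
from itertools import product

# Hypothetical choice sets; the paper's actual modes and time slots are not given here.
MODES = ["car", "bus", "metro"]
DEPARTURES = [7.0, 7.5, 8.0, 8.5]            # departure times in hours
ACTIONS = list(product(MODES, DEPARTURES))   # joint action = (mode, departure time)
N_DAYS = 5                                   # assumed planning horizon (state = day index)
WORK_START = 9.0
BASE_TIME = {"car": 0.6, "bus": 0.9, "metro": 0.7}  # made-up travel times in hours


def toy_utility(mode, depart):
    """Made-up deterministic daily utility: travel time plus schedule-delay penalties."""
    travel = BASE_TIME[mode]
    if mode == "car" and depart >= 7.5:
        travel += 0.5                        # assumed peak-hour road congestion
    arrival = depart + travel
    early = max(0.0, WORK_START - arrival)
    late = max(0.0, arrival - WORK_START)
    return -travel - 0.5 * early - 2.0 * late


def train(episodes=3000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning over a multi-day episode (a stand-in for a DQN)."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(N_DAYS)]
    for _ in range(episodes):
        for day in range(N_DAYS):
            # Epsilon-greedy selection over the joint action set.
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[day][i])
            reward = toy_utility(*ACTIONS[a])
            # The last day is terminal, so it has no future value.
            future = max(q[day + 1]) if day + 1 < N_DAYS else 0.0
            q[day][a] += alpha * (reward + gamma * future - q[day][a])
    return q


def greedy_policy(q):
    """Return the highest-valued joint action for each day."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: row[i])] for row in q]


q_table = train()
policy = greedy_policy(q_table)
```

Because the toy utility does not change from day to day, the learned policy repeats the same joint action on every day. In a simulated network, the reward would instead come from the traffic environment and would respond to congestion created by other travelers, which is where a learned function approximator such as a DQN becomes useful.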




Source journal

Multimodal Transportation

CiteScore

5.10

Self-citation rate

0.00%

Articles published