Large scale multi-GPU based parallel traffic simulation for accelerated traffic assignment and propagation

IF 7.6 1区工程技术 Q1 TRANSPORTATION SCIENCE & TECHNOLOGY

Transportation Research Part C-Emerging Technologies Pub Date : 2024-10-10 DOI:10.1016/j.trc.2024.104873

Xuan Jiang , Raja Sengupta , James Demmel , Samuel Williams

{"title":"Large scale multi-GPU based parallel traffic simulation for accelerated traffic assignment and propagation","authors":"Xuan Jiang , Raja Sengupta , James Demmel , Samuel Williams","doi":"10.1016/j.trc.2024.104873","DOIUrl":null,"url":null,"abstract":"<div><div>Traffic simulation is a critical tool for congestion analysis, travel time estimation, and route optimization in urban planning, benefiting navigation apps, transportation network companies, and state agencies. Traditionally, traffic micro-simulation frameworks are based on road segments and can only support a limited number of main roads. Efficient traffic simulation on a regional scale remains a significant challenge due to the complexity of urban mobility and the large scale of spatiotemporal data. This paper introduces a Large Scale Multi-GPU Parallel Computing based Regional Scale Traffic Simulation Framework (LPSim), which leverages graphical processing unit (GPU) parallel computing to address these challenges. LPSim utilizes a multi-GPU architecture to simulate extensive and dynamic traffic networks with high fidelity and reduced computation time. Using the parallel processing capabilities of GPUs, LPSim can perform tens of millions of individual vehicle dynamics simulations simultaneously, significantly outperforming traditional CPU-based approaches. The framework is designed to be scalable and can easily accommodate the increasing complexity of traffic simulations. We present the theory behind GPU-based traffic simulation, the architecture of single- and multi-GPU based simulations, and the graph partition strategies that enhance computation resource load balance. Our experimental results demonstrate the effectiveness of LPSim in simulating large-scale traffic scenarios. LPSim is capable of completing simulations of 2.82 million trips in just 6.28 min on a single GPU machine equipped with 5120 CUDA cores (Tesla V100-SXM2). Furthermore, utilizing a Google Cloud instance with two NVIDIA V100 GPUs, which collectively offer 10240 CUDA cores, LPSim successfully simulates 9.01 million trips within 21.16 min. We further tested our simulator with the same demand on dual NVIDIA A100-PCIE-40GB GPUs, which finished the simulation in 0.0398 h, approximately 113 times faster than the same simulation scenario running on an Intel(R) Xeon(R) Gold 6326 CPU @ 2.90 GHz, which takes 4.49 h to complete. This performance not only demonstrates its speed and scalability advantages over traditional simulation techniques but also highlights LPSim’s unique position as the first traffic simulation framework that is scalable for both single- and multiple-GPU configurations. Consequently, LPSim provides an invaluable tool for individuals and extensive research teams alike, enabling the acquisition of large-scale traffic simulation results in a time-efficient manner. LPSim code is available at: <span><span>https://github.com/Xuan-1998/LPSim</span><svg><path></path></svg></span></div></div>","PeriodicalId":54417,"journal":{"name":"Transportation Research Part C-Emerging Technologies","volume":"169 ","pages":"Article 104873"},"PeriodicalIF":7.6000,"publicationDate":"2024-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part C-Emerging Technologies","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968090X24003942","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION SCIENCE & TECHNOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Traffic simulation is a critical tool for congestion analysis, travel time estimation, and route optimization in urban planning, benefiting navigation apps, transportation network companies, and state agencies. Traditionally, traffic micro-simulation frameworks are based on road segments and can only support a limited number of main roads. Efficient traffic simulation on a regional scale remains a significant challenge due to the complexity of urban mobility and the large scale of spatiotemporal data. This paper introduces a Large Scale Multi-GPU Parallel Computing based Regional Scale Traffic Simulation Framework (LPSim), which leverages graphical processing unit (GPU) parallel computing to address these challenges. LPSim utilizes a multi-GPU architecture to simulate extensive and dynamic traffic networks with high fidelity and reduced computation time. Using the parallel processing capabilities of GPUs, LPSim can perform tens of millions of individual vehicle dynamics simulations simultaneously, significantly outperforming traditional CPU-based approaches. The framework is designed to be scalable and can easily accommodate the increasing complexity of traffic simulations. We present the theory behind GPU-based traffic simulation, the architecture of single- and multi-GPU based simulations, and the graph partition strategies that enhance computation resource load balance. Our experimental results demonstrate the effectiveness of LPSim in simulating large-scale traffic scenarios. LPSim is capable of completing simulations of 2.82 million trips in just 6.28 min on a single GPU machine equipped with 5120 CUDA cores (Tesla V100-SXM2). Furthermore, utilizing a Google Cloud instance with two NVIDIA V100 GPUs, which collectively offer 10240 CUDA cores, LPSim successfully simulates 9.01 million trips within 21.16 min. We further tested our simulator with the same demand on dual NVIDIA A100-PCIE-40GB GPUs, which finished the simulation in 0.0398 h, approximately 113 times faster than the same simulation scenario running on an Intel(R) Xeon(R) Gold 6326 CPU @ 2.90 GHz, which takes 4.49 h to complete. This performance not only demonstrates its speed and scalability advantages over traditional simulation techniques but also highlights LPSim’s unique position as the first traffic simulation framework that is scalable for both single- and multiple-GPU configurations. Consequently, LPSim provides an invaluable tool for individuals and extensive research teams alike, enabling the acquisition of large-scale traffic simulation results in a time-efficient manner. LPSim code is available at: https://github.com/Xuan-1998/LPSim

查看原文本刊更多论文

基于多 GPU 的大规模并行交通仿真，用于加速交通分配和传播

交通模拟是城市规划中进行拥堵分析、旅行时间估算和路线优化的重要工具，导航应用程序、交通网络公司和国家机构都从中受益。传统的交通微观模拟框架以路段为基础，只能支持数量有限的主干道。由于城市交通的复杂性和时空数据的庞大规模，区域范围内的高效交通模拟仍然是一项重大挑战。本文介绍了基于大规模多 GPU 并行计算的区域规模交通仿真框架（LPSim），该框架利用图形处理器（GPU）并行计算来应对这些挑战。LPSim 利用多 GPU 架构，以高保真和更短的计算时间模拟广泛的动态交通网络。利用 GPU 的并行处理能力，LPSim 可同时执行数千万次单个车辆动力学模拟，大大优于基于 CPU 的传统方法。该框架具有可扩展性，可轻松适应日益复杂的交通模拟。我们介绍了基于 GPU 的交通仿真背后的理论、基于单 GPU 和多 GPU 的仿真架构，以及可提高计算资源负载平衡的图分割策略。我们的实验结果证明了 LPSim 在模拟大规模交通场景方面的有效性。在配备 5120 个 CUDA 内核（Tesla V100-SXM2）的单 GPU 机器上，LPSim 能够在短短 6.28 分钟内完成 282 万次行程的模拟。此外，在谷歌云实例中，LPSim 使用了两台英伟达 V100 GPU（共提供 10240 个 CUDA 内核），在 21.16 分钟内成功模拟了 901 万次行程。我们还在双英伟达A100-PCIE-40GB GPU上以同样的需求测试了我们的模拟器，它在0.0398小时内完成了模拟，比在英特尔（R）至强（R）Gold 6326 CPU（主频2.90 GHz）上运行同样的模拟场景（耗时4.49小时）快了约113倍。这一性能不仅证明了LPSim在速度和可扩展性方面优于传统仿真技术，还凸显了LPSim作为首个可扩展至单GPU和多GPU配置的交通仿真框架的独特地位。因此，LPSim 为个人和广泛的研究团队提供了一个宝贵的工具，使其能够以一种省时高效的方式获取大规模交通仿真结果。LPSim 代码可在以下网址获取： https://github.com/Xuan-1998/LPSim

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Transportation Research Part C-Emerging Technologies 工程技术-运输科技

CiteScore

15.80

自引率

12.00%

发文量

332

审稿时长

64 days

期刊介绍： Transportation Research: Part C (TR_C) is dedicated to showcasing high-quality, scholarly research that delves into the development, applications, and implications of transportation systems and emerging technologies. Our focus lies not solely on individual technologies, but rather on their broader implications for the planning, design, operation, control, maintenance, and rehabilitation of transportation systems, services, and components. In essence, the intellectual core of the journal revolves around the transportation aspect rather than the technology itself. We actively encourage the integration of quantitative methods from diverse fields such as operations research, control systems, complex networks, computer science, and artificial intelligence. Join us in exploring the intersection of transportation systems and emerging technologies to drive innovation and progress in the field.