GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Saman Kazemkhani, Aarav Pandya, Daphne Cornelisse, Brennan Shacklett, Eugene Vinitsky

arXiv:2408.01584 · arXiv - CS - Performance · Published 2024-08-02
Abstract
Multi-agent learning algorithms have been successful at generating superhuman planning in a wide variety of games but have had little impact on the design of deployed multi-agent planners. A key bottleneck in applying these techniques to multi-agent planning is that they require billions of steps of experience. To enable the study of multi-agent planning at this scale, we present GPUDrive, a GPU-accelerated, multi-agent simulator built on top of the Madrona Game Engine that can generate over a million steps of experience per second. Observation, reward, and dynamics functions are written directly in C++, allowing users to define complex, heterogeneous agent behaviors that are lowered to high-performance CUDA. We show that, using GPUDrive, we can efficiently train reinforcement learning agents over many scenes in the Waymo Motion dataset, yielding highly effective goal-reaching agents in minutes for individual scenes and generally capable agents in a few hours. We ship these trained agents as part of the codebase at https://github.com/Emerge-Lab/gpudrive.
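
For a concrete picture of how a batched, GPU-resident simulator like this is typically driven from a training loop, here is a minimal Python sketch. The class name `GPUDriveEnv`, its constructor arguments, and the gym-style `reset`/`step`/`action_dim` interface are hypothetical stand-ins for illustration, not GPUDrive's documented API; see the repository above for the actual bindings.

```python
# Minimal sketch of collecting experience from a GPU-batched simulator
# via PyTorch. NOTE: `GPUDriveEnv` and its reset()/step()/action_dim
# interface are hypothetical stand-ins, not GPUDrive's documented API;
# consult https://github.com/Emerge-Lab/gpudrive for the real bindings.
import torch

def collect_experience(env, num_steps: int) -> torch.Tensor:
    """Roll the batched simulator forward with random actions.

    `env` is assumed to expose gym-like reset()/step() methods that
    operate on whole batches of (worlds x agents) at once and return
    GPU-resident tensors, so no per-step host/device copies are needed.
    """
    obs = env.reset()  # assumed shape: (num_worlds, max_agents, obs_dim)
    total_reward = torch.zeros(obs.shape[:2], device=obs.device)
    for _ in range(num_steps):
        # Random actions stand in for a learned policy during this sketch.
        actions = torch.randint(
            0, env.action_dim, obs.shape[:2], device=obs.device
        )
        obs, reward, done, info = env.step(actions)
        total_reward += reward
    return total_reward

# Hypothetical construction: many Waymo scenes simulated in parallel on
# one GPU, which is where the million-steps-per-second throughput comes from.
# env = GPUDriveEnv(num_worlds=1024, dataset="waymo_motion", device="cuda")
# returns = collect_experience(env, num_steps=1_000)
```

The key design point the sketch reflects is that observations, rewards, and actions stay on the GPU as batched tensors across all worlds and agents, so the learning framework consumes experience without round-trips through host memory.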