Adversarial environment design for crowd navigation based on deep reinforcement learning

IF 7.5 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-07-07 DOI:10.1016/j.engappai.2025.111621

Jeongeun Kim , Hyo-Seok Hwang , Junhee Seok

{"title":"Adversarial environment design for crowd navigation based on deep reinforcement learning","authors":"Jeongeun Kim , Hyo-Seok Hwang , Junhee Seok","doi":"10.1016/j.engappai.2025.111621","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread use of mobile robots has increased the shared space between humans and robots, necessitating advanced solutions for crowd navigation. Recent studies have proposed approaches based on deep reinforcement learning to safely and efficiently achieve this goal. However, these approaches face challenges such as difficulty in presenting diverse pedestrian patterns and limited generalization performance. This study proposes a framework called Simultaneous training Process with Adversarial Crowd Environment (SPACE), which is an implemented artificial intelligence that generates crowd navigation environments. This framework competitively trains a crowd navigation agent and an adversarial crowd environment. In the adversarial crowd environment, the adversarial agent places pedestrians to induce collisions with the crowd navigation agent. By applying artificial intelligence within the episode-generation, this framework addresses vulnerabilities of previous approaches and allows the training of robust crowd navigation agents with high generalization performance. Experimental results demonstrate up to a 24.62% increase in navigation success rate and a 41.6% improvement in minimum distance from pedestrians compared to agents trained in non-adversarial environments, ensuring safer crowd navigation. Furthermore, SPACE exhibits more stable navigation performance in evaluation environment settings that are significantly more complex than the training scenarios. These findings highlight the promise of SPACE for training crowd navigation agents capable of operating effectively under diverse environmental conditions.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111621"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625016239","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The widespread use of mobile robots has increased the shared space between humans and robots, necessitating advanced solutions for crowd navigation. Recent studies have proposed approaches based on deep reinforcement learning to safely and efficiently achieve this goal. However, these approaches face challenges such as difficulty in presenting diverse pedestrian patterns and limited generalization performance. This study proposes a framework called Simultaneous training Process with Adversarial Crowd Environment (SPACE), which is an implemented artificial intelligence that generates crowd navigation environments. This framework competitively trains a crowd navigation agent and an adversarial crowd environment. In the adversarial crowd environment, the adversarial agent places pedestrians to induce collisions with the crowd navigation agent. By applying artificial intelligence within the episode-generation, this framework addresses vulnerabilities of previous approaches and allows the training of robust crowd navigation agents with high generalization performance. Experimental results demonstrate up to a 24.62% increase in navigation success rate and a 41.6% improvement in minimum distance from pedestrians compared to agents trained in non-adversarial environments, ensuring safer crowd navigation. Furthermore, SPACE exhibits more stable navigation performance in evaluation environment settings that are significantly more complex than the training scenarios. These findings highlight the promise of SPACE for training crowd navigation agents capable of operating effectively under diverse environmental conditions.

查看原文本刊更多论文

基于深度强化学习的人群导航对抗环境设计

移动机器人的广泛使用增加了人与机器人之间的共享空间，需要先进的人群导航解决方案。最近的研究提出了基于深度强化学习的方法来安全有效地实现这一目标。然而，这些方法面临着一些挑战，如难以呈现不同的行人模式和有限的泛化性能。本研究提出了一个名为“对抗人群环境同步训练过程”（Simultaneous training Process with Adversarial Crowd Environment， SPACE）的框架，该框架是一种实现的人工智能，可生成人群导航环境。这个框架竞争性地训练了一个群体导航代理和一个对抗的群体环境。在对抗人群环境中，对抗智能体放置行人以诱导与人群导航智能体的碰撞。通过在情节生成中应用人工智能，该框架解决了以前方法的漏洞，并允许训练具有高泛化性能的鲁棒人群导航代理。实验结果表明，与在非对抗环境中训练的智能体相比，导航成功率提高了24.62%，距离行人的最小距离提高了41.6%，确保了更安全的人群导航。此外，在比训练场景复杂得多的评估环境设置中，SPACE显示出更稳定的导航性能。这些发现突出了SPACE在训练能够在不同环境条件下有效运行的人群导航代理方面的前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.