Adversarial environment design for crowd navigation based on deep reinforcement learning

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Jeongeun Kim , Hyo-Seok Hwang , Junhee Seok
{"title":"Adversarial environment design for crowd navigation based on deep reinforcement learning","authors":"Jeongeun Kim ,&nbsp;Hyo-Seok Hwang ,&nbsp;Junhee Seok","doi":"10.1016/j.engappai.2025.111621","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread use of mobile robots has increased the shared space between humans and robots, necessitating advanced solutions for crowd navigation. Recent studies have proposed approaches based on deep reinforcement learning to safely and efficiently achieve this goal. However, these approaches face challenges such as difficulty in presenting diverse pedestrian patterns and limited generalization performance. This study proposes a framework called Simultaneous training Process with Adversarial Crowd Environment (SPACE), which is an implemented artificial intelligence that generates crowd navigation environments. This framework competitively trains a crowd navigation agent and an adversarial crowd environment. In the adversarial crowd environment, the adversarial agent places pedestrians to induce collisions with the crowd navigation agent. By applying artificial intelligence within the episode-generation, this framework addresses vulnerabilities of previous approaches and allows the training of robust crowd navigation agents with high generalization performance. Experimental results demonstrate up to a 24.62% increase in navigation success rate and a 41.6% improvement in minimum distance from pedestrians compared to agents trained in non-adversarial environments, ensuring safer crowd navigation. Furthermore, SPACE exhibits more stable navigation performance in evaluation environment settings that are significantly more complex than the training scenarios. These findings highlight the promise of SPACE for training crowd navigation agents capable of operating effectively under diverse environmental conditions.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111621"},"PeriodicalIF":7.5000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625016239","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The widespread use of mobile robots has increased the shared space between humans and robots, necessitating advanced solutions for crowd navigation. Recent studies have proposed approaches based on deep reinforcement learning to safely and efficiently achieve this goal. However, these approaches face challenges such as difficulty in presenting diverse pedestrian patterns and limited generalization performance. This study proposes a framework called Simultaneous training Process with Adversarial Crowd Environment (SPACE), which is an implemented artificial intelligence that generates crowd navigation environments. This framework competitively trains a crowd navigation agent and an adversarial crowd environment. In the adversarial crowd environment, the adversarial agent places pedestrians to induce collisions with the crowd navigation agent. By applying artificial intelligence within the episode-generation, this framework addresses vulnerabilities of previous approaches and allows the training of robust crowd navigation agents with high generalization performance. Experimental results demonstrate up to a 24.62% increase in navigation success rate and a 41.6% improvement in minimum distance from pedestrians compared to agents trained in non-adversarial environments, ensuring safer crowd navigation. Furthermore, SPACE exhibits more stable navigation performance in evaluation environment settings that are significantly more complex than the training scenarios. These findings highlight the promise of SPACE for training crowd navigation agents capable of operating effectively under diverse environmental conditions.
基于深度强化学习的人群导航对抗环境设计
移动机器人的广泛使用增加了人与机器人之间的共享空间,需要先进的人群导航解决方案。最近的研究提出了基于深度强化学习的方法来安全有效地实现这一目标。然而,这些方法面临着一些挑战,如难以呈现不同的行人模式和有限的泛化性能。本研究提出了一个名为“对抗人群环境同步训练过程”(Simultaneous training Process with Adversarial Crowd Environment, SPACE)的框架,该框架是一种实现的人工智能,可生成人群导航环境。这个框架竞争性地训练了一个群体导航代理和一个对抗的群体环境。在对抗人群环境中,对抗智能体放置行人以诱导与人群导航智能体的碰撞。通过在情节生成中应用人工智能,该框架解决了以前方法的漏洞,并允许训练具有高泛化性能的鲁棒人群导航代理。实验结果表明,与在非对抗环境中训练的智能体相比,导航成功率提高了24.62%,距离行人的最小距离提高了41.6%,确保了更安全的人群导航。此外,在比训练场景复杂得多的评估环境设置中,SPACE显示出更稳定的导航性能。这些发现突出了SPACE在训练能够在不同环境条件下有效运行的人群导航代理方面的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信