CCMA: A framework for cascading cooperative multi-agent in autonomous driving merging using Large Language Models

IF 7.5 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems with Applications Pub Date : 2025-04-24 DOI:10.1016/j.eswa.2025.127717

Miao Zhang , Zhenlong Fang , Tianyi Wang , Shuai Lu , Xueqian Wang , Tianyu Shi

{"title":"CCMA: A framework for cascading cooperative multi-agent in autonomous driving merging using Large Language Models","authors":"Miao Zhang , Zhenlong Fang , Tianyi Wang , Shuai Lu , Xueqian Wang , Tianyu Shi","doi":"10.1016/j.eswa.2025.127717","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional Reinforcement Learning (RL) suffers from challenges in replicating human-like behaviors, generalizing effectively in multi-agent scenarios, and overcoming inherent interpretability issues. These tasks become even more difficult when they require a deep understanding of the environment, coordination of agents’ intentions and driving styles across various scenarios, and the overall optimization of safety, efficiency, and comfort in dynamic environments. Recently, Large Language Model (LLM) enhanced methods have shown promise in improving generalization and interoperability. However, these approaches primarily focus on single-agent scenarios and often neglect the necessary coordination among multiple road users. Therefore, in this paper, we introduce the Cascading Cooperative Multi-agent (CCMA) framework, designed to address these challenges by enhancing human-like behaviors and fostering multi-level cooperation across diverse multi-agent driving tasks, ultimately improving both micro and macro-level performance in complex driving environments. Specifically, the CCMA framework integrates RL for individual interactions, a fine-tuned LLM for regional cooperation, a reward function for global optimization, and the Retrieval-augmented Generation mechanism to dynamically optimize decision-making across complex driving scenarios. Our experiments demonstrate that our CCMA method not only enhances human-like behaviors and interpretability, but also outperforms other state-of-the-art RL methods in multi-agent environments. These findings highlight the significant impact of cascading coordinated communication and dynamic functional alignment in advanced, human-like multi-agent autonomous driving environments. Our project page is <span><span>https://miaorain.github.io/rainrun.github.io/</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"282 ","pages":"Article 127717"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425013399","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional Reinforcement Learning (RL) suffers from challenges in replicating human-like behaviors, generalizing effectively in multi-agent scenarios, and overcoming inherent interpretability issues. These tasks become even more difficult when they require a deep understanding of the environment, coordination of agents’ intentions and driving styles across various scenarios, and the overall optimization of safety, efficiency, and comfort in dynamic environments. Recently, Large Language Model (LLM) enhanced methods have shown promise in improving generalization and interoperability. However, these approaches primarily focus on single-agent scenarios and often neglect the necessary coordination among multiple road users. Therefore, in this paper, we introduce the Cascading Cooperative Multi-agent (CCMA) framework, designed to address these challenges by enhancing human-like behaviors and fostering multi-level cooperation across diverse multi-agent driving tasks, ultimately improving both micro and macro-level performance in complex driving environments. Specifically, the CCMA framework integrates RL for individual interactions, a fine-tuned LLM for regional cooperation, a reward function for global optimization, and the Retrieval-augmented Generation mechanism to dynamically optimize decision-making across complex driving scenarios. Our experiments demonstrate that our CCMA method not only enhances human-like behaviors and interpretability, but also outperforms other state-of-the-art RL methods in multi-agent environments. These findings highlight the significant impact of cascading coordinated communication and dynamic functional alignment in advanced, human-like multi-agent autonomous driving environments. Our project page is https://miaorain.github.io/rainrun.github.io/.

查看原文本刊更多论文

基于大语言模型的自动驾驶级联协作多智能体合并框架

传统的强化学习（RL）在复制类人行为、在多智能体场景中有效泛化以及克服固有的可解释性问题方面面临挑战。当这些任务需要对环境有深入的了解，需要在各种场景中协调agent的意图和驾驶风格，以及在动态环境中对安全性、效率和舒适性进行整体优化时，这些任务就变得更加困难。近年来，大型语言模型（LLM）增强方法在提高泛化和互操作性方面表现出了良好的前景。然而，这些方法主要关注单智能体场景，往往忽略了多个道路使用者之间必要的协调。因此，在本文中，我们引入了级联合作多智能体（Cascading Cooperative Multi-agent， CCMA）框架，旨在通过增强类人行为和促进跨不同多智能体驾驶任务的多层次合作来解决这些挑战，最终提高复杂驾驶环境中的微观和宏观性能。具体而言，CCMA框架集成了用于个体交互的RL，用于区域合作的微调LLM，用于全局优化的奖励函数，以及用于动态优化复杂驾驶场景决策的检索增强生成机制。我们的实验表明，我们的CCMA方法不仅增强了类人行为和可解释性，而且在多智能体环境中优于其他最先进的强化学习方法。这些发现强调了级联协调通信和动态功能对齐在高级类人多智能体自动驾驶环境中的重要影响。我们的项目页面是https://miaorain.github.io/rainrun.github.io/。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems with Applications 工程技术-工程：电子与电气

CiteScore

13.80

自引率

10.60%

发文量

2045

审稿时长

8.7 months

期刊介绍： Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.