Swarm reinforcement learning for traffic signal control based on cooperative multi-agent framework

Mohammed Tahifa, J. Boumhidi, Ali Yahyaouy
{"title":"Swarm reinforcement learning for traffic signal control based on cooperative multi-agent framework","authors":"Mohammed Tahifa, J. Boumhidi, Ali Yahyaouy","doi":"10.1109/ISACV.2015.7105536","DOIUrl":null,"url":null,"abstract":"Congestion, accidents, pollution, and many other problems resulting from urban traffic are present every day in most cities around the world. The growing number of traffic lights in intersections needs efficient control, and hence, automatic systems are essential nowadays for optimally tackling this task. Agent based technologies and reinforcements learning are largely used for modelling and controlling intelligent transportation systems, where agents represent a traffic signal controller. Each agent learns to achieve its goal through many episodes. With a complicated learning problem, it may take much computation time to acquire the optimal policy. In this paper, we use a population based methods such as particle swarm optimization to be able to find rapidly the global optimal solution for multimodal functions with wide solution space. Agents learn through not only on their respective experiences, but also by exchanging information among them, simulation results show that the swarm Q-learning surpass the simple Q-learning causing less average delay time and higher flow rate.","PeriodicalId":426557,"journal":{"name":"2015 Intelligent Systems and Computer Vision (ISCV)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISACV.2015.7105536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 20

Abstract

Congestion, accidents, pollution, and many other problems resulting from urban traffic are present every day in most cities around the world. The growing number of traffic lights in intersections needs efficient control, and hence, automatic systems are essential nowadays for optimally tackling this task. Agent based technologies and reinforcements learning are largely used for modelling and controlling intelligent transportation systems, where agents represent a traffic signal controller. Each agent learns to achieve its goal through many episodes. With a complicated learning problem, it may take much computation time to acquire the optimal policy. In this paper, we use a population based methods such as particle swarm optimization to be able to find rapidly the global optimal solution for multimodal functions with wide solution space. Agents learn through not only on their respective experiences, but also by exchanging information among them, simulation results show that the swarm Q-learning surpass the simple Q-learning causing less average delay time and higher flow rate.
基于协同多智能体框架的交通信号控制群体强化学习
拥堵、事故、污染和许多其他由城市交通引起的问题在世界上大多数城市每天都存在。十字路口交通灯的数量不断增加,需要有效的控制,因此,自动化系统对于优化处理这一任务至关重要。基于智能体的技术和强化学习主要用于智能交通系统的建模和控制,其中智能体代表交通信号控制器。每个智能体通过许多情节来学习实现其目标。这是一个复杂的学习问题,获取最优策略可能需要大量的计算时间。本文采用粒子群优化等基于种群的方法,快速求解具有宽解空间的多模态函数的全局最优解。智能体不仅通过各自的经验进行学习,还通过相互之间的信息交换进行学习,仿真结果表明,群体q -学习优于简单q -学习,平均延迟时间更短,流量更高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信