Swarm reinforcement learning for traffic signal control based on cooperative multi-agent framework

2015 Intelligent Systems and Computer Vision (ISCV) | Pub Date: 2015-03-25 | DOI: 10.1109/ISACV.2015.7105536

Mohammed Tahifa, J. Boumhidi, Ali Yahyaouy


Citations: 20

Abstract

Congestion, accidents, pollution, and many other problems caused by urban traffic occur every day in most cities around the world. The growing number of traffic lights at intersections requires efficient control, so automatic systems are now essential for handling this task well. Agent-based technologies and reinforcement learning are widely used for modelling and controlling intelligent transportation systems, where each agent represents a traffic signal controller. Each agent learns to achieve its goal over many episodes. When the learning problem is complicated, acquiring the optimal policy may take considerable computation time. In this paper, we use a population-based method such as particle swarm optimization, which can rapidly find the global optimum of multimodal functions with a wide solution space. Agents learn not only from their own experience but also by exchanging information with one another. Simulation results show that swarm Q-learning outperforms simple Q-learning, yielding a lower average delay time and a higher flow rate.
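The abstract does not give the update rule, so the following is only a minimal sketch of how Q-learning agents might combine individual learning with a particle-swarm-style information exchange. Everything in it is an assumption made for illustration: the toy two-approach intersection model, the names `step`, `run_episode`, and `swarm_q_learning`, the parameters `w`, `c1`, `c2`, and the choice to score each agent by its cumulative reward in a single episode. It is not the authors' implementation.

```python
import copy
import random

CAP = 4            # queue lengths are clipped to 0..CAP (hypothetical toy model)
ACTIONS = (0, 1)   # 0 = green for approach 0, 1 = green for approach 1

def new_table():
    """A table with one value per (state, action), initialised to zero."""
    return {(a, b): [0.0, 0.0] for a in range(CAP + 1) for b in range(CAP + 1)}

def step(state, action, rng):
    """Serve up to 2 cars on the green approach, then add random arrivals."""
    queues = list(state)
    queues[action] = max(0, queues[action] - 2)
    for i in (0, 1):
        if rng.random() < 0.4:
            queues[i] = min(CAP, queues[i] + 1)
    return tuple(queues), -float(sum(queues))  # reward = negative total queue

def run_episode(q, rng, alpha=0.1, gamma=0.9, eps=0.1, steps=50):
    """One epsilon-greedy Q-learning episode; updates q in place and
    returns the cumulative (undiscounted) reward."""
    state, total = (0, 0), 0.0
    for _ in range(steps):
        if rng.random() < eps:
            action = rng.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[state][a])
        nxt, reward = step(state, action, rng)
        target = reward + gamma * max(q[nxt])
        q[state][action] += alpha * (target - q[state][action])
        state, total = nxt, total + reward
    return total

def swarm_q_learning(n_agents=4, episodes=30, w=0.5, c1=0.5, c2=0.5, seed=0):
    """Assumed scheme: each agent learns alone for one episode, then every
    Q-table is pulled toward its personal best and the swarm's global best."""
    rng = random.Random(seed)
    tables = [new_table() for _ in range(n_agents)]
    velocity = [new_table() for _ in range(n_agents)]  # same shape as a Q-table
    pbest = [copy.deepcopy(t) for t in tables]
    pbest_score = [float("-inf")] * n_agents
    gbest, gbest_score = copy.deepcopy(tables[0]), float("-inf")
    for _ in range(episodes):
        # Phase 1: individual Q-learning; the episode reward scores each table.
        for i, q in enumerate(tables):
            score = run_episode(q, rng)
            if score > pbest_score[i]:
                pbest_score[i], pbest[i] = score, copy.deepcopy(q)
            if score > gbest_score:
                gbest_score, gbest = score, copy.deepcopy(q)
        # Phase 2: information exchange via a PSO-style velocity update.
        for i, q in enumerate(tables):
            for s in q:
                for a in ACTIONS:
                    r1, r2 = rng.random(), rng.random()
                    velocity[i][s][a] = (w * velocity[i][s][a]
                                         + c1 * r1 * (pbest[i][s][a] - q[s][a])
                                         + c2 * r2 * (gbest[s][a] - q[s][a]))
                    q[s][a] += velocity[i][s][a]
    return gbest, gbest_score

if __name__ == "__main__":
    best_q, best_score = swarm_q_learning()
    print("best single-episode reward:", best_score)
```

Scoring a table by one noisy episode is a simplification; averaging several evaluation episodes without exploration would give a steadier estimate of which agent's table deserves to be the global best.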



