响应式交通信号控制的近似动态规划策略

2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning Pub Date : 2007-04-01 DOI:10.1109/ADPRL.2007.368203

C. Cai

{"title":"响应式交通信号控制的近似动态规划策略","authors":"C. Cai","doi":"10.1109/ADPRL.2007.368203","DOIUrl":null,"url":null,"abstract":"This paper proposes an approximate dynamic programming strategy for responsive traffic signal control. It is the first attempt that optimizes signal control objective dynamically through adaptive approximation of value function. The proposed value function approximation is separable and exogenous factor independent. The algorithm updates the approximated value function progressively in operation, while preserving the structural property of the control problem. The convergence and performance of the algorithm have been tested in a range of experiments. It has been concluded that the new strategy is as good as the best existing control strategies while being efficient and simple in computation. It also has the potential of being extended to multi-phase signal control at isolate junction and to decentralized network operation","PeriodicalId":152536,"journal":{"name":"2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"An Approximate Dynamic Programming Strategy for Responsive Traffic Signal Control\",\"authors\":\"C. Cai\",\"doi\":\"10.1109/ADPRL.2007.368203\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an approximate dynamic programming strategy for responsive traffic signal control. It is the first attempt that optimizes signal control objective dynamically through adaptive approximation of value function. The proposed value function approximation is separable and exogenous factor independent. The algorithm updates the approximated value function progressively in operation, while preserving the structural property of the control problem. The convergence and performance of the algorithm have been tested in a range of experiments. It has been concluded that the new strategy is as good as the best existing control strategies while being efficient and simple in computation. It also has the potential of being extended to multi-phase signal control at isolate junction and to decentralized network operation\",\"PeriodicalId\":152536,\"journal\":{\"name\":\"2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ADPRL.2007.368203\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ADPRL.2007.368203","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

摘要

提出了一种响应式交通信号控制的近似动态规划策略。通过值函数的自适应逼近来动态优化信号控制目标是首次尝试。所提出的值函数近似是可分离的，与外生因素无关。该算法在保持控制问题的结构特性的同时，逐步更新逼近值函数。该算法的收敛性和性能在一系列实验中得到了验证。结果表明，该控制策略与现有的最佳控制策略一样有效，且计算简单。它还具有推广到孤立结点多相信号控制和分散网络运行的潜力

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

An Approximate Dynamic Programming Strategy for Responsive Traffic Signal Control

This paper proposes an approximate dynamic programming strategy for responsive traffic signal control. It is the first attempt that optimizes signal control objective dynamically through adaptive approximation of value function. The proposed value function approximation is separable and exogenous factor independent. The algorithm updates the approximated value function progressively in operation, while preserving the structural property of the control problem. The convergence and performance of the algorithm have been tested in a range of experiments. It has been concluded that the new strategy is as good as the best existing control strategies while being efficient and simple in computation. It also has the potential of being extended to multi-phase signal control at isolate junction and to decentralized network operation

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning

自引率

0.00%

发文量