{"title":"A reinforcement learning algorithm for optimal motion of car-like vehicles","authors":"T. Martínez-Marín","doi":"10.1109/ITSC.2004.1398870","DOIUrl":null,"url":null,"abstract":"We propose a new reinforcement learning algorithm to obtain the optimal motion of a vehicle considering kinematic and obstacle constraints. The algorithm is an extension of the CACM technique for learning the dynamic behaviour of the vehicle instead of using its analytical state equations. The method overcomes some limitations of reinforcement learning techniques when they are employed in applications with continuous non-linear systems, such as car-like vehicles. In particular, a good approximation to the optimal behaviour is obtained by a lookup table without of using function approximation. Simulation results of learning optimal motion in the presence of obstacles are reported to show the satisfactory performance of the method compared with the popular Q-learning algorithm.","PeriodicalId":239269,"journal":{"name":"Proceedings. The 7th International IEEE Conference on Intelligent Transportation Systems (IEEE Cat. No.04TH8749)","volume":"164 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. The 7th International IEEE Conference on Intelligent Transportation Systems (IEEE Cat. No.04TH8749)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITSC.2004.1398870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
We propose a new reinforcement learning algorithm to obtain the optimal motion of a vehicle considering kinematic and obstacle constraints. The algorithm is an extension of the CACM technique for learning the dynamic behaviour of the vehicle instead of using its analytical state equations. The method overcomes some limitations of reinforcement learning techniques when they are employed in applications with continuous non-linear systems, such as car-like vehicles. In particular, a good approximation to the optimal behaviour is obtained by a lookup table without of using function approximation. Simulation results of learning optimal motion in the presence of obstacles are reported to show the satisfactory performance of the method compared with the popular Q-learning algorithm.