M. G. García-Hernández, J. Ruiz-Pinales, S. Ledesma-Orozco, J. Aviña-Cervantes, E. Onaindía, A. Reyes-Ballesteros
Title: Combination of acceleration procedures for solving stochastic shortest-path Markov decision processes
Published in: 2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering, pp. 89-94
Publication date: 2010-11-01
DOI: 10.1109/ISKE.2010.5680801 (https://doi.org/10.1109/ISKE.2010.5680801)
Citations: 2
Abstract
In this paper we propose combining accelerated variants of value iteration with improved prioritized sweeping to solve stochastic shortest-path Markov decision processes. To find the fastest solution method, we tested asynchronous updates, prioritization, and prioritized sweeping. A topological reordering algorithm was also compared with a static reordering algorithm. Experimental results obtained on a finite state- and action-space stochastic shortest-path problem are presented.
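To make the techniques named in the abstract concrete, the following is a minimal sketch of textbook prioritized-sweeping value iteration on a tiny stochastic shortest-path problem. It is not the authors' improved variant, and it does not include their reordering algorithms. The function name `prioritized_sweeping_vi`, the dictionary-based model layout, the tolerance `eps`, and the three-state example are all assumptions made for illustration.

```python
import heapq


def prioritized_sweeping_vi(states, transitions, cost, goal,
                            eps=1e-9, max_updates=100_000):
    """Generic prioritized-sweeping value iteration (illustrative sketch).

    transitions[(s, a)] is a list of (next_state, probability) pairs and
    cost[(s, a)] is the positive cost of taking action a in state s.
    Both layouts are assumptions of this sketch.
    """
    V = {s: 0.0 for s in states}

    # Predecessor map: which states can reach s in one step.
    preds = {s: set() for s in states}
    for (s, _a), outcomes in transitions.items():
        for s2, p in outcomes:
            if p > 0:
                preds[s2].add(s)

    def backup(s):
        # Bellman backup: minimum expected cost-to-go over available actions.
        if s == goal:
            return 0.0
        return min(cost[sa] + sum(p * V[s2] for s2, p in outcomes)
                   for sa, outcomes in transitions.items() if sa[0] == s)

    # Seed the queue with every non-goal state whose Bellman residual
    # exceeds eps. heapq is a min-heap, so priorities are negated.
    heap = []
    for s in states:
        if s != goal:
            residual = abs(backup(s) - V[s])
            if residual > eps:
                heapq.heappush(heap, (-residual, s))

    updates = 0
    while heap and updates < max_updates:
        _, s = heapq.heappop(heap)
        V[s] = backup(s)  # asynchronous, in-place update
        updates += 1
        # Only predecessors of s can have a changed residual.
        for q in preds[s]:
            if q == goal:
                continue
            residual = abs(backup(q) - V[q])
            if residual > eps:
                heapq.heappush(heap, (-residual, q))
    return V


# Hypothetical example: states 0 -> 1 -> 2, where 2 is the goal.
states = [0, 1, 2]
transitions = {
    (0, "go"): [(1, 0.5), (0, 0.5)],    # advance with probability 0.5
    (0, "jump"): [(2, 0.2), (0, 0.8)],  # long shot straight to the goal
    (1, "go"): [(2, 0.5), (1, 0.5)],
}
cost = {key: 1.0 for key in transitions}

V = prioritized_sweeping_vi(states, transitions, cost, goal=2)
print({s: round(v, 6) for s, v in V.items()})
```

The queue orders states by Bellman residual, so the largest value changes propagate first. Updating `V` in place is what makes the sweep asynchronous. Stale queue entries are tolerated here for simplicity: backing up a state whose residual has already dropped is harmless, only wasteful.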