M. G. García-Hernández, J. Ruiz-Pinales, S. Ledesma-Orozco, J. Aviña-Cervantes, E. Onaindía, A. Reyes-Ballesteros
{"title":"求解随机最短路径马尔可夫决策过程的组合加速程序","authors":"M. G. García-Hernández, J. Ruiz-Pinales, S. Ledesma-Orozco, J. Aviña-Cervantes, E. Onaindía, A. Reyes-Ballesteros","doi":"10.1109/ISKE.2010.5680801","DOIUrl":null,"url":null,"abstract":"In this paper we propose the combination of accelerated variants of value iteration with improved prioritized sweeping for the solution of stochastic shortest path Markov decision processes. For the fastest solution, asynchronous updates, prioritization and prioritized sweeping have been tested. A topological reordering algorithm was also compared with a static reordering algorithm. Experimental results obtained on afinite state and action-space stochastic shortest path problem are presented.","PeriodicalId":6417,"journal":{"name":"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering","volume":"28 1","pages":"89-94"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Combination of acceleration procedures for solving stochastic shortest-path Markov decision processes\",\"authors\":\"M. G. García-Hernández, J. Ruiz-Pinales, S. Ledesma-Orozco, J. Aviña-Cervantes, E. Onaindía, A. Reyes-Ballesteros\",\"doi\":\"10.1109/ISKE.2010.5680801\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we propose the combination of accelerated variants of value iteration with improved prioritized sweeping for the solution of stochastic shortest path Markov decision processes. For the fastest solution, asynchronous updates, prioritization and prioritized sweeping have been tested. A topological reordering algorithm was also compared with a static reordering algorithm. Experimental results obtained on afinite state and action-space stochastic shortest path problem are presented.\",\"PeriodicalId\":6417,\"journal\":{\"name\":\"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering\",\"volume\":\"28 1\",\"pages\":\"89-94\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISKE.2010.5680801\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE.2010.5680801","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Combination of acceleration procedures for solving stochastic shortest-path Markov decision processes
In this paper we propose the combination of accelerated variants of value iteration with improved prioritized sweeping for the solution of stochastic shortest path Markov decision processes. For the fastest solution, asynchronous updates, prioritization and prioritized sweeping have been tested. A topological reordering algorithm was also compared with a static reordering algorithm. Experimental results obtained on afinite state and action-space stochastic shortest path problem are presented.