{"title":"Successive approximation algorithms for stochastic games-Numerical comparisons","authors":"P. Dasgupta, W. Gruver","doi":"10.1109/CDC.1978.268108","DOIUrl":null,"url":null,"abstract":"In this paper we treat algorithmic methods for solution of stochastic games. A value iteration method incorporating bounds in a test for suboptimality is compared with policy iteration for three types of transition probability matrices. Numerical experiments demonstrate the superiority of the value iteration technique for problems with special structure.","PeriodicalId":375119,"journal":{"name":"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes","volume":"35 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.1978.268108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper we treat algorithmic methods for solution of stochastic games. A value iteration method incorporating bounds in a test for suboptimality is compared with policy iteration for three types of transition probability matrices. Numerical experiments demonstrate the superiority of the value iteration technique for problems with special structure.