{"title":"随机博弈的逐次逼近算法-数值比较","authors":"P. Dasgupta, W. Gruver","doi":"10.1109/CDC.1978.268108","DOIUrl":null,"url":null,"abstract":"In this paper we treat algorithmic methods for solution of stochastic games. A value iteration method incorporating bounds in a test for suboptimality is compared with policy iteration for three types of transition probability matrices. Numerical experiments demonstrate the superiority of the value iteration technique for problems with special structure.","PeriodicalId":375119,"journal":{"name":"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes","volume":"35 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Successive approximation algorithms for stochastic games-Numerical comparisons\",\"authors\":\"P. Dasgupta, W. Gruver\",\"doi\":\"10.1109/CDC.1978.268108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper we treat algorithmic methods for solution of stochastic games. A value iteration method incorporating bounds in a test for suboptimality is compared with policy iteration for three types of transition probability matrices. Numerical experiments demonstrate the superiority of the value iteration technique for problems with special structure.\",\"PeriodicalId\":375119,\"journal\":{\"name\":\"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes\",\"volume\":\"35 4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CDC.1978.268108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDC.1978.268108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Successive approximation algorithms for stochastic games-Numerical comparisons
In this paper we treat algorithmic methods for solution of stochastic games. A value iteration method incorporating bounds in a test for suboptimality is compared with policy iteration for three types of transition probability matrices. Numerical experiments demonstrate the superiority of the value iteration technique for problems with special structure.