{"title":"Borel空间约束马尔可夫决策过程的有限状态逼近","authors":"Naci Saldi, S. Yüksel, T. Linder","doi":"10.1109/ALLERTON.2015.7447055","DOIUrl":null,"url":null,"abstract":"We consider the finite-state approximation of a discrete-time constrained Markov decision process with compact state space, under the discounted cost criterion. Using the linear programming formulation of the constrained problem, we prove the convergence of the optimal value function of the finite-state model to the optimal value function of the original model. Under further continuity condition on the transition probability of the original model, we also establish a method to compute approximately optimal policies.","PeriodicalId":112948,"journal":{"name":"2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Finite-state approximations to constrained Markov decision processes with Borel spaces\",\"authors\":\"Naci Saldi, S. Yüksel, T. Linder\",\"doi\":\"10.1109/ALLERTON.2015.7447055\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the finite-state approximation of a discrete-time constrained Markov decision process with compact state space, under the discounted cost criterion. Using the linear programming formulation of the constrained problem, we prove the convergence of the optimal value function of the finite-state model to the optimal value function of the original model. Under further continuity condition on the transition probability of the original model, we also establish a method to compute approximately optimal policies.\",\"PeriodicalId\":112948,\"journal\":{\"name\":\"2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ALLERTON.2015.7447055\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2015.7447055","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Finite-state approximations to constrained Markov decision processes with Borel spaces
We consider the finite-state approximation of a discrete-time constrained Markov decision process with compact state space, under the discounted cost criterion. Using the linear programming formulation of the constrained problem, we prove the convergence of the optimal value function of the finite-state model to the optimal value function of the original model. Under further continuity condition on the transition probability of the original model, we also establish a method to compute approximately optimal policies.