Monotonicity in Markov Reward and Decision Chains: Theory and Applications

Found. Trends Stoch. Syst. Pub Date: 2007-06-04 DOI: 10.1561/0900000002

G. Koole


Citations: 136

Abstract

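This paper focuses on monotonicity results for dynamic systems that take values in the natural numbers or in multidimensional lattices. The results are mostly formulated in terms of controlled queueing systems, but there are also applications to maintenance systems, revenue management, and so forth. We concentrate on results that are obtained by inductively proving properties of the dynamic programming value function. We give a framework for using this method that unifies results obtained for different models. We also give a comprehensive overview of the results that can be obtained through it, in which we discuss not only (partial) characterizations of optimal policies but also applications of monotonicity to optimization problems and the comparison of systems.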
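As an illustration of what "inductively proving properties of the dynamic programming value function" can look like in practice, here is a minimal numerical sketch. It is not taken from the paper: the model (a uniformized single-server queue with admission control, a linear holding cost, and a fixed rejection cost), the buffer truncation, and all names and parameter values (`lam`, `mu`, `reject_cost`, `n_max`, `horizon`) are assumptions chosen for illustration. The sketch applies the dynamic programming recursion step by step and checks after every step that the value function stays increasing and convex in the queue length, which mirrors the induction step of such a proof. Exact rational arithmetic is used so the checks are not disturbed by rounding.

```python
from fractions import Fraction


def value_iteration_step(v, lam, mu, reject_cost):
    """Apply one dynamic programming step to the value function v.

    Assumed model (hypothetical, for illustration only):
    state x = number of customers, x in 0..len(v)-1;
    with probability lam a customer arrives and is admitted or rejected,
    with probability mu a service completes (no effect when x == 0);
    each step incurs a holding cost of x.
    """
    n_max = len(v) - 1
    new_v = []
    for x in range(n_max + 1):
        reject = reject_cost + v[x]
        # At the truncation boundary the arrival must be rejected.
        arrival = min(v[x + 1], reject) if x < n_max else reject
        departure = v[max(x - 1, 0)]
        new_v.append(x + lam * arrival + mu * departure)
    return new_v


def is_increasing(v):
    """True if v(x) <= v(x+1) for all x."""
    return all(a <= b for a, b in zip(v, v[1:]))


def is_convex(v):
    """True if the first differences v(x+1) - v(x) are nondecreasing."""
    return all(v[i + 1] - v[i] <= v[i + 2] - v[i + 1]
               for i in range(len(v) - 2))


def admit_decisions(v, reject_cost):
    """For each x below the boundary: is admitting at least as good as rejecting?"""
    return [v[x + 1] <= reject_cost + v[x] for x in range(len(v) - 1)]


def run(horizon=50, n_max=20, lam=Fraction(2, 5), mu=Fraction(3, 5),
        reject_cost=Fraction(5)):
    """Iterate from v_0 = 0 and check the properties after every step."""
    v = [Fraction(0)] * (n_max + 1)
    for _ in range(horizon):
        v = value_iteration_step(v, lam, mu, reject_cost)
        # Numerical counterpart of the induction step.
        assert is_increasing(v) and is_convex(v)
    return v, admit_decisions(v, reject_cost)


if __name__ == "__main__":
    v, admit = run()
    print("admit in state x:", admit)
```

Because convexity means the differences v(x+1) - v(x) are nondecreasing, the admit decisions in this sketch can only switch from admit to reject as x grows, which is the kind of threshold structure that monotonicity arguments are typically used to establish.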


