Expectations or Guarantees? I Want It All! A crossroad between games and MDPs

Hkhmt m`Sr Pub Date : 2014-04-03 DOI:10.4204/EPTCS.146.1
V. Bruyère, E. Filiot, Mickael Randour, Jean-François Raskin
{"title":"Expectations or Guarantees? I Want It All! A crossroad between games and MDPs","authors":"V. Bruyère, E. Filiot, Mickael Randour, Jean-François Raskin","doi":"10.4204/EPTCS.146.1","DOIUrl":null,"url":null,"abstract":"When reasoning about the strategic capabilities of an agent, it is important to consider the nature of its adversaries. In the particular context of controller synthesis for quantitative specifications, the usual problem is to devise a strategy for a reactive system which yields some desired performance, taking into account the possible impact of the environment of the system. There are at least two ways to look at this environment. In the classical analysis of two-player quantitative games, the environment is purely antagonistic and the problem is to provide strict performance guarantees. In Markov decision processes, the environment is seen as purely stochastic: the aim is then to optimize the expected payoff, with no guarantee on individual outcomes. In this expository work, we report on recent results [10, 9] introducing the beyond worst-case synthesis problem, which is to construct strategies that guarantee some quantitative requirement in the worst-case while providing an higher expected value against a particular stochastic model of the environment given as input. This problem is relevant to produce system controllers that provide nice expected performance in the everyday situation while ensuring a strict (but relaxed) performance threshold even in the event of very bad (while unlikely) circumstances. It has been studied for both the mean-payoff and the shortest path quantitative measures.","PeriodicalId":53035,"journal":{"name":"Hkhmt m`Sr","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hkhmt m`Sr","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4204/EPTCS.146.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

When reasoning about the strategic capabilities of an agent, it is important to consider the nature of its adversaries. In the particular context of controller synthesis for quantitative specifications, the usual problem is to devise a strategy for a reactive system which yields some desired performance, taking into account the possible impact of the environment of the system. There are at least two ways to look at this environment. In the classical analysis of two-player quantitative games, the environment is purely antagonistic and the problem is to provide strict performance guarantees. In Markov decision processes, the environment is seen as purely stochastic: the aim is then to optimize the expected payoff, with no guarantee on individual outcomes. In this expository work, we report on recent results [10, 9] introducing the beyond worst-case synthesis problem, which is to construct strategies that guarantee some quantitative requirement in the worst-case while providing an higher expected value against a particular stochastic model of the environment given as input. This problem is relevant to produce system controllers that provide nice expected performance in the everyday situation while ensuring a strict (but relaxed) performance threshold even in the event of very bad (while unlikely) circumstances. It has been studied for both the mean-payoff and the shortest path quantitative measures.
期望还是保证?我想要一切!游戏和mdp之间的十字路口
在对代理的战略能力进行推理时,考虑其对手的性质是很重要的。在定量规范的控制器综合的特定背景下,通常的问题是为一个产生某些期望性能的反应系统设计一个策略,考虑到系统环境的可能影响。至少有两种方式来看待这种环境。在经典的双人定量博弈分析中,环境是纯粹对抗性的,问题是提供严格的性能保证。在马尔可夫决策过程中,环境被视为纯粹的随机:目标是优化预期收益,而不保证个人结果。在这篇说说性的工作中,我们报告了最近的结果[10,9],介绍了超越最坏情况综合问题,该问题是构建策略,保证在最坏情况下的一些定量要求,同时针对给定的环境的特定随机模型提供更高的期望值作为输入。这个问题与生产在日常情况下提供良好预期性能的系统控制器相关,同时确保即使在非常糟糕(虽然不太可能)的情况下也有严格(但宽松)的性能阈值。本文分别对平均收益和最短路径定量测度进行了研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
24 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信