{"title":"Markov Decision Processes with Sure Parity and Multiple Reachability Objectives","authors":"Raphaël Berthon, Joost-Pieter Katoen, Tobias Winkler","doi":"arxiv-2408.01212","DOIUrl":null,"url":null,"abstract":"This paper considers the problem of finding strategies that satisfy a mixture\nof sure and threshold objectives in Markov decision processes. We focus on a\nsingle $\\omega$-regular objective expressed as parity that must be surely met\nwhile satisfying $n$ reachability objectives towards sink states with some\nprobability thresholds too. We consider three variants of the problem: (a)\nstrict and (b) non-strict thresholds on all reachability objectives, and (c)\nmaximizing the thresholds with respect to a lexicographic order. We show that\n(a) and (c) can be reduced to solving parity games, and (b) can be solved in\n$\\sf{EXPTIME}$. Strategy complexities as well as algorithms are provided for\nall cases.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"86 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Science and Game Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.01212","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper considers the problem of finding strategies that satisfy a mixture
of sure and threshold objectives in Markov decision processes. We focus on a
single $\omega$-regular objective expressed as parity that must be surely met
while satisfying $n$ reachability objectives towards sink states with some
probability thresholds too. We consider three variants of the problem: (a)
strict and (b) non-strict thresholds on all reachability objectives, and (c)
maximizing the thresholds with respect to a lexicographic order. We show that
(a) and (c) can be reduced to solving parity games, and (b) can be solved in
$\sf{EXPTIME}$. Strategy complexities as well as algorithms are provided for
all cases.