Elthon Manhas de Freitas, Karina Valdivia Delgado, Valdinei Mendes da Silva
{"title":"Processos de decisão de Markov com sensibilidade a risco com função de utilidade exponencial: Uma revisão sistemática da literatura","authors":"Elthon Manhas de Freitas, Karina Valdivia Delgado, Valdinei Mendes da Silva","doi":"10.5753/SBSI.2017.6045","DOIUrl":null,"url":null,"abstract":"Markov Decision Process (MDP) has been used very eciently to solve sequential decision-making problems. There are problems in which dealing with the risks of the environment to obtain a reliable result is more important than maximizing the expected average return. MDPs that deal with this type of problem are called risk-sensitive Markov decision processes (RSMDP). This systematic review of the literature aims to identify the theoretical results and proposed algorithms to solve RSMDP problems that have an exponential utility function, evaluating their main characteristics, similarities, particularities and differences in order to allow the reader the knowledge of this tool of decision making for risk sensitive problems.","PeriodicalId":111716,"journal":{"name":"Brazilian Symposium on Information Systems","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brazilian Symposium on Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5753/SBSI.2017.6045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Markov Decision Process (MDP) has been used very eciently to solve sequential decision-making problems. There are problems in which dealing with the risks of the environment to obtain a reliable result is more important than maximizing the expected average return. MDPs that deal with this type of problem are called risk-sensitive Markov decision processes (RSMDP). This systematic review of the literature aims to identify the theoretical results and proposed algorithms to solve RSMDP problems that have an exponential utility function, evaluating their main characteristics, similarities, particularities and differences in order to allow the reader the knowledge of this tool of decision making for risk sensitive problems.