B. Escobedo-Trujillo, O. Hernández-Lerma, F. A. Alaffita-Hernández
{"title":"Adaptive control of diffusion processes with a discounted reward criterion","authors":"B. Escobedo-Trujillo, O. Hernández-Lerma, F. A. Alaffita-Hernández","doi":"10.4064/am2421-10-2020","DOIUrl":null,"url":null,"abstract":". The optimal control problem we are dealing with in this paper is to determine control policies that maximize a discounted reward criterion when the dynamic system evolves as a stochastic differential equation (SDE). Both the instantaneous reward function and the SDE’s drift coef-ficient may depend on an unknown parameter. We give conditions ensur-ing the existence of an asymptotically optimal policy using the so-called Principle of Estimation and Control. We illustrate our results with several examples.","PeriodicalId":52313,"journal":{"name":"Applicationes Mathematicae","volume":"29 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applicationes Mathematicae","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4064/am2421-10-2020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0
Abstract
. The optimal control problem we are dealing with in this paper is to determine control policies that maximize a discounted reward criterion when the dynamic system evolves as a stochastic differential equation (SDE). Both the instantaneous reward function and the SDE’s drift coef-ficient may depend on an unknown parameter. We give conditions ensur-ing the existence of an asymptotically optimal policy using the so-called Principle of Estimation and Control. We illustrate our results with several examples.