{"title":"A max-plus method for the approximate solution of discrete time linear regulator problems with non-quadratic terminal payoff","authors":"Huan Zhang, P. Dower","doi":"10.1137/1.9781611973273.15","DOIUrl":null,"url":null,"abstract":"Efficient Riccati equation based techniques for the approximate solution of discrete time linear regulator problems are restricted in their application to problems with quadratic terminal payoffs. Where non-quadratic terminal payoffs are required, these techniques fail due to the attendant nonquadratic value functions involved. In order to compute these non-quadratic value functions, it is often necessary to appeal directly to dynamic programming in the form of gridor element-based iterations for the value function. These iterations suffer from poor scalability with respect to problem dimension and time horizon. In this paper, a new max-plus based method is developed for the approximate solution of discrete time linear regulator problems with non-quadratic payoffs.","PeriodicalId":193106,"journal":{"name":"SIAM Conf. on Control and its Applications","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIAM Conf. on Control and its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611973273.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Efficient Riccati equation based techniques for the approximate solution of discrete time linear regulator problems are restricted in their application to problems with quadratic terminal payoffs. Where non-quadratic terminal payoffs are required, these techniques fail due to the attendant nonquadratic value functions involved. In order to compute these non-quadratic value functions, it is often necessary to appeal directly to dynamic programming in the form of gridor element-based iterations for the value function. These iterations suffer from poor scalability with respect to problem dimension and time horizon. In this paper, a new max-plus based method is developed for the approximate solution of discrete time linear regulator problems with non-quadratic payoffs.