{"title":"具有非二次终端收益的离散时间线性调节器问题近似解的极大加法","authors":"Huan Zhang, P. Dower","doi":"10.1137/1.9781611973273.15","DOIUrl":null,"url":null,"abstract":"Efficient Riccati equation based techniques for the approximate solution of discrete time linear regulator problems are restricted in their application to problems with quadratic terminal payoffs. Where non-quadratic terminal payoffs are required, these techniques fail due to the attendant nonquadratic value functions involved. In order to compute these non-quadratic value functions, it is often necessary to appeal directly to dynamic programming in the form of gridor element-based iterations for the value function. These iterations suffer from poor scalability with respect to problem dimension and time horizon. In this paper, a new max-plus based method is developed for the approximate solution of discrete time linear regulator problems with non-quadratic payoffs.","PeriodicalId":193106,"journal":{"name":"SIAM Conf. on Control and its Applications","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A max-plus method for the approximate solution of discrete time linear regulator problems with non-quadratic terminal payoff\",\"authors\":\"Huan Zhang, P. Dower\",\"doi\":\"10.1137/1.9781611973273.15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Efficient Riccati equation based techniques for the approximate solution of discrete time linear regulator problems are restricted in their application to problems with quadratic terminal payoffs. Where non-quadratic terminal payoffs are required, these techniques fail due to the attendant nonquadratic value functions involved. In order to compute these non-quadratic value functions, it is often necessary to appeal directly to dynamic programming in the form of gridor element-based iterations for the value function. These iterations suffer from poor scalability with respect to problem dimension and time horizon. In this paper, a new max-plus based method is developed for the approximate solution of discrete time linear regulator problems with non-quadratic payoffs.\",\"PeriodicalId\":193106,\"journal\":{\"name\":\"SIAM Conf. on Control and its Applications\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SIAM Conf. on Control and its Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1137/1.9781611973273.15\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SIAM Conf. on Control and its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1137/1.9781611973273.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A max-plus method for the approximate solution of discrete time linear regulator problems with non-quadratic terminal payoff
Efficient Riccati equation based techniques for the approximate solution of discrete time linear regulator problems are restricted in their application to problems with quadratic terminal payoffs. Where non-quadratic terminal payoffs are required, these techniques fail due to the attendant nonquadratic value functions involved. In order to compute these non-quadratic value functions, it is often necessary to appeal directly to dynamic programming in the form of gridor element-based iterations for the value function. These iterations suffer from poor scalability with respect to problem dimension and time horizon. In this paper, a new max-plus based method is developed for the approximate solution of discrete time linear regulator problems with non-quadratic payoffs.