{"title":"通过路径积分控制的离散时间随机 LQR 及其样本复杂性分析","authors":"Apurva Patil;Grani A. Hanasusanto;Takashi Tanaka","doi":"10.1109/LCSYS.2024.3413869","DOIUrl":null,"url":null,"abstract":"In this letter, we derive the path integral control algorithm to solve a discrete-time stochastic Linear Quadratic Regulator (LQR) problem and carry out its sample complexity analysis. While the stochastic LQR problem can be efficiently solved by the standard backward Riccati recursion, our primary focus in this letter is to establish the foundation for a sample complexity analysis of the path integral method when the analytical expressions of optimal control law and the cost are available. Specifically, we derive a bound on the error between the optimal LQR input and the input computed by the path integral method as a function of the sample size. Our analysis reveals that the sample size required exhibits a logarithmic dependence on the dimension of the control input. Lastly, we formulate a chance-constrained optimization problem whose solution quantifies the worst-case control performance of the path integral approach.","PeriodicalId":37235,"journal":{"name":"IEEE Control Systems Letters","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Discrete-Time Stochastic LQR via Path Integral Control and Its Sample Complexity Analysis\",\"authors\":\"Apurva Patil;Grani A. Hanasusanto;Takashi Tanaka\",\"doi\":\"10.1109/LCSYS.2024.3413869\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this letter, we derive the path integral control algorithm to solve a discrete-time stochastic Linear Quadratic Regulator (LQR) problem and carry out its sample complexity analysis. While the stochastic LQR problem can be efficiently solved by the standard backward Riccati recursion, our primary focus in this letter is to establish the foundation for a sample complexity analysis of the path integral method when the analytical expressions of optimal control law and the cost are available. Specifically, we derive a bound on the error between the optimal LQR input and the input computed by the path integral method as a function of the sample size. Our analysis reveals that the sample size required exhibits a logarithmic dependence on the dimension of the control input. Lastly, we formulate a chance-constrained optimization problem whose solution quantifies the worst-case control performance of the path integral approach.\",\"PeriodicalId\":37235,\"journal\":{\"name\":\"IEEE Control Systems Letters\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Control Systems Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10556579/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Control Systems Letters","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10556579/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Discrete-Time Stochastic LQR via Path Integral Control and Its Sample Complexity Analysis
In this letter, we derive the path integral control algorithm to solve a discrete-time stochastic Linear Quadratic Regulator (LQR) problem and carry out its sample complexity analysis. While the stochastic LQR problem can be efficiently solved by the standard backward Riccati recursion, our primary focus in this letter is to establish the foundation for a sample complexity analysis of the path integral method when the analytical expressions of optimal control law and the cost are available. Specifically, we derive a bound on the error between the optimal LQR input and the input computed by the path integral method as a function of the sample size. Our analysis reveals that the sample size required exhibits a logarithmic dependence on the dimension of the control input. Lastly, we formulate a chance-constrained optimization problem whose solution quantifies the worst-case control performance of the path integral approach.