Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?

Conference on Learning for Dynamics & Control Pub Date : 2022-12-30 DOI:10.48550/arXiv.2212.14511

Yi Tian, K. Zhang, Russ Tedrake, S. Sra

{"title":"Can Direct Latent Model Learning Solve Linear Quadratic Gaussian Control?","authors":"Yi Tian, K. Zhang, Russ Tedrake, S. Sra","doi":"10.48550/arXiv.2212.14511","DOIUrl":null,"url":null,"abstract":"We study the task of learning state representations from potentially high-dimensional observations, with the goal of controlling an unknown partially observable system. We pursue a direct latent model learning approach, where a dynamic model in some latent state space is learned by predicting quantities directly related to planning (e.g., costs) without reconstructing the observations. In particular, we focus on an intuitive cost-driven state representation learning method for solving Linear Quadratic Gaussian (LQG) control, one of the most fundamental partially observable control problems. As our main results, we establish finite-sample guarantees of finding a near-optimal state representation function and a near-optimal controller using the directly learned latent model. To the best of our knowledge, despite various empirical successes, prior to this work it was unclear if such a cost-driven latent model learner enjoys finite-sample guarantees. Our work underscores the value of predicting multi-step costs, an idea that is key to our theory, and notably also an idea that is known to be empirically valuable for learning state representations.","PeriodicalId":268449,"journal":{"name":"Conference on Learning for Dynamics & Control","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Learning for Dynamics & Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.14511","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

We study the task of learning state representations from potentially high-dimensional observations, with the goal of controlling an unknown partially observable system. We pursue a direct latent model learning approach, where a dynamic model in some latent state space is learned by predicting quantities directly related to planning (e.g., costs) without reconstructing the observations. In particular, we focus on an intuitive cost-driven state representation learning method for solving Linear Quadratic Gaussian (LQG) control, one of the most fundamental partially observable control problems. As our main results, we establish finite-sample guarantees of finding a near-optimal state representation function and a near-optimal controller using the directly learned latent model. To the best of our knowledge, despite various empirical successes, prior to this work it was unclear if such a cost-driven latent model learner enjoys finite-sample guarantees. Our work underscores the value of predicting multi-step costs, an idea that is key to our theory, and notably also an idea that is known to be empirically valuable for learning state representations.

查看原文本刊更多论文

直接潜模型学习能解决线性二次高斯控制吗?

我们研究从潜在的高维观测中学习状态表示的任务，目标是控制未知的部分可观察系统。我们采用直接潜在模型学习方法，通过预测与规划(例如成本)直接相关的数量来学习某些潜在状态空间中的动态模型，而无需重建观测值。我们特别关注一种直观的成本驱动状态表示学习方法，用于解决线性二次高斯(LQG)控制，这是最基本的部分可观察控制问题之一。作为我们的主要结果，我们使用直接学习的潜在模型建立了寻找近最优状态表示函数和近最优控制器的有限样本保证。据我们所知，尽管有各种各样的经验成功，但在这项工作之前，尚不清楚这种成本驱动的潜在模型学习者是否享有有限样本保证。我们的工作强调了预测多步成本的价值，这是我们理论的关键思想，也是一个已知的对学习状态表示具有经验价值的思想。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference on Learning for Dynamics & Control

自引率

0.00%

发文量