{"title":"学习不稳定的连续时间随机线性控制系统","authors":"Reza Sadeghi Hafshejani, Mohamad Kazem Shirani Fradonbeh","doi":"arxiv-2409.11327","DOIUrl":null,"url":null,"abstract":"We study the problem of system identification for stochastic continuous-time\ndynamics, based on a single finite-length state trajectory. We present a method\nfor estimating the possibly unstable open-loop matrix by employing properly\nrandomized control inputs. Then, we establish theoretical performance\nguarantees showing that the estimation error decays with trajectory length, a\nmeasure of excitability, and the signal-to-noise ratio, while it grows with\ndimension. Numerical illustrations that showcase the rates of learning the\ndynamics, will be provided as well. To perform the theoretical analysis, we\ndevelop new technical tools that are of independent interest. That includes\nnon-asymptotic stochastic bounds for highly non-stationary martingales and\ngeneralized laws of iterated logarithms, among others.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"119 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Unstable Continuous-Time Stochastic Linear Control Systems\",\"authors\":\"Reza Sadeghi Hafshejani, Mohamad Kazem Shirani Fradonbeh\",\"doi\":\"arxiv-2409.11327\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We study the problem of system identification for stochastic continuous-time\\ndynamics, based on a single finite-length state trajectory. We present a method\\nfor estimating the possibly unstable open-loop matrix by employing properly\\nrandomized control inputs. Then, we establish theoretical performance\\nguarantees showing that the estimation error decays with trajectory length, a\\nmeasure of excitability, and the signal-to-noise ratio, while it grows with\\ndimension. Numerical illustrations that showcase the rates of learning the\\ndynamics, will be provided as well. To perform the theoretical analysis, we\\ndevelop new technical tools that are of independent interest. That includes\\nnon-asymptotic stochastic bounds for highly non-stationary martingales and\\ngeneralized laws of iterated logarithms, among others.\",\"PeriodicalId\":501340,\"journal\":{\"name\":\"arXiv - STAT - Machine Learning\",\"volume\":\"119 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Machine Learning\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11327\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11327","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Unstable Continuous-Time Stochastic Linear Control Systems
We study the problem of system identification for stochastic continuous-time
dynamics, based on a single finite-length state trajectory. We present a method
for estimating the possibly unstable open-loop matrix by employing properly
randomized control inputs. Then, we establish theoretical performance
guarantees showing that the estimation error decays with trajectory length, a
measure of excitability, and the signal-to-noise ratio, while it grows with
dimension. Numerical illustrations that showcase the rates of learning the
dynamics, will be provided as well. To perform the theoretical analysis, we
develop new technical tools that are of independent interest. That includes
non-asymptotic stochastic bounds for highly non-stationary martingales and
generalized laws of iterated logarithms, among others.