Stochastic approximation in non-Markovian environments
Author: Vivek S. Borkar
Journal: Systems & Control Letters, volume 205, Article 106250
Published: 2025-10-10
DOI: 10.1016/j.sysconle.2025.106250
URL: https://www.sciencedirect.com/science/article/pii/S0167691125002324
Journal impact factor: 2.5 (JCR Q3, Automation & Control Systems)
Citations: 0
Abstract
We analyze a stochastic approximation scheme driven by non-Markovian noise in addition to the standard martingale difference noise. Our main result is that it behaves like a stochastic approximation driven by an ‘equivalent’ Markov noise, i.e., it has the same convergence properties, although the convergence rate may degrade. Some implications for reinforcement learning algorithms are discussed.
About the journal:
Founded in 1981 by two of the pre-eminent control theorists, Roger Brockett and Jan Willems, Systems & Control Letters is one of the leading journals in the field of control theory. The aim of the journal is to allow dissemination of relatively concise but highly original contributions whose high initial quality enables a relatively rapid review process. All aspects of the fields of systems and control are covered, especially mathematically-oriented and theoretical papers that have a clear relevance to engineering, physical and biological sciences, and even economics. Application-oriented papers with sophisticated and rigorous mathematical elements are also welcome.