{"title":"Data-Driven Stochastic Game Theoretic Differential Dynamic Programming","authors":"Mohammad Sarbaz, Wei Sun","doi":"10.1002/rnc.7984","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>This paper introduces a novel approach for designing optimal control using data-driven Stochastic Game Theoretic Differential Dynamic Programming (SGT-DDP). The proposed method addresses unknown stochastic systems by approximating both drift and diffusion dynamics. The drift dynamics is estimated via Gaussian Process Regression (GPR) using input–output data. The diffusion dynamics is approximated from the noise data, which is extracted through subtracting the noisy output from the smoothed output. Subsequently, the binning method is combined with GPR to obtain the approximate model of the diffusion dynamics. These approximations are integrated into the SGT-DDP framework to compute optimal control policies. Simulations on benchmark nonlinear systems under unknown dynamics demonstrate the effectiveness of the method.</p>\n </div>","PeriodicalId":50291,"journal":{"name":"International Journal of Robust and Nonlinear Control","volume":"35 13","pages":"5343-5354"},"PeriodicalIF":3.2000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Robust and Nonlinear Control","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/rnc.7984","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper introduces a novel approach for designing optimal control using data-driven Stochastic Game Theoretic Differential Dynamic Programming (SGT-DDP). The proposed method addresses unknown stochastic systems by approximating both drift and diffusion dynamics. The drift dynamics is estimated via Gaussian Process Regression (GPR) using input–output data. The diffusion dynamics is approximated from the noise data, which is extracted through subtracting the noisy output from the smoothed output. Subsequently, the binning method is combined with GPR to obtain the approximate model of the diffusion dynamics. These approximations are integrated into the SGT-DDP framework to compute optimal control policies. Simulations on benchmark nonlinear systems under unknown dynamics demonstrate the effectiveness of the method.
期刊介绍:
Papers that do not include an element of robust or nonlinear control and estimation theory will not be considered by the journal, and all papers will be expected to include significant novel content. The focus of the journal is on model based control design approaches rather than heuristic or rule based methods. Papers on neural networks will have to be of exceptional novelty to be considered for the journal.