An algorithm for stochastic control through dynamic programming techniques

P. Chen
IRE Transactions on Automatic Control
Published: 1962-10-01
DOI: 10.1109/TAC.1962.1105502 (https://doi.org/10.1109/TAC.1962.1105502)
Citation count: 2

Abstract

An algorithm based on the concept of state and on dynamic programming is derived for designing an optimum controller for a linear plant subject to noise. The controller is optimal in the sense that it minimizes the expected mean quadratic performance index (EMQPI) defined in the paper. The algorithm generates the sequence of control signals that minimizes the EMQPI and also gives the minimum value of the EMQPI attained by that sequence. The control signal is found to consist of two components: 1) a linear combination of the system state variables, and 2) a noise-balance component that minimizes the noise-induced deviation of the actual plant output from the desired output. An example is given to illustrate the iterative procedure and the asymptotic behavior of the algorithm. The design is optimal for a class of system inputs and is applicable to both sampled-data and continuous systems. The design procedure is developed to make full use of a digital computer. The application of the basic principles of dynamic programming to stochastic control processes is illustrated in introductory form, so the paper will be of interest to control engineers who wish to familiarize themselves with dynamic programming techniques.
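
The two-component structure of the control signal can be illustrated with a small sketch. The Python code below is not the algorithm as published in the paper; it is a textbook-style backward dynamic-programming recursion under assumptions chosen here for illustration: a scalar plant x[k+1] = a*x[k] + b*u[k] + w[k], noise w[k] with known mean and variance, and a quadratic cost with weights q and r. The function name and all parameter names are hypothetical. Under these assumptions the optimal control takes the form u[k] = -K[k]*x[k] - f[k], where K[k] plays the role of the state-feedback component and f[k] the role of a noise-balance component that offsets the noise mean.

```python
def stochastic_lq_gains(a, b, q, r, noise_mean, noise_var, horizon):
    """Backward dynamic-programming recursion for a scalar linear plant.

    Assumed model (illustrative, not taken from the paper):
        x[k+1] = a*x[k] + b*u[k] + w[k],  E[w] = noise_mean, Var[w] = noise_var
    Assumed cost:
        E[ sum_{k<N} (q*x[k]^2 + r*u[k]^2) + q*x[N]^2 ]
    The value function is kept in the form V_k(x) = p*x^2 + 2*g*x + c.

    Returns (gains, (p0, g0, c0)), where gains[k] = (K_k, f_k) and the
    control at stage k is u[k] = -K_k*x[k] - f_k.
    """
    p, g, c = q, 0.0, 0.0          # terminal value function V_N(x) = q*x^2
    gains = []
    for _ in range(horizon):       # iterate from stage N-1 down to stage 0
        d = r + p * b * b
        h = p * noise_mean + g     # linear term induced by the noise mean
        feedback = p * a * b / d   # K_k: gain on the state
        balance = b * h / d        # f_k: offset compensating the noise mean
        gains.append((feedback, balance))
        # Update the value-function coefficients using the old p and g.
        c = c + p * (noise_mean ** 2 + noise_var) + 2.0 * g * noise_mean \
            - (b * h) ** 2 / d
        g = a * h * r / d
        p = q + a * a * p * r / d  # scalar Riccati update
    gains.reverse()                # gains[0] now belongs to stage 0
    return gains, (p, g, c)


if __name__ == "__main__":
    gains, (p0, g0, c0) = stochastic_lq_gains(
        a=1.0, b=1.0, q=1.0, r=1.0,
        noise_mean=0.2, noise_var=0.1, horizon=30,
    )
    first_feedback, first_balance = gains[0]
    print("stage-0 feedback gain:", first_feedback)
    print("stage-0 noise-balance term:", first_balance)
    print("minimum expected cost from x0 = 1:", p0 + 2.0 * g0 + c0)
```

In this sketch the noise variance affects only the constant term of the minimum cost, while the noise mean is what produces a nonzero balance term. For a long horizon the stage-0 gains settle toward constant values, which is one simple way to observe the kind of asymptotic behavior the abstract mentions.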