Towards one nonconvex connected Markov chain control problem. An approach to numerical solution

B. Miller, K. Stepanyan, A. Miller, A. Popov
{"title":"Towards one nonconvex connected Markov chain control problem. An approach to numerical solution","authors":"B. Miller, K. Stepanyan, A. Miller, A. Popov","doi":"10.1109/ANZCC.2018.8606576","DOIUrl":null,"url":null,"abstract":"The control problems for Markov chains in continuous time usually can be reduced to the solution of a system of ordinary differential equations, which is an analog of the dynamic programming equation. This system contains, as an obligatory element, the solution of the minimization problem on the set of admissible controls at each instant of time and the success in obtaining a numerical solution depends on how effective this optimization procedure is, because it must be performed for all MC states at any time. Actually, afterward the problem of integrating a system of differential equations is not difficult, since it does not require the extra high accuracy. In one of our previous works, we tried to obtain a solution to the problem of managing the dams system, however even with a small number of dams an increase in the number of states leads to a serious increase in the time of the program performance with usage of standard computational means. In this paper, we propose to divide the minimization procedures on the right-hand side for which an analytical solution has been obtained and the integration procedures, which made it possible to radically increase the number of states and thereby, to raise the accuracy of the solution of the problem as a whole.","PeriodicalId":358801,"journal":{"name":"2018 Australian & New Zealand Control Conference (ANZCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Australian & New Zealand Control Conference (ANZCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ANZCC.2018.8606576","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The control problems for Markov chains in continuous time usually can be reduced to the solution of a system of ordinary differential equations, which is an analog of the dynamic programming equation. This system contains, as an obligatory element, the solution of the minimization problem on the set of admissible controls at each instant of time and the success in obtaining a numerical solution depends on how effective this optimization procedure is, because it must be performed for all MC states at any time. Actually, afterward the problem of integrating a system of differential equations is not difficult, since it does not require the extra high accuracy. In one of our previous works, we tried to obtain a solution to the problem of managing the dams system, however even with a small number of dams an increase in the number of states leads to a serious increase in the time of the program performance with usage of standard computational means. In this paper, we propose to divide the minimization procedures on the right-hand side for which an analytical solution has been obtained and the integration procedures, which made it possible to radically increase the number of states and thereby, to raise the accuracy of the solution of the problem as a whole.
一个非凸连通马尔可夫链控制问题。数值解的一种方法
连续时间马尔可夫链的控制问题通常可以归结为一个常微分方程组的解,这是一个类似于动态规划方程的解。该系统必须包含每一时刻允许控制集合上的最小化问题的解,而获得数值解的成功与否取决于该优化过程的有效性,因为它必须在任何时刻对所有MC状态进行优化。实际上,之后的微分方程组的积分问题并不难,因为它不需要特别高的精度。在我们之前的一项工作中,我们试图获得管理水坝系统问题的解决方案,然而,即使使用少量水坝,状态数量的增加也会导致使用标准计算手段的程序执行时间的严重增加。在本文中,我们建议将已得到解析解的最小化过程与积分过程分开,这使得从根本上增加状态数成为可能,从而提高了整个问题解的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信