Towards one nonconvex connected Markov chain control problem. An approach to numerical solution

2018 Australian & New Zealand Control Conference (ANZCC) Pub Date : 2018-12-01 DOI:10.1109/ANZCC.2018.8606576

B. Miller, K. Stepanyan, A. Miller, A. Popov

{"title":"Towards one nonconvex connected Markov chain control problem. An approach to numerical solution","authors":"B. Miller, K. Stepanyan, A. Miller, A. Popov","doi":"10.1109/ANZCC.2018.8606576","DOIUrl":null,"url":null,"abstract":"The control problems for Markov chains in continuous time usually can be reduced to the solution of a system of ordinary differential equations, which is an analog of the dynamic programming equation. This system contains, as an obligatory element, the solution of the minimization problem on the set of admissible controls at each instant of time and the success in obtaining a numerical solution depends on how effective this optimization procedure is, because it must be performed for all MC states at any time. Actually, afterward the problem of integrating a system of differential equations is not difficult, since it does not require the extra high accuracy. In one of our previous works, we tried to obtain a solution to the problem of managing the dams system, however even with a small number of dams an increase in the number of states leads to a serious increase in the time of the program performance with usage of standard computational means. In this paper, we propose to divide the minimization procedures on the right-hand side for which an analytical solution has been obtained and the integration procedures, which made it possible to radically increase the number of states and thereby, to raise the accuracy of the solution of the problem as a whole.","PeriodicalId":358801,"journal":{"name":"2018 Australian & New Zealand Control Conference (ANZCC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Australian & New Zealand Control Conference (ANZCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ANZCC.2018.8606576","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The control problems for Markov chains in continuous time usually can be reduced to the solution of a system of ordinary differential equations, which is an analog of the dynamic programming equation. This system contains, as an obligatory element, the solution of the minimization problem on the set of admissible controls at each instant of time and the success in obtaining a numerical solution depends on how effective this optimization procedure is, because it must be performed for all MC states at any time. Actually, afterward the problem of integrating a system of differential equations is not difficult, since it does not require the extra high accuracy. In one of our previous works, we tried to obtain a solution to the problem of managing the dams system, however even with a small number of dams an increase in the number of states leads to a serious increase in the time of the program performance with usage of standard computational means. In this paper, we propose to divide the minimization procedures on the right-hand side for which an analytical solution has been obtained and the integration procedures, which made it possible to radically increase the number of states and thereby, to raise the accuracy of the solution of the problem as a whole.

查看原文本刊更多论文

一个非凸连通马尔可夫链控制问题。数值解的一种方法

连续时间马尔可夫链的控制问题通常可以归结为一个常微分方程组的解，这是一个类似于动态规划方程的解。该系统必须包含每一时刻允许控制集合上的最小化问题的解，而获得数值解的成功与否取决于该优化过程的有效性，因为它必须在任何时刻对所有MC状态进行优化。实际上，之后的微分方程组的积分问题并不难，因为它不需要特别高的精度。在我们之前的一项工作中，我们试图获得管理水坝系统问题的解决方案，然而，即使使用少量水坝，状态数量的增加也会导致使用标准计算手段的程序执行时间的严重增加。在本文中，我们建议将已得到解析解的最小化过程与积分过程分开，这使得从根本上增加状态数成为可能，从而提高了整个问题解的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Australian & New Zealand Control Conference (ANZCC)

自引率

0.00%

发文量