LP formulation of asymmetric zero-sum stochastic games

53rd IEEE Conference on Decision and Control Pub Date : 2014-12-15 DOI:10.1109/CDC.2014.7039680

Lichun Li, J. Shamma

引用次数: 29

Abstract

This paper provides an efficient linear programming (LP) formulation of asymmetric two player zero-sum stochastic games with finite horizon. In these stochastic games, only one player is informed of the state at each stage, and the transition law is only controlled by the informed player. Compared with the LP formulation of extensive stochastic games whose size grows polynomially with respect to the size of the state and the size of the uninformed player's actions, our proposed LP formulation has its size to be linear with respect to the size of the state and the size of the uninformed player, and hence greatly reduces the computational complexity. A travelling inspector problem is used to demonstrate the efficiency of the proposed LP formulation.

查看原文本刊更多论文

非对称零和随机博弈的LP公式

本文给出了具有有限视界的非对称二人零和随机对策的一个有效线性规划公式。在这些随机博弈中，每个阶段只有一个参与者被告知状态，并且过渡律仅由知情的参与者控制。与广泛随机博弈的LP公式(其大小与状态大小和不知情参与者的行动大小呈多项式增长)相比，我们提出的LP公式的大小与状态大小和不知情参与者的大小呈线性关系，从而大大降低了计算复杂度。用一个旅行检查员问题来证明所提出的LP公式的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

53rd IEEE Conference on Decision and Control

自引率

0.00%

发文量