关于动态巡逻安全游戏

Q4 Decision Sciences

Journal of the Operations Research Society of Japan Pub Date : 2019-10-31 DOI:10.15807/jorsj.62.152

Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori

{"title":"关于动态巡逻安全游戏","authors":"Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori","doi":"10.15807/jorsj.62.152","DOIUrl":null,"url":null,"abstract":"We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ON DYNAMIC PATROLLING SECURITY GAMES\",\"authors\":\"Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori\",\"doi\":\"10.15807/jorsj.62.152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.\",\"PeriodicalId\":51107,\"journal\":{\"name\":\"Journal of the Operations Research Society of Japan\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Operations Research Society of Japan\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15807/jorsj.62.152\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Operations Research Society of Japan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15807/jorsj.62.152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Decision Sciences","Score":null,"Total":0}

引用次数: 0

摘要

我们考虑Stackelberg巡逻安全游戏，在这个游戏中，一名保安和一名入侵者在一个设施周围走动。在这些游戏中，在每个时间点，守卫会根据他们当时的位置获得奖励(入侵者会付出代价)。守卫的目的(责任)。(如入侵者)巡视(侵入)设施，使总奖励最大化(最小化)。我们研究了三种情况:在情形1中，守卫先选择预定路线，然后入侵者在完全观察守卫的选择后选择预定路线。在情形2中，守卫将自己的计划路线随机化，入侵者观察其概率分布，也将自己的计划路线随机化。在情况3中，守卫也随机安排了她的预定路线，但是入侵者会依次观察守卫的位置并改变路线到达他的目标之一。研究表明，在情形1、情形2和情形3中，入侵者的最佳对策问题可以分别表述为最短路径问题和马尔可夫决策过程。此外，每种情况下的平衡问题分别简化为多项式大小的混合整数线性规划、线性规划和双线性规划问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ON DYNAMIC PATROLLING SECURITY GAMES

We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of the Operations Research Society of Japan 管理科学-运筹学与管理科学

CiteScore

0.70

自引率

0.00%

发文量

审稿时长

12 months

期刊介绍： The journal publishes original work and quality reviews in the field of operations research and management science to OR practitioners and researchers in two substantive categories: operations research methods; applications and practices of operations research in industry, public sector, and all areas of science and engineering.