Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori
{"title":"关于动态巡逻安全游戏","authors":"Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori","doi":"10.15807/jorsj.62.152","DOIUrl":null,"url":null,"abstract":"We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.","PeriodicalId":51107,"journal":{"name":"Journal of the Operations Research Society of Japan","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ON DYNAMIC PATROLLING SECURITY GAMES\",\"authors\":\"Akifumi Kira, Naoyuki Kamiyama, H. Anai, H. Iwashita, Kotaro Ohori\",\"doi\":\"10.15807/jorsj.62.152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.\",\"PeriodicalId\":51107,\"journal\":{\"name\":\"Journal of the Operations Research Society of Japan\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Operations Research Society of Japan\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.15807/jorsj.62.152\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Decision Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Operations Research Society of Japan","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15807/jorsj.62.152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Decision Sciences","Score":null,"Total":0}
We consider Stackelberg patrolling security games in which a security guard and an intruder walk around a facility. In these games, at each timepoint, the guard earns a reward (intruder incurs a cost) depending on their locations at that time. The objective of the guard (resp., the intruder) is to patrol (intrude) the facility so that the total sum of rewards is maximized (minimized). We study three cases: In Case 1, the guard chooses a scheduled route first and then the intruder chooses a scheduled route after perfectly observing the guard’s choice. In Case 2, the guard randomizes her scheduled routes and then intruder observes its probability distribution and also randomize his scheduled routes. In Case 3, the guard randomizes her scheduled routes as well, but the intruder sequentially observes the location of the guard and reroutes to reach one of his targets. We show that the intruder’s best response problem in Cases 1 and 2 and Case 3 can be formulated as a shortest path problem and a Markov decision process, respectively. Moreover, the equilibrium problem in each case reduces to a polynomial-sized mixed integer linear programming, linear programming, and bilinear programming problem, respectively.
期刊介绍:
The journal publishes original work and quality reviews in the field of operations research and management science to OR practitioners and researchers in two substantive categories: operations research methods; applications and practices of operations research in industry, public sector, and all areas of science and engineering.