{"title":"认知雷达任务调度的强化学习","authors":"M. Gaafar, M. Shaghaghi, R. Adve, Z. Ding","doi":"10.1109/IEEECONF44664.2019.9048892","DOIUrl":null,"url":null,"abstract":"Cognitive Radars (CRs) have the capability to adapt to their environment and accumulate knowledge from their interactions with the environment. This paper deals with the Radar Resource Management (RRM) problem where the radar assigns limited time resources to a set of tasks. The problem is modeled as an optimization problem where the aim is to minimize the number of delayed and dropped tasks which is an NP-hard problem. We propose a modified Monte Carlo Tree Search (MCTS) approach to find an effective solution. We further develop a Reinforcement Learning (RL) solution that uses a Neural Network (NN) to guide the modified MCTS. This produces a stable RL algorithm that learns on its own, requires no external training data, and can adapt to a varying environment. The results show the proposed RL algorithm outperforms other techniques including commonly used heuristics and produces close to optimal results.","PeriodicalId":6684,"journal":{"name":"2019 53rd Asilomar Conference on Signals, Systems, and Computers","volume":"48 1","pages":"1653-1657"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Reinforcement Learning for Cognitive Radar Task Scheduling\",\"authors\":\"M. Gaafar, M. Shaghaghi, R. Adve, Z. Ding\",\"doi\":\"10.1109/IEEECONF44664.2019.9048892\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cognitive Radars (CRs) have the capability to adapt to their environment and accumulate knowledge from their interactions with the environment. This paper deals with the Radar Resource Management (RRM) problem where the radar assigns limited time resources to a set of tasks. The problem is modeled as an optimization problem where the aim is to minimize the number of delayed and dropped tasks which is an NP-hard problem. We propose a modified Monte Carlo Tree Search (MCTS) approach to find an effective solution. We further develop a Reinforcement Learning (RL) solution that uses a Neural Network (NN) to guide the modified MCTS. This produces a stable RL algorithm that learns on its own, requires no external training data, and can adapt to a varying environment. The results show the proposed RL algorithm outperforms other techniques including commonly used heuristics and produces close to optimal results.\",\"PeriodicalId\":6684,\"journal\":{\"name\":\"2019 53rd Asilomar Conference on Signals, Systems, and Computers\",\"volume\":\"48 1\",\"pages\":\"1653-1657\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 53rd Asilomar Conference on Signals, Systems, and Computers\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IEEECONF44664.2019.9048892\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 53rd Asilomar Conference on Signals, Systems, and Computers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEEECONF44664.2019.9048892","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Reinforcement Learning for Cognitive Radar Task Scheduling
Cognitive Radars (CRs) have the capability to adapt to their environment and accumulate knowledge from their interactions with the environment. This paper deals with the Radar Resource Management (RRM) problem where the radar assigns limited time resources to a set of tasks. The problem is modeled as an optimization problem where the aim is to minimize the number of delayed and dropped tasks which is an NP-hard problem. We propose a modified Monte Carlo Tree Search (MCTS) approach to find an effective solution. We further develop a Reinforcement Learning (RL) solution that uses a Neural Network (NN) to guide the modified MCTS. This produces a stable RL algorithm that learns on its own, requires no external training data, and can adapt to a varying environment. The results show the proposed RL algorithm outperforms other techniques including commonly used heuristics and produces close to optimal results.