{"title":"基于迁移DRL的动态环境下基于优先级的认知MFR任务调度","authors":"Sunila Akbar, R. Adve, Z. Ding, P. Moo","doi":"10.1109/RadarConf2351548.2023.10149670","DOIUrl":null,"url":null,"abstract":"A radar resource management module in a cognitive multifunction radar manages the resources by first prioritizing and then scheduling the tasks. Apart from scheduling the tasks, the task scheduler of a cognitive radar requires the scheduling to be adaptable to the changing environment. We formulate a gen-eral model for the distributions of task parameters, specifically, task priorities and delay tolerance, to ensure priority-based task scheduling. We develop the use of transfer learning (TL) within a deep reinforcement learning (DRL) framework to address the challenge of adaptability to a varying environment. Our approach builds on using a Monte Carlo Tree Search (MCTS) aided by a deep neural network (DNN). We show that TL allows accelerated training by transferring the policy learned by training the D NN-based MCTS on initial parameter distribution (environment) to the policy required for a new environment. Our results show that the high priority tasks are least delayed and dropped with the new formulation, whereas TL ensures the respective adaptation to the dynamic environment.","PeriodicalId":168311,"journal":{"name":"2023 IEEE Radar Conference (RadarConf23)","volume":"3 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Priority-based Task Scheduling in Dynamic Environments for Cognitive MFR via Transfer DRL\",\"authors\":\"Sunila Akbar, R. Adve, Z. Ding, P. Moo\",\"doi\":\"10.1109/RadarConf2351548.2023.10149670\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A radar resource management module in a cognitive multifunction radar manages the resources by first prioritizing and then scheduling the tasks. Apart from scheduling the tasks, the task scheduler of a cognitive radar requires the scheduling to be adaptable to the changing environment. We formulate a gen-eral model for the distributions of task parameters, specifically, task priorities and delay tolerance, to ensure priority-based task scheduling. We develop the use of transfer learning (TL) within a deep reinforcement learning (DRL) framework to address the challenge of adaptability to a varying environment. Our approach builds on using a Monte Carlo Tree Search (MCTS) aided by a deep neural network (DNN). We show that TL allows accelerated training by transferring the policy learned by training the D NN-based MCTS on initial parameter distribution (environment) to the policy required for a new environment. Our results show that the high priority tasks are least delayed and dropped with the new formulation, whereas TL ensures the respective adaptation to the dynamic environment.\",\"PeriodicalId\":168311,\"journal\":{\"name\":\"2023 IEEE Radar Conference (RadarConf23)\",\"volume\":\"3 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE Radar Conference (RadarConf23)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RadarConf2351548.2023.10149670\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE Radar Conference (RadarConf23)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RadarConf2351548.2023.10149670","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Priority-based Task Scheduling in Dynamic Environments for Cognitive MFR via Transfer DRL
A radar resource management module in a cognitive multifunction radar manages the resources by first prioritizing and then scheduling the tasks. Apart from scheduling the tasks, the task scheduler of a cognitive radar requires the scheduling to be adaptable to the changing environment. We formulate a gen-eral model for the distributions of task parameters, specifically, task priorities and delay tolerance, to ensure priority-based task scheduling. We develop the use of transfer learning (TL) within a deep reinforcement learning (DRL) framework to address the challenge of adaptability to a varying environment. Our approach builds on using a Monte Carlo Tree Search (MCTS) aided by a deep neural network (DNN). We show that TL allows accelerated training by transferring the policy learned by training the D NN-based MCTS on initial parameter distribution (environment) to the policy required for a new environment. Our results show that the high priority tasks are least delayed and dropped with the new formulation, whereas TL ensures the respective adaptation to the dynamic environment.