{"title":"Limit Action Space to Enhance Drone Control with Deep Reinforcement Learning","authors":"Sooyoung Jang, Noh-Sam Park","doi":"10.1109/ICTC49870.2020.9289571","DOIUrl":null,"url":null,"abstract":"Although many research progresses on deep reinforcement learning, it is not yet perfect. It may take too much time or even fail to solve the problem. Therefore, simplifying the problem by intentionally limiting the agent’s action space should help train the agent efficiently and effectively. To verify that, in this paper, we analyze the performances of various action space designs for controlling a drone with deep reinforcement learning. We have designed six different action spaces according to the degree of freedom to analyze the effect of limiting the agent’s action space on performance metrics such as travel distance and time, goal rate, and total reward. We show that by limiting the degree of freedom, the agent learns to reach the goal faster with less travel distance and achieve a higher goal rate and reward.","PeriodicalId":282243,"journal":{"name":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Information and Communication Technology Convergence (ICTC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTC49870.2020.9289571","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Although many research progresses on deep reinforcement learning, it is not yet perfect. It may take too much time or even fail to solve the problem. Therefore, simplifying the problem by intentionally limiting the agent’s action space should help train the agent efficiently and effectively. To verify that, in this paper, we analyze the performances of various action space designs for controlling a drone with deep reinforcement learning. We have designed six different action spaces according to the degree of freedom to analyze the effect of limiting the agent’s action space on performance metrics such as travel distance and time, goal rate, and total reward. We show that by limiting the degree of freedom, the agent learns to reach the goal faster with less travel distance and achieve a higher goal rate and reward.