利用基于注意力的深度强化学习解决城市空域无人机作战中的战术冲突

Green Energy and Intelligent Transportation Pub Date : 2023-08-01 DOI:10.1016/j.geits.2023.100107

Mingcheng Zhang , Chao Yan , Wei Dai , Xiaojia Xiang , Kin Huat Low

{"title":"利用基于注意力的深度强化学习解决城市空域无人机作战中的战术冲突","authors":"Mingcheng Zhang , Chao Yan , Wei Dai , Xiaojia Xiang , Kin Huat Low","doi":"10.1016/j.geits.2023.100107","DOIUrl":null,"url":null,"abstract":"<div><p>Unmanned aerial vehicles (UAVs) have gained much attention from academic and industrial areas due to the significant number of potential applications in urban airspace. A traffic management system for these UAVs is needed to manage this future traffic. Tactical conflict resolution for unmanned aerial systems (UASs) is an essential piece of the puzzle for the future UAS Traffic Management (UTM), especially in very low-level (VLL) urban airspace. Unlike conflict resolution in higher altitude airspace, the dense high-rise buildings are an essential source of potential conflict to be considered in VLL urban airspace. In this paper, we propose an attention-based deep reinforcement learning approach to solve the tactical conflict resolution problem. Specifically, we formulate this task as a sequential decision-making problem using Markov Decision Process (MDP). The double deep Q network (DDQN) framework is used as a learning framework for the host drone to learn to output conflict-free maneuvers at each time step. We use the attention mechanism to model the individual neighbor's effect on the host drone, endowing the learned conflict resolution policy to be adapted to an arbitrary number of neighboring drones. Lastly, we build a simulation environment with various scenarios covering different types of encounters to evaluate the proposed approach. The simulation results demonstrate that our proposed algorithm provides a reliable solution to minimize secondary conflict counts compared to learning and non-learning-based approaches under different traffic density scenarios.</p></div>","PeriodicalId":100596,"journal":{"name":"Green Energy and Intelligent Transportation","volume":"2 4","pages":"Article 100107"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Tactical conflict resolution in urban airspace for unmanned aerial vehicles operations using attention-based deep reinforcement learning\",\"authors\":\"Mingcheng Zhang , Chao Yan , Wei Dai , Xiaojia Xiang , Kin Huat Low\",\"doi\":\"10.1016/j.geits.2023.100107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Unmanned aerial vehicles (UAVs) have gained much attention from academic and industrial areas due to the significant number of potential applications in urban airspace. A traffic management system for these UAVs is needed to manage this future traffic. Tactical conflict resolution for unmanned aerial systems (UASs) is an essential piece of the puzzle for the future UAS Traffic Management (UTM), especially in very low-level (VLL) urban airspace. Unlike conflict resolution in higher altitude airspace, the dense high-rise buildings are an essential source of potential conflict to be considered in VLL urban airspace. In this paper, we propose an attention-based deep reinforcement learning approach to solve the tactical conflict resolution problem. Specifically, we formulate this task as a sequential decision-making problem using Markov Decision Process (MDP). The double deep Q network (DDQN) framework is used as a learning framework for the host drone to learn to output conflict-free maneuvers at each time step. We use the attention mechanism to model the individual neighbor's effect on the host drone, endowing the learned conflict resolution policy to be adapted to an arbitrary number of neighboring drones. Lastly, we build a simulation environment with various scenarios covering different types of encounters to evaluate the proposed approach. The simulation results demonstrate that our proposed algorithm provides a reliable solution to minimize secondary conflict counts compared to learning and non-learning-based approaches under different traffic density scenarios.</p></div>\",\"PeriodicalId\":100596,\"journal\":{\"name\":\"Green Energy and Intelligent Transportation\",\"volume\":\"2 4\",\"pages\":\"Article 100107\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Green Energy and Intelligent Transportation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2773153723000439\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Green Energy and Intelligent Transportation","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2773153723000439","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

无人机（UAV）在城市空域具有大量潜在应用，因此受到了学术界和工业界的广泛关注。这些无人机需要一个交通管理系统来管理未来的交通。无人机系统（UAS）的战术冲突解决是未来无人机交通管理（UTM）的一个重要难题，尤其是在极低空（VLL）城市空域。与高海拔空域的冲突解决不同，密集的高层建筑是VLL城市空域潜在冲突的重要来源。在本文中，我们提出了一种基于注意力的深度强化学习方法来解决战术冲突解决问题。具体来说，我们使用马尔可夫决策过程（MDP）将该任务表述为一个序列决策问题。双深度Q网络（DDQN）框架被用作宿主无人机的学习框架，以学习在每个时间步长输出无冲突机动。我们使用注意力机制来模拟单个邻居对宿主无人机的影响，使学习到的冲突解决策略能够适应任意数量的邻居无人机。最后，我们构建了一个模拟环境，其中包含涵盖不同类型遭遇的各种场景，以评估所提出的方法。仿真结果表明，在不同的交通密度场景下，与基于学习和非学习的方法相比，我们提出的算法提供了一种可靠的解决方案，可以最大限度地减少二次冲突计数。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Tactical conflict resolution in urban airspace for unmanned aerial vehicles operations using attention-based deep reinforcement learning

Unmanned aerial vehicles (UAVs) have gained much attention from academic and industrial areas due to the significant number of potential applications in urban airspace. A traffic management system for these UAVs is needed to manage this future traffic. Tactical conflict resolution for unmanned aerial systems (UASs) is an essential piece of the puzzle for the future UAS Traffic Management (UTM), especially in very low-level (VLL) urban airspace. Unlike conflict resolution in higher altitude airspace, the dense high-rise buildings are an essential source of potential conflict to be considered in VLL urban airspace. In this paper, we propose an attention-based deep reinforcement learning approach to solve the tactical conflict resolution problem. Specifically, we formulate this task as a sequential decision-making problem using Markov Decision Process (MDP). The double deep Q network (DDQN) framework is used as a learning framework for the host drone to learn to output conflict-free maneuvers at each time step. We use the attention mechanism to model the individual neighbor's effect on the host drone, endowing the learned conflict resolution policy to be adapted to an arbitrary number of neighboring drones. Lastly, we build a simulation environment with various scenarios covering different types of encounters to evaluate the proposed approach. The simulation results demonstrate that our proposed algorithm provides a reliable solution to minimize secondary conflict counts compared to learning and non-learning-based approaches under different traffic density scenarios.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Green Energy and Intelligent Transportation

CiteScore

6.40

自引率

0.00%

发文量