Imitation Learning of Complex Behaviors for Multiple Drones with Limited Vision

IF 4.4 2区地球科学 Q1 REMOTE SENSING

Drones Pub Date : 2023-12-13 DOI:10.3390/drones7120704

Yu Wan, Jun Tang, Zipeng Zhao

{"title":"Imitation Learning of Complex Behaviors for Multiple Drones with Limited Vision","authors":"Yu Wan, Jun Tang, Zipeng Zhao","doi":"10.3390/drones7120704","DOIUrl":null,"url":null,"abstract":"Navigating multiple drones autonomously in complex and unpredictable environments, such as forests, poses a significant challenge typically addressed by wireless communication for coordination. However, this approach falls short in situations with limited central control or blocked communications. Addressing this gap, our paper explores the learning of complex behaviors by multiple drones with limited vision. Drones in a swarm rely on onboard sensors, primarily forward-facing stereo cameras, for environmental perception and neighbor detection. They learn complex maneuvers through the imitation of a privileged expert system, which involves finding the optimal set of neural network parameters to enable the most effective mapping from sensory perception to control commands. The training process adopts the Dagger algorithm, employing the framework of centralized training with decentralized execution. Using this technique, drones rapidly learn complex behaviors, such as avoiding obstacles, coordinating movements, and navigating to specified targets, all in the absence of wireless communication. This paper details the construction of a distributed multi-UAV cooperative motion model under limited vision, emphasizing the autonomy of each drone in achieving coordinated flight and obstacle avoidance. Our methodological approach and experimental results validate the effectiveness of the proposed vision-based end-to-end controller, paving the way for more sophisticated applications of multi-UAV systems in intricate, real-world scenarios.","PeriodicalId":36448,"journal":{"name":"Drones","volume":"52 8","pages":""},"PeriodicalIF":4.4000,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drones","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/drones7120704","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}

引用次数: 0

Abstract

Navigating multiple drones autonomously in complex and unpredictable environments, such as forests, poses a significant challenge typically addressed by wireless communication for coordination. However, this approach falls short in situations with limited central control or blocked communications. Addressing this gap, our paper explores the learning of complex behaviors by multiple drones with limited vision. Drones in a swarm rely on onboard sensors, primarily forward-facing stereo cameras, for environmental perception and neighbor detection. They learn complex maneuvers through the imitation of a privileged expert system, which involves finding the optimal set of neural network parameters to enable the most effective mapping from sensory perception to control commands. The training process adopts the Dagger algorithm, employing the framework of centralized training with decentralized execution. Using this technique, drones rapidly learn complex behaviors, such as avoiding obstacles, coordinating movements, and navigating to specified targets, all in the absence of wireless communication. This paper details the construction of a distributed multi-UAV cooperative motion model under limited vision, emphasizing the autonomy of each drone in achieving coordinated flight and obstacle avoidance. Our methodological approach and experimental results validate the effectiveness of the proposed vision-based end-to-end controller, paving the way for more sophisticated applications of multi-UAV systems in intricate, real-world scenarios.

查看原文本刊更多论文

视觉受限的多架无人机复杂行为的模仿学习

在复杂和不可预测的环境（如森林）中，多架无人机的自主导航是一项重大挑战，通常采用无线通信进行协调。然而，在中央控制有限或通信受阻的情况下，这种方法就显得力不从心了。针对这一缺陷，我们的论文探讨了多架视觉有限的无人机学习复杂行为的问题。蜂群中的无人机依靠机载传感器（主要是面向前方的立体摄像头）进行环境感知和邻居探测。它们通过模仿一个特权专家系统来学习复杂的动作，这涉及到寻找最优的神经网络参数集，以实现从感官感知到控制指令的最有效映射。训练过程采用 Dagger 算法，采用集中训练、分散执行的框架。利用这种技术，无人机可以在没有无线通信的情况下快速学习复杂的行为，如避开障碍物、协调运动、向指定目标导航等。本文详细介绍了有限视觉下分布式多无人机合作运动模型的构建，强调了每架无人机在实现协调飞行和避障时的自主性。我们的方法论和实验结果验证了所提出的基于视觉的端到端控制器的有效性，为多无人机系统在错综复杂的现实世界场景中的更复杂应用铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Drones Engineering-Aerospace Engineering

CiteScore

5.60

自引率

18.80%

发文量

331