一种基于时域cnn的自主无人机竞赛方法

2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS) Pub Date : 2019-11-01 DOI:10.1109/REDUAS47371.2019.8999703

L. Rojas-Perez, J. Martínez-Carranza

{"title":"一种基于时域cnn的自主无人机竞赛方法","authors":"L. Rojas-Perez, J. Martínez-Carranza","doi":"10.1109/REDUAS47371.2019.8999703","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.","PeriodicalId":351115,"journal":{"name":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Temporal CNN-based Approach for Autonomous Drone Racing\",\"authors\":\"L. Rojas-Perez, J. Martínez-Carranza\",\"doi\":\"10.1109/REDUAS47371.2019.8999703\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.\",\"PeriodicalId\":351115,\"journal\":{\"name\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/REDUAS47371.2019.8999703\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REDUAS47371.2019.8999703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

卷积神经网络(CNN)和深度学习(DL)已经成为解决各种人工智能挑战的流行工具。自主无人机竞赛是一项挑战，包括开发能够在无人机竞赛中击败人类的自主无人机，而深度学习是一种工具，已被包含在最先进的解决方案中，以解决这个问题。目前的工作已经提出使用CNN和DL来检测门，而其他工作已经提出使用CNN来获取无人机的控制命令和目标点，所有这些方法都使用单个图像作为输入。在这项工作中，我们提出了一个基于众所周知的pose-net网络的CNN。最初用于相机重新定位，我们建议使用pose-net来提供控制命令，以驱动无人机走向并自主穿过大门。与之前的工作相反，我们还建议使用一组临时图像作为网络的输入。具体来说，我们在一秒钟内每166毫秒拍摄6张图像来创建马赛克。后者作为CNN的输入来预测控制命令。我们将这种提出的时间方法与使用单个图像作为CNN的输入进行比较。虽然在模拟中，我们的结果表明，仅使用我们的时间方法是可行的，比单图像方法噪声更小，更有效，使无人机能够自主穿越随机放置的一组门，甚至在门动态移动的情况下。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Temporal CNN-based Approach for Autonomous Drone Racing

Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)

自引率

0.00%

发文量