一种基于时域cnn的自主无人机竞赛方法

L. Rojas-Perez, J. Martínez-Carranza
{"title":"一种基于时域cnn的自主无人机竞赛方法","authors":"L. Rojas-Perez, J. Martínez-Carranza","doi":"10.1109/REDUAS47371.2019.8999703","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.","PeriodicalId":351115,"journal":{"name":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Temporal CNN-based Approach for Autonomous Drone Racing\",\"authors\":\"L. Rojas-Perez, J. Martínez-Carranza\",\"doi\":\"10.1109/REDUAS47371.2019.8999703\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.\",\"PeriodicalId\":351115,\"journal\":{\"name\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/REDUAS47371.2019.8999703\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Workshop on Research, Education and Development of Unmanned Aerial Systems (RED UAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REDUAS47371.2019.8999703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

卷积神经网络(CNN)和深度学习(DL)已经成为解决各种人工智能挑战的流行工具。自主无人机竞赛是一项挑战,包括开发能够在无人机竞赛中击败人类的自主无人机,而深度学习是一种工具,已被包含在最先进的解决方案中,以解决这个问题。目前的工作已经提出使用CNN和DL来检测门,而其他工作已经提出使用CNN来获取无人机的控制命令和目标点,所有这些方法都使用单个图像作为输入。在这项工作中,我们提出了一个基于众所周知的pose-net网络的CNN。最初用于相机重新定位,我们建议使用pose-net来提供控制命令,以驱动无人机走向并自主穿过大门。与之前的工作相反,我们还建议使用一组临时图像作为网络的输入。具体来说,我们在一秒钟内每166毫秒拍摄6张图像来创建马赛克。后者作为CNN的输入来预测控制命令。我们将这种提出的时间方法与使用单个图像作为CNN的输入进行比较。虽然在模拟中,我们的结果表明,仅使用我们的时间方法是可行的,比单图像方法噪声更小,更有效,使无人机能够自主穿越随机放置的一组门,甚至在门动态移动的情况下。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Temporal CNN-based Approach for Autonomous Drone Racing
Convolutional Neural Networks (CNN) and Deep Learning (DL) have become a popular tool to address all sorts of artificial intelligent challenges. The Autonomous Drone Racing is a challenge consisting of developing an autonomous drone capable of beating a human in a drone race, and DL is a tool that has been included in state of the art solutions to address this problem. Current works have proposed to use CNN and DL to detect the gates, whereas other works have proposed to use a CNN to obtain drone’s control commands and a goal point, with all of these approaches using a single image as input. In this work we propose a CNN based on the well known pose-net network. Originally used for camera relocalisation, we propose to use pose-net to provide control commands to drive the drone towards and to cross the gate autonomously. In contrast to previous works, we also propose to use a temporal set of images as input for the network. In specific, we use 6 images captured every 166 milliseconds in one second to create a mosaic. The latter is used as input of the CNN to predict the control commands. We compare this proposed temporal approach against using a single image as input for the CNN. Our results, although in simulation, demonstrate that the using only our temporal approach is feasible, less noisy and more effective than the single image approach, enabling the drone to autonomously cross a set of gates placed randomly, and even under the scenario where the gate moves dynamically.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信