End-to-end object detection and recognition in forward-looking sonar images with convolutional neural networks

Matias Valdenegro-Toro
{"title":"End-to-end object detection and recognition in forward-looking sonar images with convolutional neural networks","authors":"Matias Valdenegro-Toro","doi":"10.1109/AUV.2016.7778662","DOIUrl":null,"url":null,"abstract":"Object detection and recognition are typically stages that form part of the perception module of Autonomous Underwater Vehicles, used with different sensors such as Sonar and Optical imaging, but their design is usually separate and they are only combined at test time. In this work we present a convolutional neural network that does both object detection (through detection proposals) and recognition in Forward-Looking Sonar images and is trained with bounding boxes and class labels only. Convolutional layers are shared and a 128-element feature vector is shared between both tasks. After training we obtain 93% correct detections and 75% accuracy, but accuracy can be improved by fine-tuning the classifier sub-network with the generated detection proposals. We evaluated fine-tuning with a SVM classifier trained on the shared feature vector, increasing accuracy to 85%. Our detection proposal method can also detect unlabeled and untrained objects, and has good generalization performance. Our unified method can be used in any kind of sonar image, does not make assumptions about an object's shadow, and learns features directly from data.","PeriodicalId":416057,"journal":{"name":"2016 IEEE/OES Autonomous Underwater Vehicles (AUV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/OES Autonomous Underwater Vehicles (AUV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AUV.2016.7778662","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 35

Abstract

Object detection and recognition are typically stages that form part of the perception module of Autonomous Underwater Vehicles, used with different sensors such as Sonar and Optical imaging, but their design is usually separate and they are only combined at test time. In this work we present a convolutional neural network that does both object detection (through detection proposals) and recognition in Forward-Looking Sonar images and is trained with bounding boxes and class labels only. Convolutional layers are shared and a 128-element feature vector is shared between both tasks. After training we obtain 93% correct detections and 75% accuracy, but accuracy can be improved by fine-tuning the classifier sub-network with the generated detection proposals. We evaluated fine-tuning with a SVM classifier trained on the shared feature vector, increasing accuracy to 85%. Our detection proposal method can also detect unlabeled and untrained objects, and has good generalization performance. Our unified method can be used in any kind of sonar image, does not make assumptions about an object's shadow, and learns features directly from data.
基于卷积神经网络的前视声纳图像端到端目标检测与识别
目标检测和识别通常是构成自主水下航行器感知模块的一部分,与声纳和光学成像等不同的传感器一起使用,但它们的设计通常是分开的,只有在测试时才会组合在一起。在这项工作中,我们提出了一个卷积神经网络,它既可以进行目标检测(通过检测建议),也可以在前瞻性声纳图像中进行识别,并且只使用边界框和类别标签进行训练。两个任务之间共享卷积层和128个元素的特征向量。经过训练,我们获得了93%的正确率和75%的准确率,但准确率可以通过使用生成的检测建议对分类器子网络进行微调来提高。我们使用在共享特征向量上训练的SVM分类器评估微调,将准确率提高到85%。我们的检测建议方法也可以检测到未标记和未训练的对象,并且具有良好的泛化性能。我们的统一方法可以用于任何类型的声纳图像,不需要对物体的阴影进行假设,而是直接从数据中学习特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信