同时目标检测和定位使用卷积神经网络

Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi
{"title":"同时目标检测和定位使用卷积神经网络","authors":"Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi","doi":"10.1109/ISACV.2018.8354045","DOIUrl":null,"url":null,"abstract":"Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.","PeriodicalId":184662,"journal":{"name":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Simultaneous object detection and localization using convolutional neural networks\",\"authors\":\"Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi\",\"doi\":\"10.1109/ISACV.2018.8354045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.\",\"PeriodicalId\":184662,\"journal\":{\"name\":\"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISACV.2018.8354045\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISACV.2018.8354045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

目前,深度学习被认为是计算机视觉领域的一种新兴技术。它成为其主要任务的先驱,如目标分类,目标定位和目标检测。因此,它给了惊人的结果和记录。在本文中,我们提出了一种使用卷积神经网络(cnn)在给定场景中同时识别和定位物体的新方法。我们提出了一种端到端的方法,通过制定包含目标物体及其姿态的边界框来进行目标检测和姿态估计。我们的方法基于两个主要步骤,i)在训练图像上生成边界框,用于生成场景中每个物体的姿态坐标;ii)在测试步骤中同时检测和定位图像中存在的每个物体。在华盛顿RGB场景数据集和我们实验室构建的LIMIARF数据集上对贡献性能进行了评估。我们证明了我们的提议能够获得较高的精度和合理的召回水平。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Simultaneous object detection and localization using convolutional neural networks
Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信