同时目标检测和定位使用卷积神经网络

2018 International Conference on Intelligent Systems and Computer Vision (ISCV) Pub Date : 2018-04-01 DOI:10.1109/ISACV.2018.8354045

Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi

{"title":"同时目标检测和定位使用卷积神经网络","authors":"Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi","doi":"10.1109/ISACV.2018.8354045","DOIUrl":null,"url":null,"abstract":"Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.","PeriodicalId":184662,"journal":{"name":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Simultaneous object detection and localization using convolutional neural networks\",\"authors\":\"Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi\",\"doi\":\"10.1109/ISACV.2018.8354045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.\",\"PeriodicalId\":184662,\"journal\":{\"name\":\"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISACV.2018.8354045\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISACV.2018.8354045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

目前，深度学习被认为是计算机视觉领域的一种新兴技术。它成为其主要任务的先驱，如目标分类，目标定位和目标检测。因此，它给了惊人的结果和记录。在本文中，我们提出了一种使用卷积神经网络(cnn)在给定场景中同时识别和定位物体的新方法。我们提出了一种端到端的方法，通过制定包含目标物体及其姿态的边界框来进行目标检测和姿态估计。我们的方法基于两个主要步骤，i)在训练图像上生成边界框，用于生成场景中每个物体的姿态坐标;ii)在测试步骤中同时检测和定位图像中存在的每个物体。在华盛顿RGB场景数据集和我们实验室构建的LIMIARF数据集上对贡献性能进行了评估。我们证明了我们的提议能够获得较高的精度和合理的召回水平。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Simultaneous object detection and localization using convolutional neural networks

Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2018 International Conference on Intelligent Systems and Computer Vision (ISCV)

自引率

0.00%

发文量