Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi
{"title":"Simultaneous object detection and localization using convolutional neural networks","authors":"Fatima Zahra Ouadiay, Hamza Bouftaih, E. Bouyakhf, M. M. Himmi","doi":"10.1109/ISACV.2018.8354045","DOIUrl":null,"url":null,"abstract":"Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.","PeriodicalId":184662,"journal":{"name":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Intelligent Systems and Computer Vision (ISCV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISACV.2018.8354045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Nowadays deep learning is considered as a trendy technique in the computer vision domain. It becomes a pioneer in its main tasks as object classification, object localization, and object detection. Therefore it gave amazing results and records. In this paper, we propose a new approach to identify and localize objects, simultaneously, in a given scene using Convolutional Neural Networks (CNNs). We propose an end-to-end approach for object detection and pose estimation by formulating bounding boxes containing the targeted object and their pose. Our method is based on two main steps, i) produce Bounding boxes on the training images for generating the pose coordinates of each object in the scene and, ii) detect and localize simultaneously each object present in image during the testing step. The contribution performance is assessed on two datasets, Washington RGB scene dataset and LIMIARF dataset that is constructed in our laboratory. We demonstrate that our proposal is able to obtain high precision and reasonable recall levels.