Deep Learning for People Detection on Beach Images
S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário
2018 7th Brazilian Conference on Intelligent Systems (BRACIS), published 2018-10-01
DOI: https://doi.org/10.1109/BRACIS.2018.00045
Citations: 6
Abstract
In recent years, convolutional architectures have become the state of the art for several object detection tasks. However, these detectors have not yet been evaluated for the detection and monitoring of beach areas. Because some of these areas must be monitored continually for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant challenges specific to this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study of three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken at Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of detection quality and speed. The detectors are pretrained on a dataset containing 91 classes of objects, including people at different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the ResNet-101 feature extractor generates significantly better detections in terms of F-measure, while running at 5.6 fps on a GTX 1080 Ti GPU.
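The abstract reports detection quality as an F-measure but does not describe the matching protocol. As a rough illustration only, the sketch below computes precision, recall and F-measure for one image by greedily matching predicted boxes to ground-truth boxes. The IoU threshold of 0.5, the greedy matching rule, the (x1, y1, x2, y2) box format and the function names `iou` and `f_measure` are all assumptions made for this example, not details taken from the paper.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def f_measure(
    preds: List[Box], truths: List[Box], iou_threshold: float = 0.5
) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) for one image.

    Assumption: each prediction is greedily matched to the unmatched
    ground-truth box with the highest IoU, provided that IoU reaches
    the threshold. Each ground-truth box is matched at most once.
    """
    matched = set()
    true_pos = 0
    for p in preds:
        best_j, best_iou = -1, iou_threshold
        for j, t in enumerate(truths):
            if j in matched:
                continue
            overlap = iou(p, t)
            if overlap >= best_iou:
                best_j, best_iou = j, overlap
        if best_j >= 0:
            matched.add(best_j)
            true_pos += 1

    precision = true_pos / len(preds) if preds else 0.0
    recall = true_pos / len(truths) if truths else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom > 0 else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical boxes: two people annotated, three detections returned.
    truths = [(0, 0, 10, 10), (21, 21, 31, 31)]
    preds = [(0, 0, 10, 10), (20, 20, 30, 30), (50, 50, 60, 60)]
    print(f_measure(preds, truths))
```

In practice, detections are usually sorted by confidence score before matching and the counts are accumulated over the whole test set; both steps are omitted here to keep the sketch short.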