{"title":"A Novel Reduced-Layer Deep Learning System via Pixel Rearrangement for Object Detection in Multispectral Imagery","authors":"Anusha K. Vishwanathan, D. Megherbi","doi":"10.1109/CIVEMSA.2018.8439982","DOIUrl":null,"url":null,"abstract":"In this paper, we study the problem of object detection on multispectral images. We present a generalized “Fully” Convolutional Neural Network (FCNN)-based deep learning system with a novel Pixel Rearrangement Technique, with reduced computational complexity and improved prediction accuracy than its state-of-the-art counterparts. In particular, we (a) define a key strategy based on spectral signatures to select a set of highly informative multispectral bands for the system; (b) for the first time, introduce a pixel rearrangement technique that efficiently utilizes pixels from the network's feature maps that results into accurate pixelwise prediction images; (c) propose dual stage global and adaptive thresholding methodologies that transform the pixelwise prediction images to binary. We evaluate the proposed system for automatic airborne building detection using the SpaceNet dataset. We use the three NVIDIA GeForce GTX 1060 GPUs at CMINDS Research Center and Tensorflow deep learning framework to implement the proposed system. Our findings show an improvement in the performance by 0.3% in comparison to the top winning submission of the national SpaceNet Building Challenge II, that took place in April 2017, but with an additional 43% reduction in the number of FCNN layers. Finally, we also present a comparison chart with various existing approaches to highlight the proposed reduced computational complexity system.","PeriodicalId":305399,"journal":{"name":"2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIVEMSA.2018.8439982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we study the problem of object detection on multispectral images. We present a generalized “Fully” Convolutional Neural Network (FCNN)-based deep learning system with a novel Pixel Rearrangement Technique, with reduced computational complexity and improved prediction accuracy than its state-of-the-art counterparts. In particular, we (a) define a key strategy based on spectral signatures to select a set of highly informative multispectral bands for the system; (b) for the first time, introduce a pixel rearrangement technique that efficiently utilizes pixels from the network's feature maps that results into accurate pixelwise prediction images; (c) propose dual stage global and adaptive thresholding methodologies that transform the pixelwise prediction images to binary. We evaluate the proposed system for automatic airborne building detection using the SpaceNet dataset. We use the three NVIDIA GeForce GTX 1060 GPUs at CMINDS Research Center and Tensorflow deep learning framework to implement the proposed system. Our findings show an improvement in the performance by 0.3% in comparison to the top winning submission of the national SpaceNet Building Challenge II, that took place in April 2017, but with an additional 43% reduction in the number of FCNN layers. Finally, we also present a comparison chart with various existing approaches to highlight the proposed reduced computational complexity system.