A Novel Reduced-Layer Deep Learning System via Pixel Rearrangement for Object Detection in Multispectral Imagery

2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA) Pub Date : 2018-06-01 DOI:10.1109/CIVEMSA.2018.8439982

Anusha K. Vishwanathan, D. Megherbi

{"title":"A Novel Reduced-Layer Deep Learning System via Pixel Rearrangement for Object Detection in Multispectral Imagery","authors":"Anusha K. Vishwanathan, D. Megherbi","doi":"10.1109/CIVEMSA.2018.8439982","DOIUrl":null,"url":null,"abstract":"In this paper, we study the problem of object detection on multispectral images. We present a generalized “Fully” Convolutional Neural Network (FCNN)-based deep learning system with a novel Pixel Rearrangement Technique, with reduced computational complexity and improved prediction accuracy than its state-of-the-art counterparts. In particular, we (a) define a key strategy based on spectral signatures to select a set of highly informative multispectral bands for the system; (b) for the first time, introduce a pixel rearrangement technique that efficiently utilizes pixels from the network's feature maps that results into accurate pixelwise prediction images; (c) propose dual stage global and adaptive thresholding methodologies that transform the pixelwise prediction images to binary. We evaluate the proposed system for automatic airborne building detection using the SpaceNet dataset. We use the three NVIDIA GeForce GTX 1060 GPUs at CMINDS Research Center and Tensorflow deep learning framework to implement the proposed system. Our findings show an improvement in the performance by 0.3% in comparison to the top winning submission of the national SpaceNet Building Challenge II, that took place in April 2017, but with an additional 43% reduction in the number of FCNN layers. Finally, we also present a comparison chart with various existing approaches to highlight the proposed reduced computational complexity system.","PeriodicalId":305399,"journal":{"name":"2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIVEMSA.2018.8439982","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In this paper, we study the problem of object detection on multispectral images. We present a generalized “Fully” Convolutional Neural Network (FCNN)-based deep learning system with a novel Pixel Rearrangement Technique, with reduced computational complexity and improved prediction accuracy than its state-of-the-art counterparts. In particular, we (a) define a key strategy based on spectral signatures to select a set of highly informative multispectral bands for the system; (b) for the first time, introduce a pixel rearrangement technique that efficiently utilizes pixels from the network's feature maps that results into accurate pixelwise prediction images; (c) propose dual stage global and adaptive thresholding methodologies that transform the pixelwise prediction images to binary. We evaluate the proposed system for automatic airborne building detection using the SpaceNet dataset. We use the three NVIDIA GeForce GTX 1060 GPUs at CMINDS Research Center and Tensorflow deep learning framework to implement the proposed system. Our findings show an improvement in the performance by 0.3% in comparison to the top winning submission of the national SpaceNet Building Challenge II, that took place in April 2017, but with an additional 43% reduction in the number of FCNN layers. Finally, we also present a comparison chart with various existing approaches to highlight the proposed reduced computational complexity system.

查看原文本刊更多论文

一种新的基于像素重排的多光谱图像目标检测的减层深度学习系统

本文主要研究多光谱图像的目标检测问题。我们提出了一种基于广义“完全”卷积神经网络(FCNN)的深度学习系统，该系统采用了一种新颖的像素重排技术，与最先进的同类系统相比，它降低了计算复杂度，提高了预测精度。具体而言，我们(a)定义了一种基于光谱特征的关键策略，为系统选择一组高信息量的多光谱波段;(b)首次引入了像素重排技术，该技术有效地利用了网络特征映射中的像素，从而生成了精确的像素预测图像;(c)提出双阶段全局和自适应阈值方法，将像素预测图像转换为二值化。我们使用SpaceNet数据集评估了所提出的自动机载建筑物检测系统。我们使用CMINDS研究中心的三颗NVIDIA GeForce GTX 1060 gpu和Tensorflow深度学习框架来实现所提出的系统。我们的研究结果表明，与2017年4月举行的国家SpaceNet建筑挑战赛II的最高获奖作品相比，该方案的性能提高了0.3%，但FCNN层的数量又减少了43%。最后，我们还提供了与各种现有方法的比较图表，以突出所提出的降低计算复杂度的系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA)

自引率

0.00%

发文量