ABCD: A Compact Object Detector Based on Channel Quantization and Tensor Decomposition

2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin) Pub Date : 2020-11-09 DOI:10.1109/ICCE-Berlin50680.2020.9352200

Bingyi Zhang, Peining Zhen, Junyan Yang, Saisai Niu, Hang Yi, Hai-Bao Chen

{"title":"ABCD: A Compact Object Detector Based on Channel Quantization and Tensor Decomposition","authors":"Bingyi Zhang, Peining Zhen, Junyan Yang, Saisai Niu, Hang Yi, Hai-Bao Chen","doi":"10.1109/ICCE-Berlin50680.2020.9352200","DOIUrl":null,"url":null,"abstract":"Object detection and tracking are critical computer vision tasks because of the broad needs in society; however, deep neural network-based methods cost many computational resources that hinder them from real scene applications. Quantization is a widely adopted technique to reduce the storage space and memory footprint which makes deep learning models more energy-efficient and resource-friendly. Traditional network quantization methods directly quantize neural networks layer-wise, which means the parameters in different channels take the same quantization range. In this paper, we propose a low-bit learning method for convolutional neural network object detector quantization. Different from previous methods, we quantize the detector channel-wisely to avoid accuracy loss in the low-bit framework. We use progressive quantization, progressive batch normalization fusion, and cut the unnecessary long-tail weights and activations to reduce quantization loss. Moreover, based on the object detector and long short-term memory network (LSTM), we develop a high-performance tracking system. We leverage the tensor decomposition to compress weights in LSTM for getting a higher compression ratio. Experiments are conducted on public datasets and our infrared aerial dataset for object detection and tracking. The experimental results show that our approach obtains better performance compared with the state-of-the-art methods in terms of accuracy and compression ratio.","PeriodicalId":438631,"journal":{"name":"2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-Berlin50680.2020.9352200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Object detection and tracking are critical computer vision tasks because of the broad needs in society; however, deep neural network-based methods cost many computational resources that hinder them from real scene applications. Quantization is a widely adopted technique to reduce the storage space and memory footprint which makes deep learning models more energy-efficient and resource-friendly. Traditional network quantization methods directly quantize neural networks layer-wise, which means the parameters in different channels take the same quantization range. In this paper, we propose a low-bit learning method for convolutional neural network object detector quantization. Different from previous methods, we quantize the detector channel-wisely to avoid accuracy loss in the low-bit framework. We use progressive quantization, progressive batch normalization fusion, and cut the unnecessary long-tail weights and activations to reduce quantization loss. Moreover, based on the object detector and long short-term memory network (LSTM), we develop a high-performance tracking system. We leverage the tensor decomposition to compress weights in LSTM for getting a higher compression ratio. Experiments are conducted on public datasets and our infrared aerial dataset for object detection and tracking. The experimental results show that our approach obtains better performance compared with the state-of-the-art methods in terms of accuracy and compression ratio.

查看原文本刊更多论文

ABCD:一种基于信道量化和张量分解的紧凑目标检测器

由于社会的广泛需求，目标检测和跟踪是关键的计算机视觉任务;然而，基于深度神经网络的方法耗费了大量的计算资源，阻碍了它们在真实场景中的应用。量化是一种广泛采用的技术，可以减少存储空间和内存占用，使深度学习模型更加节能和资源友好。传统的网络量化方法是直接分层量化神经网络，即不同通道的参数采用相同的量化范围。本文提出了一种卷积神经网络目标检测器量化的低比特学习方法。与以往的方法不同，我们对检测器信道进行了明智的量化，避免了低比特框架下的精度损失。我们采用渐进式量化、渐进式批归一化融合，并去除不必要的长尾权重和激活来减少量化损失。此外，基于目标检测器和长短期记忆网络(LSTM)，我们开发了一个高性能的跟踪系统。我们利用张量分解来压缩LSTM中的权重，以获得更高的压缩比。在公共数据集和我们的红外航空数据集上进行了目标检测和跟踪实验。实验结果表明，该方法在准确率和压缩比方面都优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin)

自引率

0.00%

发文量