{"title":"ABCD: A Compact Object Detector Based on Channel Quantization and Tensor Decomposition","authors":"Bingyi Zhang, Peining Zhen, Junyan Yang, Saisai Niu, Hang Yi, Hai-Bao Chen","doi":"10.1109/ICCE-Berlin50680.2020.9352200","DOIUrl":null,"url":null,"abstract":"Object detection and tracking are critical computer vision tasks because of the broad needs in society; however, deep neural network-based methods cost many computational resources that hinder them from real scene applications. Quantization is a widely adopted technique to reduce the storage space and memory footprint which makes deep learning models more energy-efficient and resource-friendly. Traditional network quantization methods directly quantize neural networks layer-wise, which means the parameters in different channels take the same quantization range. In this paper, we propose a low-bit learning method for convolutional neural network object detector quantization. Different from previous methods, we quantize the detector channel-wisely to avoid accuracy loss in the low-bit framework. We use progressive quantization, progressive batch normalization fusion, and cut the unnecessary long-tail weights and activations to reduce quantization loss. Moreover, based on the object detector and long short-term memory network (LSTM), we develop a high-performance tracking system. We leverage the tensor decomposition to compress weights in LSTM for getting a higher compression ratio. Experiments are conducted on public datasets and our infrared aerial dataset for object detection and tracking. The experimental results show that our approach obtains better performance compared with the state-of-the-art methods in terms of accuracy and compression ratio.","PeriodicalId":438631,"journal":{"name":"2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 10th International Conference on Consumer Electronics (ICCE-Berlin)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-Berlin50680.2020.9352200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Object detection and tracking are critical computer vision tasks because of the broad needs in society; however, deep neural network-based methods cost many computational resources that hinder them from real scene applications. Quantization is a widely adopted technique to reduce the storage space and memory footprint which makes deep learning models more energy-efficient and resource-friendly. Traditional network quantization methods directly quantize neural networks layer-wise, which means the parameters in different channels take the same quantization range. In this paper, we propose a low-bit learning method for convolutional neural network object detector quantization. Different from previous methods, we quantize the detector channel-wisely to avoid accuracy loss in the low-bit framework. We use progressive quantization, progressive batch normalization fusion, and cut the unnecessary long-tail weights and activations to reduce quantization loss. Moreover, based on the object detector and long short-term memory network (LSTM), we develop a high-performance tracking system. We leverage the tensor decomposition to compress weights in LSTM for getting a higher compression ratio. Experiments are conducted on public datasets and our infrared aerial dataset for object detection and tracking. The experimental results show that our approach obtains better performance compared with the state-of-the-art methods in terms of accuracy and compression ratio.