Learning-based Tensor Decomposition with Adaptive Rank Penalty for CNNs Compression

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR) Pub Date : 2021-09-01 DOI:10.1109/MIPR51284.2021.00014

Deli Yu, Peipei Yang, Cheng-Lin Liu

{"title":"Learning-based Tensor Decomposition with Adaptive Rank Penalty for CNNs Compression","authors":"Deli Yu, Peipei Yang, Cheng-Lin Liu","doi":"10.1109/MIPR51284.2021.00014","DOIUrl":null,"url":null,"abstract":"Low-rank tensor decomposition is a widely-used strategy to compress convolutional neural networks (CNNs). Existing learning-based decomposition methods encourage low-rank filter weights via regularizer of filters’ pair-wise force or nuclear norm during training. However, these methods can not obtain the satisfactory low-rank structure. We propose a new method with an adaptive rank penalty to learn more compact CNNs. Specifically, we transform rank constraint into a differentiable one and impose its adaptive violation-aware penalty on filters. Moreover, this paper is the first work to integrate the learning-based decomposition and group decomposition to make a better trade-off, especially for the tough task of compression of 1×1 convolution.The obtained low-rank model can be easily decomposed while nearly keeping the full accuracy without additional fine-tuning process. The effectiveness is verified by compression experiments of VGG and ResNet on CIFAR-10 and ILSVRC-2012. Our method can reduce about 65% parameters of ResNet-110 with 0.04% Top-1 accuracy drop on CIFAR-10, and reduce about 60% parameters of ResNet-50 with 0.57% Top-1 accuracy drop on ILSVRC-2012.","PeriodicalId":139543,"journal":{"name":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR51284.2021.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Low-rank tensor decomposition is a widely-used strategy to compress convolutional neural networks (CNNs). Existing learning-based decomposition methods encourage low-rank filter weights via regularizer of filters’ pair-wise force or nuclear norm during training. However, these methods can not obtain the satisfactory low-rank structure. We propose a new method with an adaptive rank penalty to learn more compact CNNs. Specifically, we transform rank constraint into a differentiable one and impose its adaptive violation-aware penalty on filters. Moreover, this paper is the first work to integrate the learning-based decomposition and group decomposition to make a better trade-off, especially for the tough task of compression of 1×1 convolution.The obtained low-rank model can be easily decomposed while nearly keeping the full accuracy without additional fine-tuning process. The effectiveness is verified by compression experiments of VGG and ResNet on CIFAR-10 and ILSVRC-2012. Our method can reduce about 65% parameters of ResNet-110 with 0.04% Top-1 accuracy drop on CIFAR-10, and reduce about 60% parameters of ResNet-50 with 0.57% Top-1 accuracy drop on ILSVRC-2012.

查看原文本刊更多论文

基于自适应秩惩罚的学习张量分解在cnn压缩中的应用

低秩张量分解是卷积神经网络压缩的一种常用策略。现有的基于学习的分解方法在训练过程中通过滤波器成对力或核范数的正则化器来鼓励低秩滤波器权重。然而，这些方法都不能得到满意的低阶结构。我们提出了一种新的自适应秩惩罚方法来学习更紧凑的cnn。具体而言，我们将秩约束转化为可微约束，并对过滤器施加其自适应违例感知惩罚。此外，本文是第一个将基于学习的分解和分组分解相结合的工作，以更好地权衡，特别是对于1×1卷积压缩的艰巨任务。所得到的低秩模型可以很容易地分解，而无需额外的微调过程，几乎可以保持全部精度。在CIFAR-10和ILSVRC-2012上进行了VGG和ResNet压缩实验，验证了其有效性。该方法在CIFAR-10上可减少约65%的ResNet-110参数，Top-1精度下降0.04%;在ILSVRC-2012上可减少约60%的ResNet-50参数，Top-1精度下降0.57%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 4th International Conference on Multimedia Information Processing and Retrieval (MIPR)

自引率

0.00%

发文量