Improve Convolutional Neural Network Pruning by Maximizing Filter Variety

Nathan Hubens, M. Mancas, B. Gosselin, Marius Preda, T. Zaharia
{"title":"Improve Convolutional Neural Network Pruning by Maximizing Filter Variety","authors":"Nathan Hubens, M. Mancas, B. Gosselin, Marius Preda, T. Zaharia","doi":"10.48550/arXiv.2203.05807","DOIUrl":null,"url":null,"abstract":"Neural network pruning is a widely used strategy for reducing model storage and computing requirements. It allows to lower the complexity of the network by introducing sparsity in the weights. Because taking advantage of sparse matrices is still challenging, pruning is often performed in a structured way, i.e. removing entire convolution filters in the case of ConvNets, according to a chosen pruning criteria. Common pruning criteria, such as l1-norm or movement, usually do not consider the individual utility of filters, which may lead to: (1) the removal of filters exhibiting rare, thus important and discriminative behaviour, and (2) the retaining of filters with redundant information. In this paper, we present a technique solving those two issues, and which can be appended to any pruning criteria. This technique ensures that the criteria of selection focuses on redundant filters, while retaining the rare ones, thus maximizing the variety of remaining filters. The experimental results, carried out on different datasets (CIFAR-10, CIFAR-100 and CALTECH-101) and using different architectures (VGG-16 and ResNet-18) demonstrate that it is possible to achieve similar sparsity levels while maintaining a higher performance when appending our filter selection technique to pruning criteria. Moreover, we assess the quality of the found sparse sub-networks by applying the Lottery Ticket Hypothesis and find that the addition of our method allows to discover better performing tickets in most cases","PeriodicalId":74527,"journal":{"name":"Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing","volume":"73 1","pages":"379-390"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Conference on Image Analysis and Processing. International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2203.05807","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Neural network pruning is a widely used strategy for reducing model storage and computing requirements. It lowers the complexity of the network by introducing sparsity in the weights. Because taking advantage of sparse matrices is still challenging, pruning is often performed in a structured way, i.e. removing entire convolution filters in the case of ConvNets, according to a chosen pruning criterion. Common pruning criteria, such as the l1-norm or movement, usually do not consider the individual utility of filters, which may lead to: (1) the removal of filters exhibiting rare, and thus important and discriminative, behaviour, and (2) the retention of filters carrying redundant information. In this paper, we present a technique that solves both issues and can be appended to any pruning criterion. This technique ensures that the selection criterion focuses on redundant filters while retaining rare ones, thus maximizing the variety of the remaining filters. Experiments carried out on different datasets (CIFAR-10, CIFAR-100 and CALTECH-101) and different architectures (VGG-16 and ResNet-18) demonstrate that appending our filter selection technique to a pruning criterion achieves similar sparsity levels while maintaining higher performance. Moreover, we assess the quality of the discovered sparse sub-networks by applying the Lottery Ticket Hypothesis, and find that adding our method discovers better-performing tickets in most cases.
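The abstract only sketches the idea in prose. As a rough illustration, here is a minimal PyTorch sketch, not the authors' implementation, of structured l1-norm filter pruning extended with a redundancy score based on pairwise cosine similarity between filters: low-utility, high-redundancy filters are pruned first, so rare filters survive. The function names and the rule combining utility and redundancy are illustrative assumptions.

```python
# Minimal sketch (assumed names and scoring rule, not the paper's code)
# of variety-aware structured filter pruning for a Conv2d layer.
import torch
import torch.nn as nn
import torch.nn.functional as F

def l1_filter_scores(conv: nn.Conv2d) -> torch.Tensor:
    """Per-filter l1-norm: a common structured pruning criterion."""
    # weight shape: (out_channels, in_channels, kH, kW)
    return conv.weight.detach().abs().sum(dim=(1, 2, 3))

def redundancy_scores(conv: nn.Conv2d) -> torch.Tensor:
    """Mean pairwise cosine similarity of each filter to all others.
    High values flag redundant filters; low values flag rare ones."""
    w = conv.weight.detach().flatten(start_dim=1)  # (C_out, D)
    w = F.normalize(w, dim=1)                      # unit-norm rows
    sim = w @ w.t()                                # (C_out, C_out)
    sim.fill_diagonal_(0.0)                        # ignore self-similarity
    return sim.sum(dim=1) / (w.size(0) - 1)

def filters_to_prune(conv: nn.Conv2d, sparsity: float) -> torch.Tensor:
    """Return indices of filters to remove: prune low-utility,
    high-redundancy filters first, preserving rare ones."""
    utility = l1_filter_scores(conv)
    redundancy = redundancy_scores(conv)
    # One illustrative combination: discount utility by redundancy,
    # so redundant filters get the lowest scores and are dropped first.
    score = utility * (1.0 - redundancy)
    n_prune = int(sparsity * conv.out_channels)
    return torch.argsort(score)[:n_prune]

conv = nn.Conv2d(3, 64, kernel_size=3)
print(filters_to_prune(conv, sparsity=0.5))
```

With a plain l1-norm criterion, two near-duplicate filters with large norms would both be kept; the redundancy discount instead pushes one of them toward removal while protecting a dissimilar, rare filter even if its norm is small.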