{"title":"按层组的重要程度进行滤波剪枝,以加速卷积神经网络","authors":"Yunseok Jang, Jaeseok Kim","doi":"10.35803/1694-5298.2021.3.358-365","DOIUrl":null,"url":null,"abstract":"Various acceleration approaches have been studied to deploy convolutional neural networks in embedded devices. Among them, filter pruning is the most active research because it is easy to implement in hardware and keeps high accuracy while reducing the computational and memory cost. In this paper, we propose a method of grouping layers, finding the importance of each group, and groupwise pruning according to the order of importance to achieve high FLOPs reduction while retaining high accuracy. First, we divide the layers of the pre-trained network into groups according to the size of the output feature map. Next, we calculate the importance score per group using first-order Taylor expansion. Finally, filter pruning is performed in order from the group with the highest importance score. When pruning VGG and ResNet trained on CIFAR-10, our proposed method shows superior performance in accuracy and FLOPs compared to the state-of-art methods. Notably, on ResNet-50, we achieve 70.85% FLOPs reduction by removing 50% of the filters, with a slight loss of 0.41% in the baseline accuracy.","PeriodicalId":22490,"journal":{"name":"The Herald of KSUCTA, №3, 2021","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FILTER PRUNING IN ORDER OF IMPORTANCE OF THE LAYER GROUPS TO ACCELERATE CONVOLUTIONAL NEURAL NETWORKS\",\"authors\":\"Yunseok Jang, Jaeseok Kim\",\"doi\":\"10.35803/1694-5298.2021.3.358-365\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Various acceleration approaches have been studied to deploy convolutional neural networks in embedded devices. Among them, filter pruning is the most active research because it is easy to implement in hardware and keeps high accuracy while reducing the computational and memory cost. In this paper, we propose a method of grouping layers, finding the importance of each group, and groupwise pruning according to the order of importance to achieve high FLOPs reduction while retaining high accuracy. First, we divide the layers of the pre-trained network into groups according to the size of the output feature map. Next, we calculate the importance score per group using first-order Taylor expansion. Finally, filter pruning is performed in order from the group with the highest importance score. When pruning VGG and ResNet trained on CIFAR-10, our proposed method shows superior performance in accuracy and FLOPs compared to the state-of-art methods. 
Notably, on ResNet-50, we achieve 70.85% FLOPs reduction by removing 50% of the filters, with a slight loss of 0.41% in the baseline accuracy.\",\"PeriodicalId\":22490,\"journal\":{\"name\":\"The Herald of KSUCTA, №3, 2021\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Herald of KSUCTA, №3, 2021\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35803/1694-5298.2021.3.358-365\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Herald of KSUCTA, №3, 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35803/1694-5298.2021.3.358-365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FILTER PRUNING IN ORDER OF IMPORTANCE OF THE LAYER GROUPS TO ACCELERATE CONVOLUTIONAL NEURAL NETWORKS
Various acceleration approaches have been studied to deploy convolutional neural networks on embedded devices. Among them, filter pruning is the most actively researched, because it is easy to implement in hardware and preserves high accuracy while reducing computational and memory costs. In this paper, we propose a method that groups the layers of a network, scores the importance of each group, and prunes the groups in order of importance, achieving a large reduction in FLOPs while retaining high accuracy. First, we divide the layers of the pre-trained network into groups according to the size of their output feature maps. Next, we compute an importance score for each group using a first-order Taylor expansion. Finally, filter pruning is performed group by group, starting from the group with the highest importance score. When pruning VGG and ResNet models trained on CIFAR-10, the proposed method outperforms state-of-the-art methods in both accuracy and FLOPs reduction. Notably, on ResNet-50, we achieve a 70.85% FLOPs reduction by removing 50% of the filters, with only a 0.41% drop from the baseline accuracy.
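To make the three steps concrete, the following is a minimal PyTorch-style sketch, not the authors' code: the function names (group_layers_by_feature_map, taylor_importance, rank_groups) are illustrative, and the importance score uses one common form of the first-order Taylor criterion, |w · dL/dw| summed per filter; the paper's exact scoring and pruning schedule may differ.

import torch
import torch.nn as nn

def group_layers_by_feature_map(model, sample_input):
    # Step 1: group Conv2d layers by the spatial size (H, W) of their output
    # feature map, recorded with forward hooks during a single dry-run pass.
    groups, hooks = {}, []

    def make_hook(name):
        def hook(module, inputs, output):
            groups.setdefault(tuple(output.shape[-2:]), []).append(name)
        return hook

    for name, module in model.named_modules():
        if isinstance(module, nn.Conv2d):
            hooks.append(module.register_forward_hook(make_hook(name)))
    with torch.no_grad():
        model(sample_input)
    for h in hooks:
        h.remove()
    return groups  # {(H, W): [conv layer names]}

def taylor_importance(conv):
    # Step 2: first-order Taylor score per filter, |w * dL/dw| summed over
    # each filter's weights. Requires conv.weight.grad, i.e. loss.backward()
    # must have been called on a mini-batch beforehand.
    return (conv.weight * conv.weight.grad).abs().sum(dim=(1, 2, 3))

def rank_groups(model, groups):
    # Step 3: order the groups by aggregate importance; pruning then proceeds
    # group by group, starting from the highest-scoring group, as described
    # in the abstract.
    convs = dict(model.named_modules())
    def group_score(names):
        return sum(taylor_importance(convs[n]).sum().item() for n in names)
    return sorted(groups.items(), key=lambda kv: group_score(kv[1]), reverse=True)

For example, after running one mini-batch through the model and calling loss.backward(), group_layers_by_feature_map(model, x) followed by rank_groups(model, groups) yields the pruning order; the actual filter removal within each group (e.g. zeroing or slicing away the lowest-scoring filters) is omitted from this sketch.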