CHaPR: Efficient Inference of CNNs via Channel Pruning

2020 International Conference on Omni-layer Intelligent Systems (COINS) | Pub Date: 2019-08-08 | DOI: 10.1109/COINS49042.2020.9191636

Boyu Zhang, A. Davoodi, Y. Hu


Citations: 5

Abstract


CHaPR: Efficient Inference of CNNs via Channel Pruning

To deploy a CNN on resource-constrained edge platforms, channel pruning techniques promise a significant reduction in implementation costs, including memory, computation, and energy consumption, without requiring special hardware or software libraries. This paper proposes CHaPR, a novel technique that structurally prunes the redundant channels in a trained deep Convolutional Neural Network. CHaPR formulates pruning as a subset selection problem, which it solves using pivoted QR factorization. CHaPR also includes an additional pruning technique for ResNet-like architectures, which resolves an issue encountered by some existing channel pruning methods: not all layers can be pruned. Experimental results on VGG-16 and ResNet-50 models show 4.29X and 2.84X reductions in computation cost, respectively, while incurring 2.50% top-1 and 1.40% top-5 accuracy losses. Compared to many existing works, CHaPR performs better on an Overall Score metric that accounts for both computation and accuracy.
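The abstract names pivoted QR factorization as the solver for the subset selection step but does not give the matrix it is applied to. The following is a minimal sketch of the general idea only, not the authors' implementation. It assumes a matrix whose columns each hold one channel's responses and greedily keeps the `k` columns chosen by column pivoting. The function name `select_channels_pivoted_qr`, the matrix layout, and the toy data are all hypothetical.

```python
import numpy as np


def select_channels_pivoted_qr(features, k):
    """Pick k column indices of `features` by greedy column pivoting.

    features: (n_samples, n_channels) array. Treating each column as one
    channel's responses is an assumption of this sketch, not a detail
    taken from the paper.
    """
    residual = np.array(features, dtype=float)  # working copy
    n_channels = residual.shape[1]
    if not 0 < k <= n_channels:
        raise ValueError("k must be between 1 and the number of channels")

    selected = []
    for _ in range(k):
        norms = np.linalg.norm(residual, axis=0)
        if selected:
            norms[selected] = -1.0  # never re-pick a chosen column
        pivot = int(np.argmax(norms))  # column with the largest residual norm
        selected.append(pivot)
        if norms[pivot] <= 1e-12:
            continue  # remaining columns are numerically dependent
        q = residual[:, pivot] / norms[pivot]
        # One step of column-pivoted QR: remove the component along q
        # from every column, so redundant columns shrink toward zero.
        residual -= np.outer(q, q @ residual)
    return selected


# Toy data: column 1 is twice column 0, so only one of them is needed.
feats = np.array([[1.0, 2.0, 0.0],
                  [0.0, 0.0, 1.0],
                  [0.0, 0.0, 0.0]])
keep = select_channels_pivoted_qr(feats, 2)
print(keep)  # prints [1, 2]
```

In the toy data, column 0 is a scaled copy of column 1, so after column 1 is chosen its residual drops to zero and the independent column 2 is picked next. Pruning would then keep only the channels in `keep`. How CHaPR builds its matrix and sets the number of kept channels per layer is described in the paper, not here.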

