{"title":"Work-in-Progress: A SIMD-Aware Pruning Technique for Convolutional Neural Networks with Multi-Sparsity Levels","authors":"Jeonggyu Jang, Kyusik Choi, Hoeseok Yang","doi":"10.1145/3349567.3351718","DOIUrl":null,"url":null,"abstract":"Designs of electronic systems often require considering multiple design concerns. In this paper, we propose a novel multi-phase pruning technique for convolutional neural networks (CNNs) that is capable of efficient exploration of multiple design objectives and constraints. To truly take advantage of the sparsity obtained by pruning, we present two different levels of pruning granularity, fine- and coarse-grain, and show how they are combined in the design space exploration. In particular, we propose to take the SIMD architecture into account in the fine-grain pruning. By iteratively pruning to a single CNN, multiple candidates can be obtained from the trade-off between the given design concerns. Experiments with existing CNNs verify that the proposed technique enables more efficient design space exploration over the accuracy-speed trade-off.","PeriodicalId":194982,"journal":{"name":"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","volume":"18 22","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3349567.3351718","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Designs of electronic systems often require considering multiple design concerns. In this paper, we propose a novel multi-phase pruning technique for convolutional neural networks (CNNs) that is capable of efficient exploration of multiple design objectives and constraints. To truly take advantage of the sparsity obtained by pruning, we present two different levels of pruning granularity, fine- and coarse-grain, and show how they are combined in the design space exploration. In particular, we propose to take the SIMD architecture into account in the fine-grain pruning. By iteratively pruning to a single CNN, multiple candidates can be obtained from the trade-off between the given design concerns. Experiments with existing CNNs verify that the proposed technique enables more efficient design space exploration over the accuracy-speed trade-off.