基于输入通道显著卷积核的DCNN滤波器可控自适应剪枝

IF 2.9 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing Pub Date : 2025-03-18 DOI:10.1016/j.dsp.2025.105161

Jiajian Cai , Suyun Lian , Yang Zhao , Jihong Zhu , Jihong Pei

{"title":"基于输入通道显著卷积核的DCNN滤波器可控自适应剪枝","authors":"Jiajian Cai , Suyun Lian , Yang Zhao , Jihong Zhu , Jihong Pei","doi":"10.1016/j.dsp.2025.105161","DOIUrl":null,"url":null,"abstract":"<div><div>With the development of the Internet of Things (IoT) and autonomous driving, many edge devices need to deploy deep network models to improve their performance. However, deep neural network models typically involve a substantial number of parameters and computational demands. Filter pruning can substantially compress and accelerate the deep network models, transforming them into compact and lightweight versions, and facilitating more efficient deployment on edge devices. In this paper, a controlled adaptive pruning method based on significant convolutional kernels of input channels called CAPSCK is proposed for DCNN filters. First, the significance of each group of input channel convolutional kernels is evaluated. The significant convolutional kernels (SCK) are used to measure filter importance, with interference caused by insignificant convolutional kernels minimized during evaluation. Second, a controllable adaptive pruning (CAP) evaluation is constructed. This assesses pruning sensitivity by utilizing the baseline pruning rate for each layer, which is determined based on filter importance. It reduces the subjectivity caused by manually setting the pruning rate. Experiments of pruning on multiple datasets and different networks show that CAPSCK can effectively compress and accelerate network models. For example, on CIFAR-10 with VGG16, CAPSCK achieves a compression ratio of 12.58× and an acceleration ratio of 4.14×, with only 0.16% drop in accuracy. The compression and acceleration performance on various datasets and networks surpasses several state-of-the-art methods.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"162 ","pages":"Article 105161"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Controllable adaptive pruning for DCNN filters based on significant convolution kernel of input channels\",\"authors\":\"Jiajian Cai , Suyun Lian , Yang Zhao , Jihong Zhu , Jihong Pei\",\"doi\":\"10.1016/j.dsp.2025.105161\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the development of the Internet of Things (IoT) and autonomous driving, many edge devices need to deploy deep network models to improve their performance. However, deep neural network models typically involve a substantial number of parameters and computational demands. Filter pruning can substantially compress and accelerate the deep network models, transforming them into compact and lightweight versions, and facilitating more efficient deployment on edge devices. In this paper, a controlled adaptive pruning method based on significant convolutional kernels of input channels called CAPSCK is proposed for DCNN filters. First, the significance of each group of input channel convolutional kernels is evaluated. The significant convolutional kernels (SCK) are used to measure filter importance, with interference caused by insignificant convolutional kernels minimized during evaluation. Second, a controllable adaptive pruning (CAP) evaluation is constructed. This assesses pruning sensitivity by utilizing the baseline pruning rate for each layer, which is determined based on filter importance. It reduces the subjectivity caused by manually setting the pruning rate. Experiments of pruning on multiple datasets and different networks show that CAPSCK can effectively compress and accelerate network models. For example, on CIFAR-10 with VGG16, CAPSCK achieves a compression ratio of 12.58× and an acceleration ratio of 4.14×, with only 0.16% drop in accuracy. The compression and acceleration performance on various datasets and networks surpasses several state-of-the-art methods.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"162 \",\"pages\":\"Article 105161\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425001836\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425001836","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

随着物联网（IoT）和自动驾驶的发展，许多边缘设备需要部署深度网络模型来提高其性能。然而，深度神经网络模型通常涉及大量的参数和计算需求。过滤器修剪可以大大压缩和加速深度网络模型，将其转换为紧凑和轻量级的版本，并促进在边缘设备上更有效的部署。针对DCNN滤波器，提出了一种基于输入通道显著卷积核的可控自适应剪枝方法CAPSCK。首先，评估每组输入通道卷积核的重要性。采用显著卷积核（SCK）来衡量滤波器的重要性，在评估过程中最小化不显著卷积核造成的干扰。其次，构造了一个可控的自适应剪枝（CAP）评价。它通过利用每层的基线剪枝率来评估剪枝敏感性，这是根据过滤器的重要性来确定的。减少了人为设置剪枝速率的主观性。在多个数据集和不同网络上的剪枝实验表明，CAPSCK可以有效地压缩和加速网络模型。例如，在带有VGG16的CIFAR-10上，CAPSCK实现了12.58 x的压缩比和4.14 x的加速比，精度仅下降了0.16%。在各种数据集和网络上的压缩和加速性能超过了几种最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Controllable adaptive pruning for DCNN filters based on significant convolution kernel of input channels

With the development of the Internet of Things (IoT) and autonomous driving, many edge devices need to deploy deep network models to improve their performance. However, deep neural network models typically involve a substantial number of parameters and computational demands. Filter pruning can substantially compress and accelerate the deep network models, transforming them into compact and lightweight versions, and facilitating more efficient deployment on edge devices. In this paper, a controlled adaptive pruning method based on significant convolutional kernels of input channels called CAPSCK is proposed for DCNN filters. First, the significance of each group of input channel convolutional kernels is evaluated. The significant convolutional kernels (SCK) are used to measure filter importance, with interference caused by insignificant convolutional kernels minimized during evaluation. Second, a controllable adaptive pruning (CAP) evaluation is constructed. This assesses pruning sensitivity by utilizing the baseline pruning rate for each layer, which is determined based on filter importance. It reduces the subjectivity caused by manually setting the pruning rate. Experiments of pruning on multiple datasets and different networks show that CAPSCK can effectively compress and accelerate network models. For example, on CIFAR-10 with VGG16, CAPSCK achieves a compression ratio of 12.58× and an acceleration ratio of 4.14×, with only 0.16% drop in accuracy. The compression and acceleration performance on various datasets and networks surpasses several state-of-the-art methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Digital Signal Processing 工程技术-工程：电子与电气

CiteScore

5.30

自引率

17.20%

发文量

435

审稿时长

66 days

期刊介绍： Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,