{"title":"High Efficient Compression: Model Compression Method Based on Channel Pruning and Knowledge Distillation","authors":"Junan Lin, Zekai Ye, Junhan Wang","doi":"10.1109/ACEDPI58926.2023.00058","DOIUrl":null,"url":null,"abstract":"With the development of neural network models in the field of deep learning image processing, more complex and higher performance neural network models can be used, but with it comes greater parameter quantities and computing power requirements. Although the model compression method has been updated in recent years, it still has problems such as inefficiency and reduced accuracy. Based on this, we propose a model compression method based on channel pruning and knowledge distillation, we sort and prune the L2 regularization weights of the convolutional layer filter containing a large number of calculations in the original model and prune it according to a certain proportion, and assist in restoring the accuracy by the knowledge distillation of the pruning model through the original model. Experimental results of the VGG16 model show that our pruning method reduced the parameters of the model by half and increased its processing speed to 2.7 times that of the original model, but the accuracy was reduced by about 10%. To solve this problem, we propose a knowledge distillation method to assist the training of the pruning model, so that the problem of model accuracy degradation has been improved and maintained at about 3.3%, to achieve a balanced state of model compression and accuracy reduction, thereby realizing the efficient compression of the model.","PeriodicalId":124469,"journal":{"name":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACEDPI58926.2023.00058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
With the development of neural network models for deep learning image processing, increasingly complex and higher-performance models have become available, but they bring larger parameter counts and greater computing-power requirements. Although model compression methods have advanced in recent years, they still suffer from problems such as inefficiency and reduced accuracy. To address this, we propose a model compression method based on channel pruning and knowledge distillation: we rank the filters of the computation-heavy convolutional layers in the original model by the L2 norm of their weights, prune them according to a fixed proportion, and then use knowledge distillation from the original model to help the pruned model recover its accuracy. Experimental results on the VGG16 model show that our pruning method halves the number of parameters and increases processing speed to 2.7 times that of the original model, but accuracy drops by about 10%. To solve this problem, we use knowledge distillation to assist the training of the pruned model, which reduces the accuracy degradation to about 3.3%, achieving a balance between model compression and accuracy loss and thereby realizing efficient compression of the model.
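To make the pruning criterion concrete, the following is a minimal sketch, in PyTorch, of ranking the filters (output channels) of a convolutional layer by the L2 norm of their weights and selecting a proportion to remove. The function name `rank_filters_by_l2` and the `prune_ratio` parameter are illustrative assumptions, not the paper's exact implementation.

```python
import torch
import torch.nn as nn

def rank_filters_by_l2(conv: nn.Conv2d, prune_ratio: float = 0.5):
    """Rank the filters of a conv layer by the L2 norm of their weights
    and split their indices into a keep set and a prune set.

    Illustrative sketch only; the proportion to prune is assumed to be a
    fixed hyperparameter, as described in the abstract.
    """
    # weight shape: (out_channels, in_channels, kH, kW)
    l2_norms = conv.weight.detach().flatten(1).norm(p=2, dim=1)
    n_prune = int(prune_ratio * conv.out_channels)
    order = torch.argsort(l2_norms)      # ascending: smallest norms first
    prune_idx = order[:n_prune]          # filters with the smallest L2 norm
    keep_idx = order[n_prune:]
    return keep_idx.sort().values, prune_idx.sort().values
```

In a full channel-pruning pipeline, the kept indices would then be used to build a slimmer layer (and to slice the input channels of the following layer) before fine-tuning.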
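The accuracy-recovery step can likewise be sketched as a standard Hinton-style distillation loss, where the original (unpruned) model acts as the teacher and the pruned model as the student. The `temperature` and `alpha` values below are assumed hyperparameters for illustration, not figures reported in the paper.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 4.0, alpha: float = 0.7):
    """Weighted sum of (a) KL divergence between temperature-softened
    teacher and student distributions and (b) cross-entropy on the
    ground-truth labels. Sketch of a generic distillation objective."""
    soft_teacher = F.log_softmax(teacher_logits / temperature, dim=1)
    soft_student = F.log_softmax(student_logits / temperature, dim=1)
    kd = F.kl_div(soft_student, soft_teacher, reduction="batchmean",
                  log_target=True) * (temperature ** 2)
    ce = F.cross_entropy(student_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce
```

During fine-tuning, the teacher runs in evaluation mode with gradients disabled, and only the pruned student's parameters are updated with this loss.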