{"title":"A Summary of convolution Neural Network Compression and Acceleration Technology","authors":"Bingzhen Li, Wenzhi Jiang, Jiaojiao Gu, Ke Liu","doi":"10.1109/ICHCI51889.2020.00065","DOIUrl":null,"url":null,"abstract":"Although convolution neural network has achieved remarkable results in different application scenarios, there are a large number of parameters and computation in its structure, which limit its development in mobile and embedded devices. How to reduce parameters, compress model and optimize structure to improve network performance without losing accuracy has become a hot issue of convolution neural network. This paper summarizes and summarizes the convolution neural network structure optimization technology from five aspects: granularity pruning, weight quantization sharing, knowledge distillation, tensor decomposition and fine network design, and analyzes the technical core of it. Their advantages and disadvantages, applicable scenarios and optimization results are analyzed and summarized respectively, and the future research direction is prospected.","PeriodicalId":355427,"journal":{"name":"2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICHCI51889.2020.00065","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
Although convolutional neural networks have achieved remarkable results across different application scenarios, their structures involve large numbers of parameters and heavy computation, which limits their deployment on mobile and embedded devices. How to reduce parameters, compress models, and optimize structures to improve network performance without losing accuracy has become a hot topic in convolutional neural network research. This paper surveys convolutional neural network structure optimization techniques from five aspects: granularity pruning, weight quantization and sharing, knowledge distillation, tensor decomposition, and compact network design, and analyzes the technical core of each. Their advantages and disadvantages, applicable scenarios, and optimization results are analyzed and summarized, and future research directions are discussed.
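To make the first of the five surveyed aspects concrete, the sketch below illustrates magnitude-based weight pruning, a common form of granularity pruning. It is not taken from the paper; the function name, the sparsity parameter, and the kernel shapes are illustrative assumptions, and the sketch only shows the basic idea of zeroing out small-magnitude weights.

```python
# Illustrative sketch (not from the paper): magnitude-based weight pruning.
# Weights whose absolute value falls below a chosen percentile are zeroed out;
# the resulting sparse tensor can then be stored and executed more cheaply.
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float = 0.5) -> np.ndarray:
    """Zero out the smallest-magnitude fraction `sparsity` of the weights."""
    threshold = np.percentile(np.abs(weights), sparsity * 100)
    mask = np.abs(weights) >= threshold
    return weights * mask

# Example: prune a random bank of 3x3 conv kernels (64 filters, 32 input channels).
kernels = np.random.randn(64, 32, 3, 3)
pruned = magnitude_prune(kernels, sparsity=0.7)
print(f"Nonzero weights: {np.count_nonzero(pruned)} / {pruned.size}")
```

In practice, pruning of this kind is usually followed by fine-tuning to recover any accuracy lost when the weights are removed.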