Structured Directional Pruning via Perturbation Orthogonal Projection

IF 4.6 | SCI Zone 2 (Engineering & Technology) | Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Xiaofeng Liu;Qing Wang;Yunfeng Shao;Yanhui Geng;Yinchuan Li
DOI: 10.1109/TSP.2024.3501674
Journal: IEEE Transactions on Signal Processing, vol. 72, pp. 5439-5453
Published: 2024-11-19 (Journal Article)
Link: https://ieeexplore.ieee.org/document/10757364/
Citations: 0

Abstract

Despite the great potential of artificial intelligence (AI), which enables machines to mimic human intelligence in performing tasks, it requires a deep/extensive model with a sufficient number of parameters to enhance its expressive ability. This often hinders the application of AI on resource-constrained devices. Structured pruning is an effective compression technique that reduces the computation of neural networks. However, it typically achieves parameter reduction at the cost of non-negligible accuracy loss, necessitating fine-tuning. This paper introduces a novel technique called Structured Directional Pruning (SDP) and its fast solver, Alternating Structured Directional Pruning (AltSDP). SDP is a general, energy-efficient, coarse-grained pruning method that enables efficient model pruning without requiring fine-tuning or expert knowledge of the desired sparsity level. Theoretical analysis confirms that the fast solver, AltSDP, achieves SDP asymptotically after sufficient training. Experimental results validate that AltSDP reaches the same minimum valley as the vanilla optimizer, stochastic gradient descent (SGD), while maintaining a constant training loss. Additionally, AltSDP achieves state-of-the-art pruned accuracy by integrating pruning into the initial training process, without the need for fine-tuning. Consequently, the newly proposed SDP, along with its fast solver AltSDP, can significantly facilitate the shrinking of deep neural networks (DNNs) and enable the deployment of AI on resource-constrained devices.
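To make the core idea concrete, the following is a minimal, hypothetical sketch of structured (channel-level) pruning expressed as an orthogonal projection. It uses generic magnitude-based channel selection, not the paper's SDP/AltSDP algorithm, whose selection criterion and alternating solver are not reproduced here; it only illustrates that zeroing entire rows (output channels) of a weight matrix is an orthogonal projection onto a structured-sparse subspace.

```python
import numpy as np

def structured_prune_by_projection(W, keep_ratio=0.5):
    """Toy structured pruning: rank output channels (rows of W) by L2 norm,
    then orthogonally project W onto the subspace in which pruned rows are
    zero. Illustrative only; NOT the paper's SDP method."""
    norms = np.linalg.norm(W, axis=1)                 # per-channel magnitude
    k = max(1, int(round(keep_ratio * W.shape[0])))   # channels to keep
    keep = np.argsort(norms)[-k:]                     # strongest k channels
    mask = np.zeros(W.shape[0], dtype=bool)
    mask[keep] = True
    # Multiplying by the row mask is the orthogonal projection onto the
    # structured-sparse subspace (pruned rows identically zero).
    W_pruned = W * mask[:, None]
    return W_pruned, mask

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))
Wp, mask = structured_prune_by_projection(W, keep_ratio=0.5)

# Projection property: the discarded perturbation W - Wp is orthogonal to Wp.
assert abs(float(np.sum(Wp * (W - Wp)))) < 1e-12
assert mask.sum() == 2
```

Because whole rows are removed, the pruned matrix can be stored and multiplied at reduced cost on commodity hardware, which is what distinguishes coarse-grained (structured) pruning from unstructured weight sparsity.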
Source Journal
IEEE Transactions on Signal Processing (Engineering & Technology – Engineering: Electrical & Electronic)
CiteScore: 11.20
Self-citation rate: 9.30%
Articles per year: 310
Review time: 3.0 months
Journal description: The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term "signal" includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.