Sparse loss-aware ternarization for neural networks

IF 8.1 1区计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Sciences Pub Date : 2024-11-17 DOI:10.1016/j.ins.2024.121668

Ruizhi Zhou , Lingfeng Niu , Dachuan Xu

{"title":"Sparse loss-aware ternarization for neural networks","authors":"Ruizhi Zhou , Lingfeng Niu , Dachuan Xu","doi":"10.1016/j.ins.2024.121668","DOIUrl":null,"url":null,"abstract":"<div><div>Deep neural networks (DNNs) have shown great success in machine learning tasks and widely used in many fields. However, the substantial computational and storage requirements inherent to DNNs are usually high, which poses challenges for deploying deep learning models on resource-limited devices and hindering further applications. To address this issue, the lightweight nature of neural networks has garnered significant attention, and quantization has become one of the most popular approaches to compress DNNs. In this paper, we introduce a sparse loss-aware ternarization (SLT) model for training ternary neural networks, which encodes the floating-point parameters into <span><math><mo>{</mo><mo>−</mo><mn>1</mn><mo>,</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo>}</mo></math></span>. Specifically, we abstract the ternarization process as an optimization problem with discrete constraints, and then modify it by applying sparse regularization to identify insignificant weights. To deal with the challenges brought by the discreteness of the model, we decouple discrete constraints from the objective function and design a new algorithm based on the Alternating Direction Method of Multipliers (ADMM). Extensive experiments are conducted on public datasets with popular network architectures. Comparisons with several state-of-the-art baselines demonstrate that SLT always attains comparable accuracy while having better compression performance.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"693 ","pages":"Article 121668"},"PeriodicalIF":8.1000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025524015822","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Deep neural networks (DNNs) have shown great success in machine learning tasks and widely used in many fields. However, the substantial computational and storage requirements inherent to DNNs are usually high, which poses challenges for deploying deep learning models on resource-limited devices and hindering further applications. To address this issue, the lightweight nature of neural networks has garnered significant attention, and quantization has become one of the most popular approaches to compress DNNs. In this paper, we introduce a sparse loss-aware ternarization (SLT) model for training ternary neural networks, which encodes the floating-point parameters into

{- 1, 0, 1}

. Specifically, we abstract the ternarization process as an optimization problem with discrete constraints, and then modify it by applying sparse regularization to identify insignificant weights. To deal with the challenges brought by the discreteness of the model, we decouple discrete constraints from the objective function and design a new algorithm based on the Alternating Direction Method of Multipliers (ADMM). Extensive experiments are conducted on public datasets with popular network architectures. Comparisons with several state-of-the-art baselines demonstrate that SLT always attains comparable accuracy while having better compression performance.

查看原文本刊更多论文

神经网络的稀疏损失感知三元化

深度神经网络（DNN）在机器学习任务中取得了巨大成功，并广泛应用于许多领域。然而，DNN 固有的大量计算和存储要求通常很高，这给在资源有限的设备上部署深度学习模型带来了挑战，并阻碍了进一步的应用。为解决这一问题，神经网络的轻量级特性受到了广泛关注，量化已成为压缩 DNN 的最常用方法之一。本文介绍了一种用于训练三元神经网络的稀疏损失感知三元化（SLT）模型，它将浮点参数编码为 {-1,0,1}。具体来说，我们将三元化过程抽象为一个具有离散约束的优化问题，然后通过应用稀疏正则化来识别不重要的权重，从而对其进行修改。为了应对模型离散性带来的挑战，我们将离散约束与目标函数解耦，并设计了一种基于交替方向乘法（ADMM）的新算法。我们在采用流行网络架构的公共数据集上进行了广泛的实验。与几种最先进的基线算法进行比较后发现，SLT 算法总能达到相当的精度，同时具有更好的压缩性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Sciences 工程技术-计算机：信息系统

CiteScore

14.00

自引率

17.30%

发文量

1322

审稿时长

10.4 months

期刊介绍： Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.