Quantized Convolutional Neural Networks Robustness under Perturbation.

Journal: F1000Research (Q2, Pharmacology, Toxicology and Pharmaceutics)
Publication date: 2025-04-09 (eCollection date: 2025-01-01) | DOI: 10.12688/f1000research.163144.1
Jack Langille, Issam Hammad, Guy Kember
{"title":"Quantized Convolutional Neural Networks Robustness under Perturbation.","authors":"Jack Langille, Issam Hammad, Guy Kember","doi":"10.12688/f1000research.163144.1","DOIUrl":null,"url":null,"abstract":"<p><p>Contemporary machine learning models are increasingly becoming restricted by size and subsequent operations per forward pass, demanding increasing compute requirements. Quantization has emerged as a convenient approach to addressing this, in which weights and activations are mapped from their conventionally used floating-point 32-bit numeric representations to lower precision integers. This process introduces significant reductions in inference time and simplifies the hardware requirements. It is a well-studied result that the performance of such reduced precision models is congruent with their floating-point counterparts. However, there is a lack of literature that addresses the performance of quantized models in a perturbed input space, as is common when stress testing regular full-precision models, particularly for real-world deployments. We focus on addressing this gap in the context of 8-bit quantized convolutional neural networks (CNNs). We study three state-of-the-art CNNs: ResNet-18, VGG-16, and SqueezeNet1_1, and subject their floating point and fixed point forms to various noise regimes with varying intensities. We characterize performance in terms of traditional metrics, including top-1 and top-5 accuracy, as well as the F1 score. We also introduce a new metric, the Kullback-Liebler divergence of the two output distributions for a given floating-point/fixed-point model pair, as a means to examine how the model's output distribution has changed as a result of quantization, which, we contend, can be interpreted as a proxy for model similarity in decision making. We find that across all three models and under each perturbation scheme, the relative error between the quantized and full-precision model was consistently low. We also find that Kullback-Liebler divergence was on the same order of magnitude as the unperturbed tests across all perturbation regimes except Brownian noise, where significant divergences were observed for VGG-16 and SqueezeNet1_1.</p>","PeriodicalId":12260,"journal":{"name":"F1000Research","volume":"14 ","pages":"419"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12041843/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"F1000Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12688/f1000research.163144.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
Citations: 0

Abstract

Contemporary machine learning models are increasingly constrained by their size and by the number of operations required per forward pass, driving ever-higher compute requirements. Quantization has emerged as a convenient way to address this: weights and activations are mapped from their conventional 32-bit floating-point representations to lower-precision integers. This substantially reduces inference time and simplifies hardware requirements. It is a well-studied result that the performance of such reduced-precision models closely matches that of their floating-point counterparts. However, little of the literature addresses the performance of quantized models in a perturbed input space, as is common when stress-testing ordinary full-precision models, particularly for real-world deployments. We address this gap in the context of 8-bit quantized convolutional neural networks (CNNs). We study three state-of-the-art CNNs: ResNet-18, VGG-16, and SqueezeNet1_1, and subject their floating-point and fixed-point forms to various noise regimes at varying intensities. We characterize performance in terms of traditional metrics, including top-1 and top-5 accuracy as well as the F1 score. We also introduce a new metric, the Kullback-Leibler divergence between the two output distributions of a given floating-point/fixed-point model pair, as a means of examining how the model's output distribution changes as a result of quantization; we contend this can be interpreted as a proxy for similarity in decision making. We find that across all three models and under each perturbation scheme, the relative error between the quantized and full-precision models was consistently low. We also find that the Kullback-Leibler divergence was on the same order of magnitude as in the unperturbed tests across all perturbation regimes except Brownian noise, where significant divergences were observed for VGG-16 and SqueezeNet1_1.
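The abstract's description of 8-bit quantization, mapping FP32 weights and activations to integers, can be made concrete with a short sketch. The paper does not specify its quantization toolchain, so the per-tensor affine scheme below, the `quantize_int8`/`dequantize` helper names, and the calibration from a single tensor's min/max range are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Per-tensor affine quantization of a float32 array to int8.

    Illustrative only: production toolchains (e.g. PyTorch post-training
    quantization) typically calibrate scale/zero-point per tensor or per
    channel from representative data, not from one array's raw range.
    """
    qmin, qmax = -128, 127
    scale = (x.max() - x.min()) / (qmax - qmin)      # float step per integer level
    zero_point = int(round(qmin - x.min() / scale))  # integer representing 0.0
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.int8)
    return q, scale, zero_point

def dequantize(q: np.ndarray, scale: float, zero_point: int) -> np.ndarray:
    """Map int8 values back to float32 to compare against the FP32 original."""
    return scale * (q.astype(np.float32) - zero_point)

weights = np.random.randn(64, 3, 3, 3).astype(np.float32)  # a conv kernel, say
q, s, z = quantize_int8(weights)
max_err = np.abs(weights - dequantize(q, s, z)).max()  # error on the order of s
```

The per-value rounding error is on the order of the scale, which is consistent with the well-studied result the abstract cites: on clean inputs, 8-bit models track their FP32 counterparts closely.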
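The study singles out Brownian noise as the regime that drives the divergence up for VGG-16 and SqueezeNet1_1, but the abstract does not define its noise constructions. The sketch below uses one common recipe for Brownian (red) noise, a random walk obtained by integrating white Gaussian noise, alongside plain Gaussian noise for contrast; the `sigma` intensity parameter and the row-wise integration axis are assumptions, and the paper's exact scheme may differ.

```python
import numpy as np

rng = np.random.default_rng(0)

def gaussian_noise(shape, sigma: float) -> np.ndarray:
    """Plain white Gaussian perturbation at intensity sigma."""
    return sigma * rng.normal(size=shape)

def brownian_noise(shape, sigma: float) -> np.ndarray:
    """Brownian (red) noise as a cumulative sum of white Gaussian steps
    along the last axis, standardized and scaled. One plausible
    construction, assumed here for illustration."""
    walk = np.cumsum(rng.normal(size=shape), axis=-1)   # 1/f^2-type spectrum
    walk = (walk - walk.mean()) / (walk.std() + 1e-12)  # zero mean, unit std
    return sigma * walk

image = rng.random((3, 224, 224)).astype(np.float32)    # stand-in input in [0, 1]
perturbed = np.clip(image + brownian_noise(image.shape, sigma=0.1), 0.0, 1.0)
```

Because a random walk concentrates its energy at low frequencies, Brownian noise produces large spatially correlated shifts rather than pixel-wise speckle, which plausibly explains why it stresses the quantized and full-precision models differently from white noise.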
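The proposed similarity metric is the Kullback-Leibler divergence D_KL(P || Q) = Σ_i p_i log(p_i / q_i) between the softmax output P of the floating-point model and Q of its quantized counterpart on the same input. A minimal sketch, with hypothetical `logits_fp`/`logits_q` standing in for the two models' outputs:

```python
import numpy as np

def softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable softmax over class logits."""
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()

def kl_divergence(p: np.ndarray, q: np.ndarray, eps: float = 1e-12) -> float:
    """D_KL(P || Q) = sum_i p_i * log(p_i / q_i), in nats.

    eps guards against log(0) when one model assigns a class
    (near-)zero probability.
    """
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.sum(p * np.log(p / q)))

# Hypothetical class logits from the FP32 and int8 models on one image.
logits_fp = np.array([2.0, 0.5, -1.0])
logits_q = np.array([1.9, 0.6, -1.1])
d = kl_divergence(softmax(logits_fp), softmax(logits_q))  # ~0 when outputs agree
```

A divergence near zero means quantization barely moved the output distribution; order-of-magnitude jumps, as reported under Brownian noise, indicate the two models are no longer making decisions in the same way.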

Source journal: F1000Research
Subject area: Pharmacology, Toxicology and Pharmaceutics (all)
CiteScore: 5.00
Self-citation rate: 0.00%
Articles per year: 1646
Review time: 1 week
Journal description: F1000Research publishes articles and other research outputs reporting basic scientific, scholarly, translational and clinical research across the physical and life sciences, engineering, medicine, social sciences and humanities. F1000Research is a scholarly publication platform set up for the scientific, scholarly and medical research community; each article has at least one author who is a qualified researcher, scholar or clinician actively working in their speciality and who has made a key contribution to the article. Articles must be original (not duplications). All research is suitable irrespective of the perceived level of interest or novelty; we welcome confirmatory and negative results, as well as null studies. F1000Research publishes different types of research, including clinical trials, systematic reviews, software tools, method articles, and many others. Reviews and Opinion articles providing a balanced and comprehensive overview of the latest discoveries in a particular field, or presenting a personal perspective on recent developments, are also welcome. See the full list of article types we accept for more information.