{"title":"On Different Criteria for Optimizing the Two-bit Uniform Quantizer","authors":"J. Nikolić, Z. Perić, Stefan S. Tomic, D. Aleksić","doi":"10.1109/INFOTEH53737.2022.9751268","DOIUrl":null,"url":null,"abstract":"In this paper, we address the problem of determining the most influential parameter of the two-bit uniform quantizer, i.e. the support region threshold ($x$max), according to different optimization criteria. We analyze the dependences of the quantized neural network (QNN) model quality indicators on $\\boldsymbol{x_{\\max}}$, for the case of applying two-bit uniform quantization of weights in the post-training phase for the MNIST dataset. In addition to the theoretical quality indicator of quantized signal for the Laplacian distribution - theoretical SQNR, our quality indicators are also the accuracy of QNN and the experimentally determined SQNR for the Laplacian-like weight distribution in the three-layer fully connected neural network. The goal is to determine the results of optimizing these quality indicators per Xmax for the considered research framework and to make their comparison for setting a good foundation for future research.","PeriodicalId":6839,"journal":{"name":"2022 21st International Symposium INFOTEH-JAHORINA (INFOTEH)","volume":"27 1","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 21st International Symposium INFOTEH-JAHORINA (INFOTEH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFOTEH53737.2022.9751268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we address the problem of determining the most influential parameter of the two-bit uniform quantizer, i.e. the support region threshold ($x$max), according to different optimization criteria. We analyze the dependences of the quantized neural network (QNN) model quality indicators on $\boldsymbol{x_{\max}}$, for the case of applying two-bit uniform quantization of weights in the post-training phase for the MNIST dataset. In addition to the theoretical quality indicator of quantized signal for the Laplacian distribution - theoretical SQNR, our quality indicators are also the accuracy of QNN and the experimentally determined SQNR for the Laplacian-like weight distribution in the three-layer fully connected neural network. The goal is to determine the results of optimizing these quality indicators per Xmax for the considered research framework and to make their comparison for setting a good foundation for future research.