A CNN Approximation Method Based on Low-bit Quantization and Random Forests

Shota Yatabe, Sora Isobe, Yoichi Tomioka, H. Saito, Y. Kohira, Qiangfu Zhao
{"title":"A CNN Approximation Method Based on Low-bit Quantization and Random Forests","authors":"Shota Yatabe, Sora Isobe, Yoichi Tomioka, H. Saito, Y. Kohira, Qiangfu Zhao","doi":"10.1109/CYBCONF51991.2021.9464152","DOIUrl":null,"url":null,"abstract":"In recent years, the use of image recognition technology in edge devices has been increasing. To achieve low-power and low-latency inference of convolutional neural networks (CNNs) in edge devices, methods that reduce the number of operations, such as pruning, have been actively researched. However, even after applying these existing methods, we still need to calculate many multiply-accumulate (MAC) operations. In this paper, we propose a hardware-friendly CNN approximation method based on low-bit quantization and random forests to reduce the number of operations and operation cost of CNN inference. In our experiments, we reduce the number of operations by 30.8% for LeNet and by 27.1% for ResNet18 while maintaining high image classification accuracy.","PeriodicalId":231194,"journal":{"name":"2021 5th IEEE International Conference on Cybernetics (CYBCONF)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th IEEE International Conference on Cybernetics (CYBCONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBCONF51991.2021.9464152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In recent years, the use of image recognition technology in edge devices has been increasing. To achieve low-power and low-latency inference of convolutional neural networks (CNNs) in edge devices, methods that reduce the number of operations, such as pruning, have been actively researched. However, even after applying these existing methods, we still need to calculate many multiply-accumulate (MAC) operations. In this paper, we propose a hardware-friendly CNN approximation method based on low-bit quantization and random forests to reduce the number of operations and operation cost of CNN inference. In our experiments, we reduce the number of operations by 30.8% for LeNet and by 27.1% for ResNet18 while maintaining high image classification accuracy.
基于低比特量化和随机森林的CNN逼近方法
近年来,图像识别技术在边缘设备中的应用越来越多。为了在边缘设备上实现卷积神经网络(cnn)的低功耗和低延迟推理,人们积极研究减少运算次数的方法,如剪枝。然而,即使在应用这些现有的方法之后,我们仍然需要计算许多乘法累加(MAC)操作。在本文中,我们提出了一种基于低比特量化和随机森林的硬件友好CNN近似方法,以减少CNN推理的运算次数和运算成本。在我们的实验中,我们将LeNet的操作次数减少了30.8%,将ResNet18的操作次数减少了27.1%,同时保持了较高的图像分类精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信