EdgeNet: a low-power image recognition model based on small sample information

IF 3.7 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Weiyue Bao, Hong Zhang, Yaoyao Ding, Fangzhou Shen, Liujun Li
{"title":"EdgeNet: a low-power image recognition model based on small sample information","authors":"Weiyue Bao, Hong Zhang, Yaoyao Ding, Fangzhou Shen, Liujun Li","doi":"10.1007/s10044-024-01289-6","DOIUrl":null,"url":null,"abstract":"<p>Existing deep convolutional neural networks that rely on large datasets typically require images with high resolution and deep neural network models trained and called upon to improve accuracy of image recognition and classification. It is needed to use lightweight model to adapt to such low-power devices. However, lightweight small models are limited in their ability to classify and recognize small-sized images with low-resolution and are constrained by the number of parameters in the model and unable to perform deep-level feature extraction, since the low-resolution indicates small sample information. In the intelligent interaction in digital media, capturing, storing, transmitting, and computing high-resolution, high-precision images incur high power consumption and operating costs. When deploying an image recognition system on the client-side of IoT devices, it is difficult to meet the hardware requirements of high storage space and fast computation speed. It is also challenging to directly use high-resolution image data for model fine-tuning and training, and the size and parameter updates of the model are also limited by the storage and operating capacity of the hardware facilities. We proposed a low-power image recognition framework consists data pre-processing part and lightweight modeling architecture part. The data pre-processing method for image data based on an Auto-Encoder that filters R, G, B color channel data using a resolution filter to realize data compression, that is Downscaling large input data to a smaller size, thus to address the limitations of low-power deep learning model deployment and training. Based on the resolution filter, a channel normalization method is proposed to perform batch normalization on each channel dimension to encode the original image data at the same size and improve the mean squared error discrimination of the image data. And the lightweight model uses a depth-separable convolutional neural network and two kinds of blocks: one with batch normalization and the other without, EdgeNet. The architecture makes it possible to deploy more suitable for IoT device. The proposed framework achieves only a small precision loss within permission, but improves the forward inference speed of the model, and reduce the memory storage to 8.7 MB.</p>","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"24 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Analysis and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10044-024-01289-6","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Existing deep convolutional neural networks that rely on large datasets typically require images with high resolution and deep neural network models trained and called upon to improve accuracy of image recognition and classification. It is needed to use lightweight model to adapt to such low-power devices. However, lightweight small models are limited in their ability to classify and recognize small-sized images with low-resolution and are constrained by the number of parameters in the model and unable to perform deep-level feature extraction, since the low-resolution indicates small sample information. In the intelligent interaction in digital media, capturing, storing, transmitting, and computing high-resolution, high-precision images incur high power consumption and operating costs. When deploying an image recognition system on the client-side of IoT devices, it is difficult to meet the hardware requirements of high storage space and fast computation speed. It is also challenging to directly use high-resolution image data for model fine-tuning and training, and the size and parameter updates of the model are also limited by the storage and operating capacity of the hardware facilities. We proposed a low-power image recognition framework consists data pre-processing part and lightweight modeling architecture part. The data pre-processing method for image data based on an Auto-Encoder that filters R, G, B color channel data using a resolution filter to realize data compression, that is Downscaling large input data to a smaller size, thus to address the limitations of low-power deep learning model deployment and training. Based on the resolution filter, a channel normalization method is proposed to perform batch normalization on each channel dimension to encode the original image data at the same size and improve the mean squared error discrimination of the image data. And the lightweight model uses a depth-separable convolutional neural network and two kinds of blocks: one with batch normalization and the other without, EdgeNet. The architecture makes it possible to deploy more suitable for IoT device. The proposed framework achieves only a small precision loss within permission, but improves the forward inference speed of the model, and reduce the memory storage to 8.7 MB.

Abstract Image

边缘网:基于小样本信息的低功耗图像识别模型
现有的深度卷积神经网络依赖于大型数据集,通常需要高分辨率的图像以及经过训练和调用的深度神经网络模型,以提高图像识别和分类的准确性。需要使用轻量级模型来适应这种低功耗设备。然而,轻量级的小型模型在对低分辨率的小型图像进行分类和识别时能力有限,并且受到模型参数数量的限制,无法进行深层次的特征提取,因为低分辨率表明样本信息较少。在数字媒体的智能交互中,捕捉、存储、传输和计算高分辨率、高精度的图像会产生很高的功耗和运行成本。在物联网设备的客户端部署图像识别系统时,很难满足存储空间大、计算速度快的硬件要求。直接使用高分辨率图像数据进行模型微调和训练也具有挑战性,而且模型的大小和参数更新也受到硬件设施的存储和运行能力的限制。我们提出的低功耗图像识别框架包括数据预处理部分和轻量级建模架构部分。图像数据预处理方法基于自动编码器,利用分辨率滤波器过滤R、G、B三色通道数据,实现数据压缩,即把大输入数据降维到更小的尺寸,从而解决低功耗深度学习模型部署和训练的局限性。在分辨率滤波器的基础上,提出了一种通道归一化方法,对每个通道维度进行批量归一化,以相同的大小对原始图像数据进行编码,提高图像数据的均方误差判别能力。轻量级模型使用深度分离卷积神经网络和两种区块:一种是带批量归一化的,另一种是不带批量归一化的 EdgeNet。这种架构使其更适合物联网设备的部署成为可能。所提出的框架只在允许范围内实现了较小的精度损失,但提高了模型的前向推理速度,并将内存存储量减少到 8.7 MB。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Pattern Analysis and Applications
Pattern Analysis and Applications 工程技术-计算机:人工智能
CiteScore
7.40
自引率
2.60%
发文量
76
审稿时长
13.5 months
期刊介绍: The journal publishes high quality articles in areas of fundamental research in intelligent pattern analysis and applications in computer science and engineering. It aims to provide a forum for original research which describes novel pattern analysis techniques and industrial applications of the current technology. In addition, the journal will also publish articles on pattern analysis applications in medical imaging. The journal solicits articles that detail new technology and methods for pattern recognition and analysis in applied domains including, but not limited to, computer vision and image processing, speech analysis, robotics, multimedia, document analysis, character recognition, knowledge engineering for pattern recognition, fractal analysis, and intelligent control. The journal publishes articles on the use of advanced pattern recognition and analysis methods including statistical techniques, neural networks, genetic algorithms, fuzzy pattern recognition, machine learning, and hardware implementations which are either relevant to the development of pattern analysis as a research area or detail novel pattern analysis applications. Papers proposing new classifier systems or their development, pattern analysis systems for real-time applications, fuzzy and temporal pattern recognition and uncertainty management in applied pattern recognition are particularly solicited.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信