Lightweight convolutional neural network for fast visual perception of storage location status in stereo warehouse

IF 5.9 2区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Liangrui Zhang, Xi Zhang, Mingzhou Liu
{"title":"Lightweight convolutional neural network for fast visual perception of storage location status in stereo warehouse","authors":"Liangrui Zhang, Xi Zhang, Mingzhou Liu","doi":"10.1007/s10845-024-02397-0","DOIUrl":null,"url":null,"abstract":"<p>Accurate storage location status data is an important input for location assignment in the inbound stage. Traditional Internet of Things (IoT) identification technologies require high costs and are easily affected by warehouse environments. A lightweight convolutional neural network is proposed for perceiving storage status to achieve high stability and low cost of location availability monitoring. Based on the existing You Only Look Once (YOLOv5) algorithm, the Hough transform is used in the pre-processing to implement tilt correction on the image to improve the stability of object localization. Then the feature extraction unit CBlock is designed based on a new depthwise separable convolution in which the convolutional block attention module is embedded, focusing on both channel and spatial information. The backbone network is constructed by stacking these CBlock blocks to compress the computational cost. The improved neck network adds cross-layer information fusion to reduce the information loss caused by sampling and ensure perceptual accuracy. Moreover, the penalty metric is redefined by SIoU, which considers the vector angle of the bounding box regression and improves the convergence speed and accuracy. The experiments show that the proposed model achieves successful results for storage location status perception in stereo warehouse.</p>","PeriodicalId":16193,"journal":{"name":"Journal of Intelligent Manufacturing","volume":"22 1","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent Manufacturing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10845-024-02397-0","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate storage location status data is an important input for location assignment in the inbound stage. Traditional Internet of Things (IoT) identification technologies require high costs and are easily affected by warehouse environments. A lightweight convolutional neural network is proposed for perceiving storage status to achieve high stability and low cost of location availability monitoring. Based on the existing You Only Look Once (YOLOv5) algorithm, the Hough transform is used in the pre-processing to implement tilt correction on the image to improve the stability of object localization. Then the feature extraction unit CBlock is designed based on a new depthwise separable convolution in which the convolutional block attention module is embedded, focusing on both channel and spatial information. The backbone network is constructed by stacking these CBlock blocks to compress the computational cost. The improved neck network adds cross-layer information fusion to reduce the information loss caused by sampling and ensure perceptual accuracy. Moreover, the penalty metric is redefined by SIoU, which considers the vector angle of the bounding box regression and improves the convergence speed and accuracy. The experiments show that the proposed model achieves successful results for storage location status perception in stereo warehouse.

Abstract Image

用于快速视觉感知立体仓库存储位置状态的轻量级卷积神经网络
准确的存储位置状态数据是入库阶段位置分配的重要输入。传统的物联网(IoT)识别技术不仅成本高,而且容易受到仓库环境的影响。本文提出了一种用于感知存储状态的轻量级卷积神经网络,以实现高稳定性和低成本的位置可用性监控。基于现有的 "只看一次"(YOLOv5)算法,在预处理中使用 Hough 变换对图像进行倾斜校正,以提高物体定位的稳定性。然后,基于新的深度可分离卷积设计了特征提取单元 CBlock,其中嵌入了卷积块注意模块,同时关注通道和空间信息。骨干网络由这些 CBlock 块堆叠而成,以压缩计算成本。改进后的颈部网络增加了跨层信息融合,以减少采样造成的信息损失,确保感知准确性。此外,利用 SIoU 重新定义了惩罚度量,考虑了边界框回归的向量角度,提高了收敛速度和准确性。实验表明,所提出的模型在立体仓库的存储位置状态感知方面取得了成功的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Intelligent Manufacturing
Journal of Intelligent Manufacturing 工程技术-工程:制造
CiteScore
19.30
自引率
9.60%
发文量
171
审稿时长
5.2 months
期刊介绍: The Journal of Nonlinear Engineering aims to be a platform for sharing original research results in theoretical, experimental, practical, and applied nonlinear phenomena within engineering. It serves as a forum to exchange ideas and applications of nonlinear problems across various engineering disciplines. Articles are considered for publication if they explore nonlinearities in engineering systems, offering realistic mathematical modeling, utilizing nonlinearity for new designs, stabilizing systems, understanding system behavior through nonlinearity, optimizing systems based on nonlinear interactions, and developing algorithms to harness and leverage nonlinear elements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信