Wei Zhou , Honggang Li , Li Zheng , Shan Xiao , Jun Yi
{"title":"A Lightweight Detection and Recognition Framework for cigarette laser code","authors":"Wei Zhou , Honggang Li , Li Zheng , Shan Xiao , Jun Yi","doi":"10.1016/j.engappai.2025.111777","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic detection and recognition of laser codes on cigarette case is important in distinguishing the authenticity of cigarettes. However, detecting and recognizing cigarette laser codes presents a challenging industrial problem due to the intricate background of cigarette images. In this paper, a Lightweight Detection and Recognition Framework (LDRF) is proposed to detect and recognize cigarette laser code. The LDRF model consists of the Lightweight Detection Network (LDNet) and the Lightweight Recognition Network (LRNet). In the LDNet stage, a lightweight feature extraction network is proposed to extract features of the cigarette code area. Furthermore, a bidirectional feature pyramid network (BiFPN) feature fusion structure is introduced to tackle the multi-scale feature fusion challenge in cigarette code detection scenarios. Notably, alignment and normalization of all features channels are conducted to reduce computational requirements in the post-processing stage, enabling precise detection performance while significantly reducing parameters and computational complexity. In the LRNet stage, an integrated network architecture is designed to enhance the fusion of visual and temporal features. Furthermore, a combination of bidirectional temporal convolutional network (BiTCN) and Transformer is employed in the feature extraction stage to differentiate between background and characters, as well as capture the interdependence among different characters. Specifically, DownSampling is utilized to adjust the size of input images and Merging or Combining methods are applied at each stage to capture multi-level features. Experimental results demonstrate that the proposed LDRF method provides better performance than state-of-the-art models, and achieves trade-off between accuracy and speed.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"160 ","pages":"Article 111777"},"PeriodicalIF":8.0000,"publicationDate":"2025-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625017798","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic detection and recognition of laser codes on cigarette case is important in distinguishing the authenticity of cigarettes. However, detecting and recognizing cigarette laser codes presents a challenging industrial problem due to the intricate background of cigarette images. In this paper, a Lightweight Detection and Recognition Framework (LDRF) is proposed to detect and recognize cigarette laser code. The LDRF model consists of the Lightweight Detection Network (LDNet) and the Lightweight Recognition Network (LRNet). In the LDNet stage, a lightweight feature extraction network is proposed to extract features of the cigarette code area. Furthermore, a bidirectional feature pyramid network (BiFPN) feature fusion structure is introduced to tackle the multi-scale feature fusion challenge in cigarette code detection scenarios. Notably, alignment and normalization of all features channels are conducted to reduce computational requirements in the post-processing stage, enabling precise detection performance while significantly reducing parameters and computational complexity. In the LRNet stage, an integrated network architecture is designed to enhance the fusion of visual and temporal features. Furthermore, a combination of bidirectional temporal convolutional network (BiTCN) and Transformer is employed in the feature extraction stage to differentiate between background and characters, as well as capture the interdependence among different characters. Specifically, DownSampling is utilized to adjust the size of input images and Merging or Combining methods are applied at each stage to capture multi-level features. Experimental results demonstrate that the proposed LDRF method provides better performance than state-of-the-art models, and achieves trade-off between accuracy and speed.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.