{"title":"An efficient and lightweight rotating target detection model for industrial scenarios","authors":"Mingyao Teng, Guoyang Wan, Shoujun Bai, Yunhao Zhu, Hanqi Li, Chengwen Wang","doi":"10.1016/j.dsp.2025.105631","DOIUrl":null,"url":null,"abstract":"<div><div>Currently, target detection technology has largely matured in industrial scenarios, but most applications still rely on horizontal bounding boxes for target detection. For industrial parts with irregular shapes, varying sizes, and diverse categories, horizontal bounding box detection tends to introduce unnecessary background information, leading to false positives or missed detections. It also suffers from the issue of losing boundary direction. To address these challenges, this paper proposes a novel YOLO11 model for detecting rotating targets (DAGP-YOLO). First, a rotating detection head is introduced to minimize the interference of redundant background information. Dynamic convolution is applied to expand the receptive field of the network, and the ADown module replaces the original down sampling method to improve detail extraction. Second, an orientation-aware attention mechanism (GCA) is designed to better focus on the directional features of rotating targets. Lastly, to meet the demand of small storage space and high detection accuracy of edge devices in industry, this paper adopts the L1 filter pruning strategy to compress the improved model. We performed experimental validation on a self-constructed dataset, the publicly available MVTec Screws dataset, and the UCAS AOD dataset in the aerial photography domain. The results demonstrate the superiority and effectiveness of our approach. Additionally, we developed a visualization system based on the C# WinForms framework, which allows for real-time detection and display of workpiece images, further showcasing the practical applicability of the improved method in industrial settings.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"168 ","pages":"Article 105631"},"PeriodicalIF":3.0000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425006530","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Currently, target detection technology has largely matured in industrial scenarios, but most applications still rely on horizontal bounding boxes for target detection. For industrial parts with irregular shapes, varying sizes, and diverse categories, horizontal bounding box detection tends to introduce unnecessary background information, leading to false positives or missed detections. It also suffers from the issue of losing boundary direction. To address these challenges, this paper proposes a novel YOLO11 model for detecting rotating targets (DAGP-YOLO). First, a rotating detection head is introduced to minimize the interference of redundant background information. Dynamic convolution is applied to expand the receptive field of the network, and the ADown module replaces the original down sampling method to improve detail extraction. Second, an orientation-aware attention mechanism (GCA) is designed to better focus on the directional features of rotating targets. Lastly, to meet the demand of small storage space and high detection accuracy of edge devices in industry, this paper adopts the L1 filter pruning strategy to compress the improved model. We performed experimental validation on a self-constructed dataset, the publicly available MVTec Screws dataset, and the UCAS AOD dataset in the aerial photography domain. The results demonstrate the superiority and effectiveness of our approach. Additionally, we developed a visualization system based on the C# WinForms framework, which allows for real-time detection and display of workpiece images, further showcasing the practical applicability of the improved method in industrial settings.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,