Yaqian Li , Guoping Liu , Haibin Li , Wenming Zhang , Xiaoyang Shen
{"title":"Enhancing object detection with large kernel convolution and cross convolution","authors":"Yaqian Li , Guoping Liu , Haibin Li , Wenming Zhang , Xiaoyang Shen","doi":"10.1016/j.dsp.2025.105433","DOIUrl":null,"url":null,"abstract":"<div><div>Existing object detection models often struggle with detecting small objects due to their limited ability to capture sufficient contextual information. In this paper, we introduce a lightweight object detection model that leverages large kernel convolution with attention (LKA) and a hierarchical feature fusion group (HFFG) to address this issue. The LKA module employs large kernel convolution to capture long-range dependencies and contextual information, combined with depthwise separate convolution to maintain a lightweight design. An incorporated attention mechanism further enables the modal to adaptively focus on key areas, thereby improving detection performance for small objects. The HFFG module, which integrates Cross Convolution Blocks, explores and retains structural information across different scales. By effectively extracting structural details, our model exhibits enhanced performance on object of various sizes. Extensive experiments on the VisDrone2019 and PASACAL VOC datasets demonstrate that our model achieves an outstanding mAP of 23.4 %, surpassing the baseline YOLOX-s model by +1.5 %. These results not only validate the effectiveness but also demonstrate its robustness and generalization capability.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105433"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004555","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Existing object detection models often struggle with detecting small objects due to their limited ability to capture sufficient contextual information. In this paper, we introduce a lightweight object detection model that leverages large kernel convolution with attention (LKA) and a hierarchical feature fusion group (HFFG) to address this issue. The LKA module employs large kernel convolution to capture long-range dependencies and contextual information, combined with depthwise separate convolution to maintain a lightweight design. An incorporated attention mechanism further enables the modal to adaptively focus on key areas, thereby improving detection performance for small objects. The HFFG module, which integrates Cross Convolution Blocks, explores and retains structural information across different scales. By effectively extracting structural details, our model exhibits enhanced performance on object of various sizes. Extensive experiments on the VisDrone2019 and PASACAL VOC datasets demonstrate that our model achieves an outstanding mAP of 23.4 %, surpassing the baseline YOLOX-s model by +1.5 %. These results not only validate the effectiveness but also demonstrate its robustness and generalization capability.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,