Towards more accurate object detection via encoding reinforcement and multi-channel enhancement

IF 3.5 | Zone 2, Computer Science | Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence Pub Date : 2024-12-23 DOI:10.1007/s10489-024-06200-8

Weina Wang, Shuangyong Li, Huxidan Jumahong

Citations: 0

Abstract

Existing object detection networks typically use small-kernel convolutions, which extract sufficient features for recognizing targets but model long-range dependencies poorly and have small receptive fields. This paper proposes an object detection network built on large-kernel convolutions and multiple channels. First, an encoding reinforcement module using large-kernel convolutions is designed to enlarge the receptive field and improve global feature extraction. Then, a channel enhancement module is constructed to strengthen structural information learning. Both modules are designed to be lightweight. Finally, the WIoU loss function is introduced to improve the model's robustness on poor-quality datasets. In experiments, the proposed model achieves the best performance among existing CNN-based lightweight models with a similar parameter count or computational complexity.
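The trade-off the abstract describes, between receptive field size and model weight, can be illustrated with some simple arithmetic. The sketch below is not the paper's encoding reinforcement module. It only compares the receptive field and weight count of stacked small kernels against a single large kernel, assuming stride-1 convolutions, 64 channels, and kernel sizes of 3, 7, and 13, all chosen here purely for illustration. It also assumes a depthwise-separable layout as one common way to keep large kernels lightweight, which the abstract does not confirm.

```python
from typing import Sequence, Tuple


def receptive_field(layers: Sequence[Tuple[int, int]]) -> int:
    """Receptive field of a conv stack given (kernel_size, stride) pairs."""
    rf, jump = 1, 1
    for kernel, stride in layers:
        # Each layer widens the field by (kernel - 1) input-space steps.
        rf += (kernel - 1) * jump
        jump *= stride
    return rf


def conv_params(kernel: int, c_in: int, c_out: int, groups: int = 1) -> int:
    """Weight count of a 2D convolution, ignoring the bias term."""
    return kernel * kernel * (c_in // groups) * c_out


def depthwise_separable_params(kernel: int, channels: int) -> int:
    """Depthwise k x k conv followed by a 1 x 1 pointwise conv (illustrative)."""
    depthwise = conv_params(kernel, channels, channels, groups=channels)
    pointwise = conv_params(1, channels, channels)
    return depthwise + pointwise


if __name__ == "__main__":
    channels = 64  # assumed channel width, not taken from the paper

    # Three stacked 3x3 layers reach the same field as one 7x7 layer.
    print(receptive_field([(3, 1)] * 3))  # 7
    print(receptive_field([(7, 1)]))      # 7
    print(receptive_field([(13, 1)]))     # 13

    print(3 * conv_params(3, channels, channels))    # 110592
    print(conv_params(7, channels, channels))        # 200704
    print(depthwise_separable_params(7, channels))   # 7232
    print(depthwise_separable_params(13, channels))  # 14912
```

Under these assumptions, a dense 7x7 layer costs more weights than three stacked 3x3 layers for the same receptive field, while a depthwise-separable 13x13 layer covers a much larger field with far fewer weights. This is one reason large-kernel designs are usually paired with a lightweight factorization.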

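Source journal

Applied Intelligence (Engineering and Technology, Computer Science: Artificial Intelligence)

CiteScore: 6.60

Self-citation rate: 20.80%

Articles published: 1361

Review time: 5.9 months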

Journal description: With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.