Vis-YOLO: a lightweight and efficient image detector for unmanned aerial vehicle small objects

IF 1 4区计算机科学 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC

Journal of Electronic Imaging Pub Date : 2024-09-01 DOI:10.1117/1.jei.33.5.053003

Xiangyu Deng, Jiangyong Du

引用次数: 0

Abstract

Yolo series models are extensive within the domain of object detection. Aiming at the challenge of small object detection, we analyze the limitations of existing detection models and propose a Vis-YOLO object detection algorithm based on YOLOv8s. First, the down-sampling times are reduced to retain more features, and the detection head is replaced to adapt to the small object. Then, deformable convolutional networks are used to improve the C2f module, improving its feature extraction ability. Finally, the separation and enhancement attention module is introduced to the model to give more weight to the useful information. Experiments show that the improved Vis-YOLO model outperforms the YOLOv8s model on the visdrone-2019 dataset. The precision improved by 5.4%, the recall by 6.3%, and the mAP50 by 6.8%. Moreover, Vis-YOLO models are smaller and suitable for mobile deployment. This research provides a new method and idea for small object detection, which has excellent potential application value.

查看原文本刊更多论文

Vis-YOLO：轻便高效的无人飞行器小型物体图像探测器

Yolo 系列模型在物体检测领域应用广泛。针对小物体检测的挑战，我们分析了现有检测模型的局限性，提出了基于 YOLOv8s 的 Vis-YOLO 物体检测算法。首先，减少下采样时间以保留更多特征，并更换检测头以适应小物体。然后，使用可变形卷积网络改进 C2f 模块，提高其特征提取能力。最后，在模型中引入分离和增强注意模块，以提高有用信息的权重。实验表明，在 visdrone-2019 数据集上，改进后的 Vis-YOLO 模型优于 YOLOv8s 模型。精确度提高了 5.4%，召回率提高了 6.3%，mAP50 提高了 6.8%。此外，Vis-YOLO 模型体积更小，适合移动部署。这项研究为小物体检测提供了一种新的方法和思路，具有极高的潜在应用价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Electronic Imaging 工程技术-成像科学与照相技术

CiteScore

1.70

自引率

27.30%

发文量

341

审稿时长

4.0 months

期刊介绍： The Journal of Electronic Imaging publishes peer-reviewed papers in all technology areas that make up the field of electronic imaging and are normally considered in the design, engineering, and applications of electronic imaging systems.