Efficient Detection of Small and Complex Objects for Autonomous Driving Using Deep Learning

2023 International Conference on Communication System, Computing and IT Applications (CSCITA) Pub Date : 2023-03-31 DOI:10.1109/CSCITA55725.2023.10104969

Ansh Sharma, Rashmi Gupta

引用次数: 0

Abstract

The YOLOv2 is one of the most prominent model used for object detection, it works on the concept of anchor boxes. However, this model is prone to some problems like double anchor boxes, missing small objects, and high time complexity. In this paper, we aim to solve the problem of double anchor boxes and undetected small objects by tuning the parameters like intersection over union (IoU) and customizing non-max suppression thresholds. Also, to reduce the time complexity of the model, we have proposed the use of depth wise convolution (DW-Conv2D) instead of fundamental convolution (Conv2D) in this paper. Once we applied the proposed model to datasets like PASCAL VOC07 and VOC12, we observed significant improvements like reduced floating-point operations per second by 9.5% and better accuracy than the existing state-of-the-art models.

查看原文本刊更多论文

基于深度学习的自动驾驶小而复杂物体的有效检测

YOLOv2是用于目标检测的最突出的模型之一，它基于锚盒的概念工作。但该模型容易出现双锚盒、小对象缺失、时间复杂度高等问题。在本文中，我们的目标是通过调整交集超过联合(IoU)等参数和自定义非最大抑制阈值来解决双锚盒和未检测到的小目标问题。此外，为了降低模型的时间复杂度，本文提出使用深度卷积(DW-Conv2D)来代替基本卷积(Conv2D)。一旦我们将提出的模型应用于PASCAL VOC07和VOC12等数据集，我们观察到显著的改进，如每秒浮点运算次数减少9.5%，并且比现有的最先进的模型精度更高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 International Conference on Communication System, Computing and IT Applications (CSCITA)

自引率

0.00%

发文量