基于自适应卷积和分岔解耦的车辆目标检测

IEEE transactions on artificial intelligence Pub Date : 2024-12-27 DOI:10.1109/TAI.2024.3522871

Yunfei Yin;Zheng Yuan;Yu He;Xianjian Bao

{"title":"基于自适应卷积和分岔解耦的车辆目标检测","authors":"Yunfei Yin;Zheng Yuan;Yu He;Xianjian Bao","doi":"10.1109/TAI.2024.3522871","DOIUrl":null,"url":null,"abstract":"Vehicle object detection is the foundation of autonomous driving system development. The existing state-of-the-art methods mainly focus on the applications and improvement of general-purpose single shot multibox detector (SSD) and you only look once (YOLO) methods. However, these methods overlook the specific characteristics of traffic scenarios, such as frequent changes of camera angles and rapid changes in surrounding environment, thus leading to peculiar deformations and blurring of vehicle objects. To address these issues, we consider making improvements on the targeted deformations, classification, positioning, and other operations for vehicle object images to alleviate the object deformation and blurring, and therefore propose a Vehicle Object Detection method based on adaptive convolution and bifurcation decoupling (VODACBD). Specifically, in VODACBD, to solve the deformation problem of vehicle objects, adaptive convolution, and feature redivision upsampling are proposed to dynamically capture object features; to alleviate the blurring of vehicle objects, a bifurcation decoupling head is proposed to learn vehicle categories, positions, and confidences. Moreover, to further enhance the overall performance, a global optimal transportation algorithm (GlobalOTA) is well designed to improve the quality of training samples. Extensive experiments were conducted on publicly available traffic object detection datasets such as BDD100K, KITTI, and VOC. The experimental results demonstrate that, compared with current state-of-the-art methods, VODACBD not only achieves an average performance improvement of 1.4% but also an average speed improvement of 1.57 times that of the state-of-the-art.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"6 5","pages":"1298-1308"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"VODACBD: Vehicle Object Detection Based on Adaptive Convolution and Bifurcation Decoupling\",\"authors\":\"Yunfei Yin;Zheng Yuan;Yu He;Xianjian Bao\",\"doi\":\"10.1109/TAI.2024.3522871\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vehicle object detection is the foundation of autonomous driving system development. The existing state-of-the-art methods mainly focus on the applications and improvement of general-purpose single shot multibox detector (SSD) and you only look once (YOLO) methods. However, these methods overlook the specific characteristics of traffic scenarios, such as frequent changes of camera angles and rapid changes in surrounding environment, thus leading to peculiar deformations and blurring of vehicle objects. To address these issues, we consider making improvements on the targeted deformations, classification, positioning, and other operations for vehicle object images to alleviate the object deformation and blurring, and therefore propose a Vehicle Object Detection method based on adaptive convolution and bifurcation decoupling (VODACBD). Specifically, in VODACBD, to solve the deformation problem of vehicle objects, adaptive convolution, and feature redivision upsampling are proposed to dynamically capture object features; to alleviate the blurring of vehicle objects, a bifurcation decoupling head is proposed to learn vehicle categories, positions, and confidences. Moreover, to further enhance the overall performance, a global optimal transportation algorithm (GlobalOTA) is well designed to improve the quality of training samples. Extensive experiments were conducted on publicly available traffic object detection datasets such as BDD100K, KITTI, and VOC. The experimental results demonstrate that, compared with current state-of-the-art methods, VODACBD not only achieves an average performance improvement of 1.4% but also an average speed improvement of 1.57 times that of the state-of-the-art.\",\"PeriodicalId\":73305,\"journal\":{\"name\":\"IEEE transactions on artificial intelligence\",\"volume\":\"6 5\",\"pages\":\"1298-1308\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-12-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on artificial intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10816662/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10816662/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

车辆目标检测是自动驾驶系统开发的基础。现有的最先进的方法主要集中在通用单镜头多盒探测器（SSD）和你只看一次（YOLO）方法的应用和改进上。然而，这些方法忽略了交通场景的具体特征，如摄像机角度的频繁变化和周围环境的快速变化，从而导致车辆物体的特殊变形和模糊。为了解决这些问题，我们考虑对车辆目标图像的目标变形、分类、定位等操作进行改进，以减轻物体的变形和模糊，因此提出了一种基于自适应卷积和分岔解耦（VODACBD）的车辆目标检测方法。具体来说，在VODACBD中，针对车辆物体的变形问题，提出了自适应卷积和特征重划分上采样来动态捕获物体特征；为了减轻车辆对象的模糊，提出了一种分叉解耦头来学习车辆类别、位置和置信度。此外，为了进一步提高整体性能，设计了全局最优运输算法（GlobalOTA）来提高训练样本的质量。在BDD100K、KITTI和VOC等公开可用的交通目标检测数据集上进行了大量实验。实验结果表明，与目前最先进的方法相比，VODACBD不仅平均性能提高了1.4%，而且平均速度提高了1.57倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

VODACBD: Vehicle Object Detection Based on Adaptive Convolution and Bifurcation Decoupling

Vehicle object detection is the foundation of autonomous driving system development. The existing state-of-the-art methods mainly focus on the applications and improvement of general-purpose single shot multibox detector (SSD) and you only look once (YOLO) methods. However, these methods overlook the specific characteristics of traffic scenarios, such as frequent changes of camera angles and rapid changes in surrounding environment, thus leading to peculiar deformations and blurring of vehicle objects. To address these issues, we consider making improvements on the targeted deformations, classification, positioning, and other operations for vehicle object images to alleviate the object deformation and blurring, and therefore propose a Vehicle Object Detection method based on adaptive convolution and bifurcation decoupling (VODACBD). Specifically, in VODACBD, to solve the deformation problem of vehicle objects, adaptive convolution, and feature redivision upsampling are proposed to dynamically capture object features; to alleviate the blurring of vehicle objects, a bifurcation decoupling head is proposed to learn vehicle categories, positions, and confidences. Moreover, to further enhance the overall performance, a global optimal transportation algorithm (GlobalOTA) is well designed to improve the quality of training samples. Extensive experiments were conducted on publicly available traffic object detection datasets such as BDD100K, KITTI, and VOC. The experimental results demonstrate that, compared with current state-of-the-art methods, VODACBD not only achieves an average performance improvement of 1.4% but also an average speed improvement of 1.57 times that of the state-of-the-art.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on artificial intelligence

CiteScore

7.70

自引率

0.00%

发文量