Q-YOLOP: Quantization-Aware You Only Look Once for Panoptic Driving Perception

2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) Pub Date : 2023-07-01 DOI:10.1109/ICMEW59549.2023.00015

Chi-Chih Chang, Wei-Cheng Lin, Peide Wang, Shengtao Yu, Yunrong Lu, Kuan-Cheng Lin, Kaiyang Wu

引用次数: 8

Abstract

In this work, we present an efficient and quantization-aware panoptic driving perception model (Q-YOLOP) for object detection, drivable area segmentation, and lane line segmentation, in the context of autonomous driving. Our model employs the Efficient Layer Aggregation Network (ELAN) as its backbone and task-specific heads for each task. We employ a four-stage training process that includes pretraining on the BDD100K dataset, finetuning on both the BDD100K and iVS datasets, and quantization-aware training (QAT) on BDD100K. During the training process, we use powerful data augmentation techniques, such as random perspective and mosaic, and train the model on a combination of the BDD100K and iVS datasets. Both strategies enhance the model's generalization capabilities. The proposed model achieves state-of-the-art performance with an mAP@0.5 of 0.622 for object detection and an mIoU of 0.612 for segmentation, while maintaining low computational and memory requirements.

查看原文本刊更多论文

Q-YOLOP:量化意识，你只看一次全景驾驶感知

在这项工作中，我们提出了一种高效的、量化感知的全光驾驶感知模型(Q-YOLOP)，用于自动驾驶背景下的目标检测、可驾驶区域分割和车道线分割。我们的模型采用高效层聚合网络(ELAN)作为其主干，并为每个任务指定特定的头。我们采用了一个四阶段的训练过程，包括对BDD100K数据集的预训练，对BDD100K和iVS数据集的微调，以及对BDD100K的量化感知训练(QAT)。在训练过程中，我们使用了强大的数据增强技术，如随机视角和马赛克，并在BDD100K和iVS数据集的组合上训练模型。这两种策略都增强了模型的泛化能力。提出的模型实现了最先进的性能，目标检测的mAP@0.5为0.622，分割的mIoU为0.612，同时保持了较低的计算和内存需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)

自引率

0.00%

发文量