The Design of a Multi-Task Learning System for Autonomous Vehicles

2022 6th International Conference on Green Technology and Sustainable Development (GTSD) Pub Date : 2022-07-29 DOI:10.1109/GTSD54989.2022.9989259

Khanh-Phong Bui, Hoang-Lam Ngoc Le, Quang-Thang Le, Dinh-Hiep Huynh, Vu-Hoang Tran

{"title":"The Design of a Multi-Task Learning System for Autonomous Vehicles","authors":"Khanh-Phong Bui, Hoang-Lam Ngoc Le, Quang-Thang Le, Dinh-Hiep Huynh, Vu-Hoang Tran","doi":"10.1109/GTSD54989.2022.9989259","DOIUrl":null,"url":null,"abstract":"Recently, a lot of research and applications regarding autonomous vehicles have been invested in and developed. These applications have many complex scenarios to handle such as lane segmentation, object detection, traffic sign recognition, and steering control prediction. Many methods handle these tasks separately. Despite the excellent performance these methods achieve, processing these tasks one after another takes a longer time than tackling them all at once. So, in this paper, to reduce the inference time of the autonomous driving system, we proposed a multi-task framework to conduct three tasks: lane segmentation, object detection, and traffic sign recognition simultaneously. Our framework is composed of one encoder for feature extraction and two decoders to handle specific tasks. We only use one encoder for multiple tasks because these tasks complement each other, we hope that the information can be shared among these tasks through the single encoder to improve the performance of each task and also to reduce the amount of data required for training. The decoders include a detection decoder and a segmentation decoder. The detection decoder is designed to detect objects and recognize traffic signs. On the other hand, the segmentation decoder is designed to focus solely on the task of separating the drivable area. By testing on the challenging Carla dataset, our model shows that it can achieve better results compared to state-of-the-art methods. Besides, experimental results also show that, compared with solving tasks independently, our framework can achieve similar performance but greatly reduce processing time.","PeriodicalId":125445,"journal":{"name":"2022 6th International Conference on Green Technology and Sustainable Development (GTSD)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Green Technology and Sustainable Development (GTSD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GTSD54989.2022.9989259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recently, a lot of research and applications regarding autonomous vehicles have been invested in and developed. These applications have many complex scenarios to handle such as lane segmentation, object detection, traffic sign recognition, and steering control prediction. Many methods handle these tasks separately. Despite the excellent performance these methods achieve, processing these tasks one after another takes a longer time than tackling them all at once. So, in this paper, to reduce the inference time of the autonomous driving system, we proposed a multi-task framework to conduct three tasks: lane segmentation, object detection, and traffic sign recognition simultaneously. Our framework is composed of one encoder for feature extraction and two decoders to handle specific tasks. We only use one encoder for multiple tasks because these tasks complement each other, we hope that the information can be shared among these tasks through the single encoder to improve the performance of each task and also to reduce the amount of data required for training. The decoders include a detection decoder and a segmentation decoder. The detection decoder is designed to detect objects and recognize traffic signs. On the other hand, the segmentation decoder is designed to focus solely on the task of separating the drivable area. By testing on the challenging Carla dataset, our model shows that it can achieve better results compared to state-of-the-art methods. Besides, experimental results also show that, compared with solving tasks independently, our framework can achieve similar performance but greatly reduce processing time.

查看原文本刊更多论文

自动驾驶汽车多任务学习系统的设计

近年来，人们对自动驾驶汽车进行了大量的研究和应用。这些应用程序有许多复杂的场景需要处理，如车道分割、目标检测、交通标志识别和转向控制预测。许多方法分别处理这些任务。尽管这些方法实现了出色的性能，但一个接一个地处理这些任务比一次处理所有任务需要更长的时间。因此，为了减少自动驾驶系统的推理时间，本文提出了一个多任务框架，同时进行车道分割、目标检测和交通标志识别三个任务。我们的框架由一个用于特征提取的编码器和两个用于处理特定任务的解码器组成。对于多个任务，我们只使用一个编码器，因为这些任务是相互补充的，我们希望通过单个编码器可以在这些任务之间共享信息，以提高每个任务的性能，也减少训练所需的数据量。解码器包括检测解码器和分割解码器。检测解码器设计用于检测物体和识别交通标志。另一方面，分割解码器被设计成只专注于分离可驱动区域的任务。通过对具有挑战性的Carla数据集的测试，我们的模型表明，与最先进的方法相比，它可以获得更好的结果。此外，实验结果也表明，与独立解决任务相比，我们的框架可以达到相似的性能，但大大减少了处理时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 6th International Conference on Green Technology and Sustainable Development (GTSD)

自引率

0.00%

发文量