Learning Road Scene-level Representations via Semantic Region Prediction

Conference on Robot Learning Pub Date : 2023-01-02 DOI:10.48550/arXiv.2301.00714

Zihao Xiao, A. Yuille, Yi-Ting Chen

引用次数: 1

Abstract

In this work, we tackle two vital tasks in automated driving systems, i.e., driver intent prediction and risk object identification from egocentric images. Mainly, we investigate the question: what would be good road scene-level representations for these two tasks? We contend that a scene-level representation must capture higher-level semantic and geometric representations of traffic scenes around ego-vehicle while performing actions to their destinations. To this end, we introduce the representation of semantic regions, which are areas where ego-vehicles visit while taking an afforded action (e.g., left-turn at 4-way intersections). We propose to learn scene-level representations via a novel semantic region prediction task and an automatic semantic region labeling algorithm. Extensive evaluations are conducted on the HDD and nuScenes datasets, and the learned representations lead to state-of-the-art performance for driver intention prediction and risk object identification.

查看原文本刊更多论文

通过语义区域预测学习道路场景级表示

在这项工作中，我们解决了自动驾驶系统中的两个重要任务，即驾驶员意图预测和基于自我中心图像的风险对象识别。我们主要研究的问题是:对于这两个任务，什么是好的道路场景级表示?我们认为，场景级表示必须在执行动作到目的地时捕获自我车辆周围交通场景的更高级别语义和几何表示。为此，我们引入了语义区域的表示，这些区域是自我车辆在采取适当行动时访问的区域(例如，在4路交叉路口左转)。我们提出了一种新的语义区域预测任务和自动语义区域标注算法来学习场景级表示。在HDD和nuScenes数据集上进行了广泛的评估，学习表征导致了驾驶员意图预测和风险对象识别的最先进性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Conference on Robot Learning

自引率

0.00%

发文量