Traffic scene segmentation based on boosting over multimodal low, intermediate and high order multi-range channel features

2017 IEEE Intelligent Vehicles Symposium (IV) Pub Date : 2017-06-11 DOI:10.1109/IVS.2017.7995701

A. Costea, S. Nedevschi

{"title":"Traffic scene segmentation based on boosting over multimodal low, intermediate and high order multi-range channel features","authors":"A. Costea, S. Nedevschi","doi":"10.1109/IVS.2017.7995701","DOIUrl":null,"url":null,"abstract":"In this paper we introduce a novel multimodal boosting based solution for semantic segmentation of traffic scenarios. Local structure and context are captured from both monocular color and depth modalities in the form of image channels. We define multiple channel types at three different levels: low, intermediate and high order channels. The low order channels are computed using a multimodal multiresolution filtering scheme and capture structure and color information from lower receptive fields. For the intermediate order channels, we employ deep convolutional channels that are able to capture more complex structures, having a larger receptive field. The high order channels are scale invariant channels that consist of spatial, geometric and semantic channels. These channels are enhanced by additional pyramidal context channels, capturing context at multiple levels. The semantic segmentation is achieved by a boosting based classification scheme over superpixels using multi-range channel features and pyramidal context features. A pre-segmentation is used to generate semantic channels as input for more powerful final segmentation. The final segmentation is refined using a superpixel-level dense conditional random field. The proposed solution is evaluated on the Cityscapes segmentation benchmark and achieves competitive results at low computational costs. It is the first boosting based solution that is able to keep up with the performance of deep learning based approaches.","PeriodicalId":143367,"journal":{"name":"2017 IEEE Intelligent Vehicles Symposium (IV)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Intelligent Vehicles Symposium (IV)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVS.2017.7995701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

In this paper we introduce a novel multimodal boosting based solution for semantic segmentation of traffic scenarios. Local structure and context are captured from both monocular color and depth modalities in the form of image channels. We define multiple channel types at three different levels: low, intermediate and high order channels. The low order channels are computed using a multimodal multiresolution filtering scheme and capture structure and color information from lower receptive fields. For the intermediate order channels, we employ deep convolutional channels that are able to capture more complex structures, having a larger receptive field. The high order channels are scale invariant channels that consist of spatial, geometric and semantic channels. These channels are enhanced by additional pyramidal context channels, capturing context at multiple levels. The semantic segmentation is achieved by a boosting based classification scheme over superpixels using multi-range channel features and pyramidal context features. A pre-segmentation is used to generate semantic channels as input for more powerful final segmentation. The final segmentation is refined using a superpixel-level dense conditional random field. The proposed solution is evaluated on the Cityscapes segmentation benchmark and achieves competitive results at low computational costs. It is the first boosting based solution that is able to keep up with the performance of deep learning based approaches.

查看原文本刊更多论文

基于多模态低、中、高阶多范围通道特征的增强交通场景分割

本文提出了一种基于多模态增强的交通场景语义分割方法。以图像通道的形式从单目颜色和深度模式中捕获局部结构和上下文。我们在三个不同的级别定义多个通道类型:低、中、高阶通道。使用多模态多分辨率滤波方案计算低阶通道，并从低感受野捕获结构和颜色信息。对于中间阶通道，我们采用能够捕获更复杂结构的深度卷积通道，具有更大的接受野。高阶通道是由空间通道、几何通道和语义通道组成的尺度不变通道。这些通道通过额外的金字塔形上下文通道得到增强，在多个层面捕获上下文。语义分割是利用多距离通道特征和金字塔形上下文特征在超像素上实现的。预分割用于生成语义通道作为输入，用于更强大的最终分割。最后使用超像素级密集条件随机场对分割进行细化。该方案在城市景观分割基准上进行了评估，以较低的计算成本获得了具有竞争力的结果。这是第一个基于增强的解决方案，能够跟上基于深度学习的方法的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE Intelligent Vehicles Symposium (IV)

自引率

0.00%

发文量