{"title":"Semantic Segmentation Model for Road Scene Based on Encoder-Decoder Structure","authors":"Yuanzhe Peng, Weichao Han, Y. Ou","doi":"10.1109/ROBIO49542.2019.8961595","DOIUrl":null,"url":null,"abstract":"Semantic segmentation as a pixel-wise segmentation task provides rich object information, which is an important research topic in robotic perception. It has been widely applied in many fields, such as autonomous driving and robot navigation. In the application of understanding road scene, the semantic segmentation model should accurately describe the appearance and shape of different categories of objects. In addition, the semantic segmentation model need to understand the spatial relationships between different categories. In order to improve the performance of semantic segmentation model for road scene, we present a model based on encoder-decoder structure with dilated convolution. We apply this model on the Cityscapes dataset and compare it with other classical models. To assess performance, we rely on the standard Jaccard Index IoU (Intersection over Union) and mIoU (mean Intersection over Union). The experimental results verify that this model can effectively improve the performance of semantic segmentation and meet the requirements for road scene.","PeriodicalId":121822,"journal":{"name":"2019 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"133 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO49542.2019.8961595","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Semantic segmentation as a pixel-wise segmentation task provides rich object information, which is an important research topic in robotic perception. It has been widely applied in many fields, such as autonomous driving and robot navigation. In the application of understanding road scene, the semantic segmentation model should accurately describe the appearance and shape of different categories of objects. In addition, the semantic segmentation model need to understand the spatial relationships between different categories. In order to improve the performance of semantic segmentation model for road scene, we present a model based on encoder-decoder structure with dilated convolution. We apply this model on the Cityscapes dataset and compare it with other classical models. To assess performance, we rely on the standard Jaccard Index IoU (Intersection over Union) and mIoU (mean Intersection over Union). The experimental results verify that this model can effectively improve the performance of semantic segmentation and meet the requirements for road scene.
语义分割作为一种逐像素分割任务,提供了丰富的目标信息,是机器人感知领域的重要研究课题。它被广泛应用于许多领域,如自动驾驶和机器人导航。在道路场景理解的应用中,语义分割模型应该准确地描述不同类别物体的外观和形状。此外,语义分割模型需要理解不同类别之间的空间关系。为了提高道路场景语义分割模型的性能,提出了一种基于扩展卷积编码器结构的道路场景语义分割模型。我们将该模型应用于城市景观数据集,并与其他经典模型进行比较。为了评估性能,我们依赖于标准的Jaccard Index IoU(交集/联合)和mIoU(平均交集/联合)。实验结果表明,该模型能有效提高语义分割性能,满足道路场景的要求。