A novel dilated convolutional neural network model for road scene segmentation
Yachao Zhang, Yuxia Yuan
DOI: 10.4108/eai.27-1-2022.173164 · Published: 2022-01-27 (Journal Article) · Citations: 1
Road scene understanding is an important module in autonomous driving: it provides rich information about the road and plays a key role in building high-precision maps and in real-time planning. Semantic segmentation, which assigns a category label to every pixel of an image, is the most commonly used technique for driving-scene understanding. However, most widely used semantic segmentation algorithms cannot achieve a good balance between speed and accuracy. This paper constructs a road scene segmentation model based on a dilated convolutional neural network. The model consists of a front-end module and a context module: the front-end module is an improved VGG-16 structure fused with dilated convolution, and the context module is a cascade of dilated convolution layers with different dilation rates, trained with a two-stage procedure. The proposed network runs in real time with accuracy sufficient for practical applications, and is verified and analyzed on the Cityscapes dataset.
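The abstract gives no implementation details, but the core mechanism it relies on — cascading dilated convolutions with increasing dilation rates to grow the receptive field without extra parameters — can be illustrated independently. The sketch below is a minimal plain-Python 1-D analogue, not the authors' model; all function and variable names are hypothetical.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1-D convolution with a dilation factor.

    With dilation d, the k kernel taps are spaced d samples apart,
    so the kernel covers a receptive field of d*(k-1)+1 samples
    while still using only k weights.
    """
    k = len(kernel)
    span = dilation * (k - 1) + 1  # effective receptive field of one layer
    out = []
    for start in range(len(signal) - span + 1):
        out.append(sum(kernel[i] * signal[start + i * dilation]
                       for i in range(k)))
    return out

# Stacking layers with dilation rates 1, 2, 4, ... grows the overall
# receptive field roughly exponentially with depth -- the idea behind
# a context module built from cascaded dilated convolutions.
x = list(range(16))
y1 = dilated_conv1d(x,  [1, 1, 1], dilation=1)  # each output sees 3 inputs
y2 = dilated_conv1d(y1, [1, 1, 1], dilation=2)  # receptive field grows to 7
y4 = dilated_conv1d(y2, [1, 1, 1], dilation=4)  # receptive field grows to 15
```

In a 2-D segmentation network the same principle lets the context module aggregate scene-level information at full resolution, avoiding the pooling/upsampling accuracy loss that the speed/accuracy trade-off in the abstract refers to.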