一种自动驾驶场景的实时语义分割方法

Q3 Computer Science

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI:10.3724/sp.j.1089.2021.18631

Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai

{"title":"一种自动驾驶场景的实时语义分割方法","authors":"Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai","doi":"10.3724/sp.j.1089.2021.18631","DOIUrl":null,"url":null,"abstract":"An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes\",\"authors\":\"Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai\",\"doi\":\"10.3724/sp.j.1089.2021.18631\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.\",\"PeriodicalId\":52442,\"journal\":{\"name\":\"计算机辅助设计与图形学学报\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"计算机辅助设计与图形学学报\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.3724/sp.j.1089.2021.18631\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机辅助设计与图形学学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/sp.j.1089.2021.18631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

自动驾驶的一个重要部分是对汽车驾驶环境的感知，这对能够在低功耗移动设备上实时运行的高精度语义分割算法产生了强烈的需求。然而，在分析影响语义分割网络准确性和速度的因素时，可以发现，在以前的语义分割算法的结构中，空间信息和上下文特征很难同时考虑，使用两个网络分别获得空间信息和上下文信息将增加计算量和存储量。因此，提出了一种基于残差结构将空间路径和上下文路径从网络中分割出来的新结构，并基于该结构设计了一个双路径实时语义分割网络。该网络包含一个特征融合模块和一个注意力细化模块，用于实现融合两个多尺度特征的功能第 7.期秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027条路径，并优化上下文路径的输出结果。该网络基于PyTorch框架，并使用NVIDIA 1080Ti显卡进行实验。在道路场景数据集Cityscapes上，mIoU达到78.8%，运行速度达到27.5帧/秒。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes

An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

计算机辅助设计与图形学学报 Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

1.20

自引率

0.00%

发文量

6833

期刊介绍：