Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai
{"title":"一种自动驾驶场景的实时语义分割方法","authors":"Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai","doi":"10.3724/sp.j.1089.2021.18631","DOIUrl":null,"url":null,"abstract":"An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期 秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes\",\"authors\":\"Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai\",\"doi\":\"10.3724/sp.j.1089.2021.18631\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期 秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.\",\"PeriodicalId\":52442,\"journal\":{\"name\":\"计算机辅助设计与图形学学报\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"计算机辅助设计与图形学学报\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.3724/sp.j.1089.2021.18631\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机辅助设计与图形学学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/sp.j.1089.2021.18631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes
An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期 秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.