Shichao Yan, Lu Chen, Yang Liu, Peng Zhai, Lihua Zhang
{"title":"基于驾驶场景的场景理解网络","authors":"Shichao Yan, Lu Chen, Yang Liu, Peng Zhai, Lihua Zhang","doi":"10.1117/12.2680491","DOIUrl":null,"url":null,"abstract":"Accurate prediction of the surrounding traffic environment is crucial for the safety of autonomous vehicles. However, the limitation of onboard system resources and the complexity and diversity of driving scenes hinder the deployment of scene understanding in the auto-drive system. This paper optimizes the backbone network, uses deep separable convolution to reduce the complexity of network operations and uses multiple attention mechanisms in the decoding stage. On this basis, this paper adopts the shared strategy for the feature extraction module and jointly trains the semantic segmentation and Object detection, which can reduce the network parameters, improve the reasoning speed, and improve the accuracy. We have evaluated the proposed method on the public data set. The results show that our method achieves the most advanced performance and can balance speed and accuracy.","PeriodicalId":201466,"journal":{"name":"Symposium on Advances in Electrical, Electronics and Computer Engineering","volume":"12704 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A scene understanding network based on driving scene\",\"authors\":\"Shichao Yan, Lu Chen, Yang Liu, Peng Zhai, Lihua Zhang\",\"doi\":\"10.1117/12.2680491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate prediction of the surrounding traffic environment is crucial for the safety of autonomous vehicles. However, the limitation of onboard system resources and the complexity and diversity of driving scenes hinder the deployment of scene understanding in the auto-drive system. This paper optimizes the backbone network, uses deep separable convolution to reduce the complexity of network operations and uses multiple attention mechanisms in the decoding stage. On this basis, this paper adopts the shared strategy for the feature extraction module and jointly trains the semantic segmentation and Object detection, which can reduce the network parameters, improve the reasoning speed, and improve the accuracy. We have evaluated the proposed method on the public data set. The results show that our method achieves the most advanced performance and can balance speed and accuracy.\",\"PeriodicalId\":201466,\"journal\":{\"name\":\"Symposium on Advances in Electrical, Electronics and Computer Engineering\",\"volume\":\"12704 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Symposium on Advances in Electrical, Electronics and Computer Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2680491\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Symposium on Advances in Electrical, Electronics and Computer Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2680491","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A scene understanding network based on driving scene
Accurate prediction of the surrounding traffic environment is crucial for the safety of autonomous vehicles. However, the limitation of onboard system resources and the complexity and diversity of driving scenes hinder the deployment of scene understanding in the auto-drive system. This paper optimizes the backbone network, uses deep separable convolution to reduce the complexity of network operations and uses multiple attention mechanisms in the decoding stage. On this basis, this paper adopts the shared strategy for the feature extraction module and jointly trains the semantic segmentation and Object detection, which can reduce the network parameters, improve the reasoning speed, and improve the accuracy. We have evaluated the proposed method on the public data set. The results show that our method achieves the most advanced performance and can balance speed and accuracy.