{"title":"Methods of Enriching The Flow of Information in The Real-Time Semantic Segmentation Using Deep Neural Networks","authors":"J. Bednarek, K. Piaskowski, Michał Bednarek","doi":"10.23919/SPA.2018.8563422","DOIUrl":null,"url":null,"abstract":"Semantic Segmentation is one of the visual tasks that gained the significant boost in performance in recent years due to the popularization of Convolutional Neural Networks (CNNs). In this paper, we addressed the problem of losing information while changing the size of input images during training neural models. Moreover, our method of downsampling and upsampling could be easily injected into current autoencoder models. We show that without any significant changes in a model architecture it is possible to noticeably improve IoU metric. On popular Cityscapes benchmark, our model is achieving almost 2.5% boost in the accuracy of segmentation in comparison to the widely known ERF model. Additionally, to the ability to real-time usages, we run our network on GPU comparable to NVIDIA Jetson Tx2, what let us implement our algorithm in autonomous vehicles.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SPA.2018.8563422","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Semantic Segmentation is one of the visual tasks that gained the significant boost in performance in recent years due to the popularization of Convolutional Neural Networks (CNNs). In this paper, we addressed the problem of losing information while changing the size of input images during training neural models. Moreover, our method of downsampling and upsampling could be easily injected into current autoencoder models. We show that without any significant changes in a model architecture it is possible to noticeably improve IoU metric. On popular Cityscapes benchmark, our model is achieving almost 2.5% boost in the accuracy of segmentation in comparison to the widely known ERF model. Additionally, to the ability to real-time usages, we run our network on GPU comparable to NVIDIA Jetson Tx2, what let us implement our algorithm in autonomous vehicles.