A Lightweight Network for Fast Semantic Segmentation

2020 7th International Conference on Behavioural and Social Computing (BESC) Pub Date : 2020-11-05 DOI:10.1109/BESC51023.2020.9348326

Ruiqi Luo, Yuanzhouhan Cao, Yi Jin, Yidong Li

引用次数: 0

Abstract

Semantic segmentation is a fundamental task in computer vision and is widely used in industry. However, current state-of-the-art architectures usually bring heavy computation complexity, making it hard to meet the demand for real-time, and can not be implemented in industry. In this paper, we propose a lightweight network to complete fast segmentation. Our network follows encoder-decoder style, which encodes rich spatial information at shallow layers and gains sufficient semantic information at deep layers. At the decoder part, we use attention mechanism to re-weight features and gradually fuse high-level features back to low-level features. We evaluate our network on Cityscapes dataset. Our method achieves an accuracy of 68.0 % mean intersection over union, and runs at 50.7 frames per second at full resolution (1024x2048) on one NVIDIA GeForce GTX 1080Ti card.

查看原文本刊更多论文

一种快速语义分割的轻量级网络

语义分割是计算机视觉的一项基本任务，在工业中有着广泛的应用。然而，目前的先进架构通常带来巨大的计算复杂度，难以满足实时性的需求，无法在工业中实现。在本文中，我们提出了一个轻量级的网络来完成快速分割。我们的网络采用编码器-解码器的方式，在浅层编码丰富的空间信息，在深层获得足够的语义信息。在解码器部分，我们使用注意机制对特征进行重新加权，并逐步将高级特征融合回低级特征。我们在cityscape数据集上评估我们的网络。我们的方法在NVIDIA GeForce GTX 1080Ti卡上以全分辨率(1024x2048)的50.7帧/秒的速度运行，实现了68.0%的平均交叉集精度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 7th International Conference on Behavioural and Social Computing (BESC)

自引率

0.00%

发文量