S. Gao, Changzhu Zhang, Zhuping Wang, Hao Zhang, Chao Huang
{"title":"SPMNet:一个具有可分离金字塔模块的轻量级实时语义分割网络","authors":"S. Gao, Changzhu Zhang, Zhuping Wang, Hao Zhang, Chao Huang","doi":"10.1080/0952813X.2021.1908432","DOIUrl":null,"url":null,"abstract":"ABSTRACT Real-time semantic segmentation aims to generate high-quality prediction in limited time. Recently, with the development of many related potential applications, such as autonomous driving, robot sensing and augmented reality devices, semantic segmentation is desirable to make a trade-off between accuracy and inference speed with limited computation resources. This paper introduces a novel effective and light-weighted network based on Separable Pyramid Module (SPM) to achieve competitive accuracy and inference speed with fewer parameters and computation. Our proposed SPM unit utilises factorised convolution and dilated convolution in the form of a feature pyramid to build a bottleneck structure, which extracts local and context information in a simple but effective way. Experiments on Cityscapes and Camvid datasets demonstrate our superior trade-off between speed and precision. Without pre-training or any additional processing, our SPMNet achieves 71.22% mIoU on Cityscapes test set at the speed of 94 FPS on a single GTX 1080Ti GPU card.","PeriodicalId":15677,"journal":{"name":"Journal of Experimental & Theoretical Artificial Intelligence","volume":"17 1","pages":"651 - 662"},"PeriodicalIF":1.7000,"publicationDate":"2021-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SPMNet: A light-weighted network with separable pyramid module for real-time semantic segmentation\",\"authors\":\"S. Gao, Changzhu Zhang, Zhuping Wang, Hao Zhang, Chao Huang\",\"doi\":\"10.1080/0952813X.2021.1908432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT Real-time semantic segmentation aims to generate high-quality prediction in limited time. Recently, with the development of many related potential applications, such as autonomous driving, robot sensing and augmented reality devices, semantic segmentation is desirable to make a trade-off between accuracy and inference speed with limited computation resources. This paper introduces a novel effective and light-weighted network based on Separable Pyramid Module (SPM) to achieve competitive accuracy and inference speed with fewer parameters and computation. Our proposed SPM unit utilises factorised convolution and dilated convolution in the form of a feature pyramid to build a bottleneck structure, which extracts local and context information in a simple but effective way. Experiments on Cityscapes and Camvid datasets demonstrate our superior trade-off between speed and precision. Without pre-training or any additional processing, our SPMNet achieves 71.22% mIoU on Cityscapes test set at the speed of 94 FPS on a single GTX 1080Ti GPU card.\",\"PeriodicalId\":15677,\"journal\":{\"name\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"volume\":\"17 1\",\"pages\":\"651 - 662\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2021-04-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental & Theoretical Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1080/0952813X.2021.1908432\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental & Theoretical Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/0952813X.2021.1908432","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
SPMNet: A light-weighted network with separable pyramid module for real-time semantic segmentation
ABSTRACT Real-time semantic segmentation aims to generate high-quality prediction in limited time. Recently, with the development of many related potential applications, such as autonomous driving, robot sensing and augmented reality devices, semantic segmentation is desirable to make a trade-off between accuracy and inference speed with limited computation resources. This paper introduces a novel effective and light-weighted network based on Separable Pyramid Module (SPM) to achieve competitive accuracy and inference speed with fewer parameters and computation. Our proposed SPM unit utilises factorised convolution and dilated convolution in the form of a feature pyramid to build a bottleneck structure, which extracts local and context information in a simple but effective way. Experiments on Cityscapes and Camvid datasets demonstrate our superior trade-off between speed and precision. Without pre-training or any additional processing, our SPMNet achieves 71.22% mIoU on Cityscapes test set at the speed of 94 FPS on a single GTX 1080Ti GPU card.
期刊介绍:
Journal of Experimental & Theoretical Artificial Intelligence (JETAI) is a world leading journal dedicated to publishing high quality, rigorously reviewed, original papers in artificial intelligence (AI) research.
The journal features work in all subfields of AI research and accepts both theoretical and applied research. Topics covered include, but are not limited to, the following:
• cognitive science
• games
• learning
• knowledge representation
• memory and neural system modelling
• perception
• problem-solving