Hui-Shi Song, Yun Zhou, Zhuqing Jiang, Xiaoqiang Guo, Zixuan Yang
{"title":"Multi-path Fusion Network For Semantic Image Segmentation","authors":"Hui-Shi Song, Yun Zhou, Zhuqing Jiang, Xiaoqiang Guo, Zixuan Yang","doi":"10.1109/ICCCHINA.2018.8641259","DOIUrl":null,"url":null,"abstract":"Recently, deep convolutional neural networks (CNNs) have led to significant improvement over semantic image segmentation and have also been the best choice. In this paper, we propose a deep neural network architecture, Multi-Path Fusion Network (MPFNet), for semantic image segmentation. In MPFNet, we add more convolution paths to every convolution layer. The depth of each convolutional path increases linearly, which provides a superior method for pixel level prediction. Using this method, we integrate contextual information and local information to produce good quality results on the semantic segmentation task. In addition, dense skip connections are added to repeatedly leverage previous features. The proposed approach improves strong baselines built upon VGG16 on two urban scene datasets, CamVid and Cityscapes, which demonstrate its effectiveness in modeling context information.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Recently, deep convolutional neural networks (CNNs) have led to significant improvement over semantic image segmentation and have also been the best choice. In this paper, we propose a deep neural network architecture, Multi-Path Fusion Network (MPFNet), for semantic image segmentation. In MPFNet, we add more convolution paths to every convolution layer. The depth of each convolutional path increases linearly, which provides a superior method for pixel level prediction. Using this method, we integrate contextual information and local information to produce good quality results on the semantic segmentation task. In addition, dense skip connections are added to repeatedly leverage previous features. The proposed approach improves strong baselines built upon VGG16 on two urban scene datasets, CamVid and Cityscapes, which demonstrate its effectiveness in modeling context information.