Hui-Shi Song, Yun Zhou, Zhuqing Jiang, Xiaoqiang Guo, Zixuan Yang
{"title":"语义图像分割的多路径融合网络","authors":"Hui-Shi Song, Yun Zhou, Zhuqing Jiang, Xiaoqiang Guo, Zixuan Yang","doi":"10.1109/ICCCHINA.2018.8641259","DOIUrl":null,"url":null,"abstract":"Recently, deep convolutional neural networks (CNNs) have led to significant improvement over semantic image segmentation and have also been the best choice. In this paper, we propose a deep neural network architecture, Multi-Path Fusion Network (MPFNet), for semantic image segmentation. In MPFNet, we add more convolution paths to every convolution layer. The depth of each convolutional path increases linearly, which provides a superior method for pixel level prediction. Using this method, we integrate contextual information and local information to produce good quality results on the semantic segmentation task. In addition, dense skip connections are added to repeatedly leverage previous features. The proposed approach improves strong baselines built upon VGG16 on two urban scene datasets, CamVid and Cityscapes, which demonstrate its effectiveness in modeling context information.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Multi-path Fusion Network For Semantic Image Segmentation\",\"authors\":\"Hui-Shi Song, Yun Zhou, Zhuqing Jiang, Xiaoqiang Guo, Zixuan Yang\",\"doi\":\"10.1109/ICCCHINA.2018.8641259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, deep convolutional neural networks (CNNs) have led to significant improvement over semantic image segmentation and have also been the best choice. In this paper, we propose a deep neural network architecture, Multi-Path Fusion Network (MPFNet), for semantic image segmentation. In MPFNet, we add more convolution paths to every convolution layer. The depth of each convolutional path increases linearly, which provides a superior method for pixel level prediction. Using this method, we integrate contextual information and local information to produce good quality results on the semantic segmentation task. In addition, dense skip connections are added to repeatedly leverage previous features. The proposed approach improves strong baselines built upon VGG16 on two urban scene datasets, CamVid and Cityscapes, which demonstrate its effectiveness in modeling context information.\",\"PeriodicalId\":170216,\"journal\":{\"name\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCHINA.2018.8641259\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-path Fusion Network For Semantic Image Segmentation
Recently, deep convolutional neural networks (CNNs) have led to significant improvement over semantic image segmentation and have also been the best choice. In this paper, we propose a deep neural network architecture, Multi-Path Fusion Network (MPFNet), for semantic image segmentation. In MPFNet, we add more convolution paths to every convolution layer. The depth of each convolutional path increases linearly, which provides a superior method for pixel level prediction. Using this method, we integrate contextual information and local information to produce good quality results on the semantic segmentation task. In addition, dense skip connections are added to repeatedly leverage previous features. The proposed approach improves strong baselines built upon VGG16 on two urban scene datasets, CamVid and Cityscapes, which demonstrate its effectiveness in modeling context information.