{"title":"扩大卷积神经网络的有效接受野以实现更好的语义分割","authors":"Yifan Gu, Zuofeng Zhong, Shuai Wu, Yong Xu","doi":"10.1109/ACPR.2017.7","DOIUrl":null,"url":null,"abstract":"Recently, convolutional neural networks have shown powerful capability in different fields of computer vision, and have become the most effective means for dense prediction problems such as semantic segmentation. However, methods based on fully convolution network(FCN) are inherently limited to the size of the receptive field for each pixel, which leads to the bad performance of predicting object boundary. In this paper, we propose a novel deep neural network module, namely group dilated convolution(GDC), to effectively enlarge the receptive field, and a top-to-down pathway network is exploited simultaneously. The idea is that dilation convolution with different ratios can cover features of different scales, which shows a significant Mean IOU improvement in comparison with the baseline network.","PeriodicalId":426561,"journal":{"name":"2017 4th IAPR Asian Conference on Pattern Recognition (ACPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Enlarging Effective Receptive Field of Convolutional Neural Networks for Better Semantic Segmentation\",\"authors\":\"Yifan Gu, Zuofeng Zhong, Shuai Wu, Yong Xu\",\"doi\":\"10.1109/ACPR.2017.7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, convolutional neural networks have shown powerful capability in different fields of computer vision, and have become the most effective means for dense prediction problems such as semantic segmentation. However, methods based on fully convolution network(FCN) are inherently limited to the size of the receptive field for each pixel, which leads to the bad performance of predicting object boundary. In this paper, we propose a novel deep neural network module, namely group dilated convolution(GDC), to effectively enlarge the receptive field, and a top-to-down pathway network is exploited simultaneously. The idea is that dilation convolution with different ratios can cover features of different scales, which shows a significant Mean IOU improvement in comparison with the baseline network.\",\"PeriodicalId\":426561,\"journal\":{\"name\":\"2017 4th IAPR Asian Conference on Pattern Recognition (ACPR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 4th IAPR Asian Conference on Pattern Recognition (ACPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACPR.2017.7\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 4th IAPR Asian Conference on Pattern Recognition (ACPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACPR.2017.7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enlarging Effective Receptive Field of Convolutional Neural Networks for Better Semantic Segmentation
Recently, convolutional neural networks have shown powerful capability in different fields of computer vision, and have become the most effective means for dense prediction problems such as semantic segmentation. However, methods based on fully convolution network(FCN) are inherently limited to the size of the receptive field for each pixel, which leads to the bad performance of predicting object boundary. In this paper, we propose a novel deep neural network module, namely group dilated convolution(GDC), to effectively enlarge the receptive field, and a top-to-down pathway network is exploited simultaneously. The idea is that dilation convolution with different ratios can cover features of different scales, which shows a significant Mean IOU improvement in comparison with the baseline network.