{"title":"语义分割的超分辨率","authors":"Xuan‐Zhi Zhang, Guoping Xu, Wentao Liao, Xing Wu","doi":"10.1117/12.2643026","DOIUrl":null,"url":null,"abstract":"Image segmentation is a classical problem in the field of computer vision. With the extensive development of deep learning, it has achieved much progress in semantic segmentation. However, the mainstream networks used in deep learning such as Fast-SCNN, U-Net, which still face challenges in image segmentation. A common problem is that linear interpolation is used in the up-sampling stage of these networks to obtain high-resolution images. Due to the lack of sufficient feature information, the contours of the objects in the image are blurred and grided. For this reason, we propose a new super-resolution (SR) method to replace the up-sampling with linear interpolation in the network model. Five representative networks integrated with our proposed SR module are used for verification on the CamVid data set. The experimental results show that our method has a 2%~4% improvement in mIoU (the mean value of Intersection over Union) and a 2%~3% improvement in pixel accuracy, which demonstrates its generalization and effectiveness of our method.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Super-resolution for semantic segmentation\",\"authors\":\"Xuan‐Zhi Zhang, Guoping Xu, Wentao Liao, Xing Wu\",\"doi\":\"10.1117/12.2643026\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image segmentation is a classical problem in the field of computer vision. With the extensive development of deep learning, it has achieved much progress in semantic segmentation. However, the mainstream networks used in deep learning such as Fast-SCNN, U-Net, which still face challenges in image segmentation. A common problem is that linear interpolation is used in the up-sampling stage of these networks to obtain high-resolution images. Due to the lack of sufficient feature information, the contours of the objects in the image are blurred and grided. For this reason, we propose a new super-resolution (SR) method to replace the up-sampling with linear interpolation in the network model. Five representative networks integrated with our proposed SR module are used for verification on the CamVid data set. The experimental results show that our method has a 2%~4% improvement in mIoU (the mean value of Intersection over Union) and a 2%~3% improvement in pixel accuracy, which demonstrates its generalization and effectiveness of our method.\",\"PeriodicalId\":314555,\"journal\":{\"name\":\"International Conference on Digital Image Processing\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Digital Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2643026\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Digital Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2643026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
图像分割是计算机视觉领域的一个经典问题。随着深度学习的广泛发展,它在语义分割方面取得了很大的进展。然而,用于深度学习的主流网络如Fast-SCNN、U-Net在图像分割方面仍然面临挑战。一个常见的问题是在这些网络的上采样阶段使用线性插值来获得高分辨率图像。由于缺乏足够的特征信息,图像中物体的轮廓被模糊和网格化。为此,我们提出了一种新的超分辨率(SR)方法,用线性插值代替网络模型中的上采样。结合我们提出的SR模块的五个代表性网络用于在CamVid数据集上进行验证。实验结果表明,该方法在mIoU (Intersection over Union的均值)和像元精度上分别提高了2%~4%和2%~3%,证明了该方法的泛化和有效性。
Image segmentation is a classical problem in the field of computer vision. With the extensive development of deep learning, it has achieved much progress in semantic segmentation. However, the mainstream networks used in deep learning such as Fast-SCNN, U-Net, which still face challenges in image segmentation. A common problem is that linear interpolation is used in the up-sampling stage of these networks to obtain high-resolution images. Due to the lack of sufficient feature information, the contours of the objects in the image are blurred and grided. For this reason, we propose a new super-resolution (SR) method to replace the up-sampling with linear interpolation in the network model. Five representative networks integrated with our proposed SR module are used for verification on the CamVid data set. The experimental results show that our method has a 2%~4% improvement in mIoU (the mean value of Intersection over Union) and a 2%~3% improvement in pixel accuracy, which demonstrates its generalization and effectiveness of our method.