{"title":"少镜头分割的图关联网络","authors":"Xiaoliu Luo, Taiping Zhang","doi":"10.1109/ICIP42928.2021.9506452","DOIUrl":null,"url":null,"abstract":"Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with a few annotations. Previous methods mainly establish the correspondence between support images and query images with global information. However, human perception does not tend to learn a whole representation in its entirety at once. In this paper, we propose a novel network to build the correspondence from subparts, parts and whole. Our network mainly contain two novel designs: we firstly adopt graph convolutional network to make pixels not only contain the information of each pixel itself but also include its contextual pixels, and then a learnable Graph Affinity Module(GAM) is proposed to mine more accurate relationships as well as common object location inference between the support images and the query images. Experiments on the PASCAL-5i dataset show that our method achieves state-of-the-art performance.","PeriodicalId":314429,"journal":{"name":"2021 IEEE International Conference on Image Processing (ICIP)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Graph Affinity Network for Few-Shot Segmentation\",\"authors\":\"Xiaoliu Luo, Taiping Zhang\",\"doi\":\"10.1109/ICIP42928.2021.9506452\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with a few annotations. Previous methods mainly establish the correspondence between support images and query images with global information. However, human perception does not tend to learn a whole representation in its entirety at once. In this paper, we propose a novel network to build the correspondence from subparts, parts and whole. Our network mainly contain two novel designs: we firstly adopt graph convolutional network to make pixels not only contain the information of each pixel itself but also include its contextual pixels, and then a learnable Graph Affinity Module(GAM) is proposed to mine more accurate relationships as well as common object location inference between the support images and the query images. Experiments on the PASCAL-5i dataset show that our method achieves state-of-the-art performance.\",\"PeriodicalId\":314429,\"journal\":{\"name\":\"2021 IEEE International Conference on Image Processing (ICIP)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Image Processing (ICIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIP42928.2021.9506452\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Image Processing (ICIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIP42928.2021.9506452","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
few -shot segmentation的目的是学习一种可以用少量注释推广到新类的分割模型。以前的方法主要是用全局信息建立支持图像和查询图像之间的对应关系。然而,人类的感知并不倾向于一次性完整地学习整个表象。在本文中,我们提出了一种新的网络来建立子部分、部分和整体之间的对应关系。我们的网络主要有两种新颖的设计:首先采用图卷积网络,使像素不仅包含每个像素本身的信息,还包含其上下文像素,然后提出一个可学习的图关联模块(GAM),在支持图像和查询图像之间挖掘更精确的关系以及共同的目标位置推断。在PASCAL-5i数据集上的实验表明,我们的方法达到了最先进的性能。
Few-shot segmentation aims to learn a segmentation model that can be generalized to novel classes with a few annotations. Previous methods mainly establish the correspondence between support images and query images with global information. However, human perception does not tend to learn a whole representation in its entirety at once. In this paper, we propose a novel network to build the correspondence from subparts, parts and whole. Our network mainly contain two novel designs: we firstly adopt graph convolutional network to make pixels not only contain the information of each pixel itself but also include its contextual pixels, and then a learnable Graph Affinity Module(GAM) is proposed to mine more accurate relationships as well as common object location inference between the support images and the query images. Experiments on the PASCAL-5i dataset show that our method achieves state-of-the-art performance.