Multi-instance Referring Image Segmentation of Scene Sketches based on Global Reference Mechanism

Proceedings. Pacific Conference on Computer Graphics and Applications. Pub Date: 2022-01-01. DOI: 10.2312/pg.20221238

Pengyang Ling, Haoran Mo, Chengying Gao

Pages: 7-12

Abstract

Scene sketch segmentation based on referring expressions plays an important role in sketch editing in the anime industry. While most existing referring image segmentation approaches are designed for the standard task of generating a binary segmentation mask for a single target or a group of targets, we think it necessary to equip these models with the ability to perform multi-instance segmentation. To this end, we propose GRM-Net, a one-stage framework tailored for multi-instance referring image segmentation of scene sketches. We extract language features from the expression and fuse them into a conventional instance segmentation pipeline, filtering out undesired instances in a coarse-to-fine manner and keeping the matched ones. To model the relative arrangement of the objects and the relationships among them from a global view, we propose a global reference mechanism (GRM) that assigns references to each detected candidate to identify its position. We compare against existing methods designed for multi-instance referring image segmentation of scene sketches and for the standard task of referring image segmentation, and the results demonstrate the effectiveness and superiority of our approach.
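The coarse-to-fine filtering idea can be illustrated with a small toy sketch. This is not GRM-Net: the abstract does not specify the feature extractors, the fusion step, or how references are computed, so everything below is an assumption made for illustration. The names `Candidate`, `global_rank`, and `select_instances`, the hand-written feature vectors, the thresholds, and the use of a left-to-right rank as a stand-in for a "global reference" are all hypothetical.

```python
from dataclasses import dataclass
from math import sqrt
from typing import List, Sequence


@dataclass
class Candidate:
    """A detected instance (hypothetical structure, not from the paper)."""
    label: str
    cx: float                 # horizontal box centre, normalised to [0, 1]
    feat: Sequence[float]     # toy visual feature vector


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sqrt(sum(x * x for x in a))
    nb = sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def global_rank(cands: List[Candidate]) -> List[float]:
    """Assumed stand-in for a global reference: each candidate's
    left-to-right rank among all given candidates, scaled to [0, 1]."""
    order = sorted(range(len(cands)), key=lambda i: cands[i].cx)
    n = len(cands)
    ranks = [0.0] * n
    for r, i in enumerate(order):
        ranks[i] = r / (n - 1) if n > 1 else 0.0
    return ranks


def select_instances(cands: List[Candidate], lang_feat: Sequence[float],
                     target_pos: float, sim_thr: float = 0.5,
                     pos_tol: float = 0.5) -> List[Candidate]:
    """Coarse stage: drop candidates whose feature does not match the
    language feature. Fine stage: keep survivors whose relative position
    is within pos_tol of the position implied by the expression."""
    coarse = [c for c in cands if cosine(c.feat, lang_feat) >= sim_thr]
    ranks = global_rank(coarse)
    return [c for c, r in zip(coarse, ranks) if abs(r - target_pos) <= pos_tol]


# Toy scene: three trees and a house; expression ~ "the trees on the left".
scene = [
    Candidate("tree", 0.1, [1.0, 0.0]),
    Candidate("house", 0.5, [0.0, 1.0]),
    Candidate("tree", 0.3, [1.0, 0.0]),
    Candidate("tree", 0.9, [1.0, 0.0]),
]
picked = select_instances(scene, lang_feat=[1.0, 0.0], target_pos=0.0)
print([(c.label, c.cx) for c in picked])
```

The point of the sketch is only that the output is a list of separate instances rather than one merged binary mask, and that a candidate's position is judged relative to the other candidates rather than in isolation.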
