{"title":"Attentive Edgemap Fusion for Sketch-Based Image Retrieval","authors":"Yuanchen Guo, Yun Cai, Songhai Zhang","doi":"10.3724/sp.j.1089.2021.18589","DOIUrl":null,"url":null,"abstract":"Sketch-based image retrieval (SBIR) aims to return a collection of corresponding images based on an input sketch. Different from traditional content-based image retrieval, unique difficulties exist due to the large domain gap between sketches and natural images. Based on the similarity between edgemaps and sketches, a novel SBIR model named spatial attentive edgemap fusion is presented which combines both image and edgemap features. Images and the corresponding edgemaps are first encoded to their own latent feature space, and then fused by a learned spatial attention map. Experiment results on two widely-used SBIR datasets, Sketchy and Flickr15K, show the promising performance of the proposed model.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机辅助设计与图形学学报","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/sp.j.1089.2021.18589","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0
Abstract
Sketch-based image retrieval (SBIR) aims to return a collection of corresponding images based on an input sketch. Different from traditional content-based image retrieval, unique difficulties exist due to the large domain gap between sketches and natural images. Based on the similarity between edgemaps and sketches, a novel SBIR model named spatial attentive edgemap fusion is presented which combines both image and edgemap features. Images and the corresponding edgemaps are first encoded to their own latent feature space, and then fused by a learned spatial attention map. Experiment results on two widely-used SBIR datasets, Sketchy and Flickr15K, show the promising performance of the proposed model.