{"title":"Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic Embedding","authors":"Ryotaro Shimizu, Takuma Nakamura, M. Goto","doi":"10.1109/CVPRW59228.2023.00353","DOIUrl":null,"url":null,"abstract":"A novel technology named fashion intelligence system has been proposed to quantify ambiguous expressions unique to fashion, such as \"casual,\" \"adult-casual,\" and \"office-casual,\" and to support users’ understanding of fashion. However, the existing visual-semantic embedding (VSE) model, which is the basis of its system, does not support situations in which images are composed of multiple parts such as hair, tops, pants, skirts, and shoes. We propose partial VSE, which enables sensitive learning for each part of the fashion outfits. This enables five types of practical functionalities, particularly image-retrieval tasks in which changes are made only to the specified parts and image-reordering tasks that focus on the specified parts by the single model. Based on both the multiple unique qualitative and quantitative evaluation experiments, we show the effectiveness of the proposed model.","PeriodicalId":355438,"journal":{"name":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW59228.2023.00353","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
A novel technology called the fashion intelligence system has been proposed to quantify ambiguous expressions unique to fashion, such as "casual," "adult-casual," and "office-casual," and to support users' understanding of fashion. However, the existing visual-semantic embedding (VSE) model on which this system is based does not support situations in which images are composed of multiple parts, such as hair, tops, pants, skirts, and shoes. We propose partial VSE, which enables part-sensitive learning for each element of a fashion outfit. With a single model, this enables five practical functionalities, notably image-retrieval tasks in which changes are made only to the specified parts and image-reordering tasks that focus on the specified parts. Through multiple qualitative and quantitative evaluation experiments, we demonstrate the effectiveness of the proposed model.
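
The abstract describes per-part embedding only at a high level. The sketch below is a minimal illustration, not the authors' implementation, of how a partial VSE model might project each outfit part and a tag expression into a shared space and score them part by part; the part list, feature dimensions, text encoder, and cosine scoring rule are all assumptions made for illustration.

```python
# Minimal illustrative sketch of a partial visual-semantic embedding layout.
# Assumptions (not from the paper): PyTorch, per-part visual features already
# extracted, and a hypothetical part vocabulary and scoring rule.
import torch
import torch.nn as nn
import torch.nn.functional as F

PARTS = ["hair", "tops", "pants", "skirts", "shoes"]  # example part vocabulary

class PartialVSE(nn.Module):
    def __init__(self, visual_dim=2048, text_vocab=10000, embed_dim=512):
        super().__init__()
        # One projection head per outfit part, so each part is embedded
        # separately into the joint visual-semantic space.
        self.visual_heads = nn.ModuleDict(
            {p: nn.Linear(visual_dim, embed_dim) for p in PARTS}
        )
        # Simple averaged tag embedding stands in for the text side.
        self.text_embed = nn.EmbeddingBag(text_vocab, embed_dim, mode="mean")

    def forward(self, part_feats, tag_ids):
        # part_feats: dict part -> (batch, visual_dim) visual features
        # tag_ids:    (batch, n_tags) indices of fashion tags, e.g. "office-casual"
        t = F.normalize(self.text_embed(tag_ids), dim=-1)            # (batch, embed_dim)
        scores = {}
        for p in PARTS:
            v = F.normalize(self.visual_heads[p](part_feats[p]), dim=-1)
            scores[p] = (v * t).sum(-1)                              # per-part cosine similarity
        # Part-wise scores make it possible to retrieve or reorder images
        # while focusing only on a specified part.
        return scores

# Example usage with random features
if __name__ == "__main__":
    model = PartialVSE()
    feats = {p: torch.randn(4, 2048) for p in PARTS}
    tags = torch.randint(0, 10000, (4, 3))
    print({p: s.shape for p, s in model(feats, tags).items()})
```

Keeping a separate projection head per part is one simple way to let a single model answer both part-restricted retrieval and part-focused reordering queries, since each part contributes its own similarity score to the shared tag embedding.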