Aligning Visual and Lexical Semantics

Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online) Pub Date : 2022-12-13 DOI:10.48550/arXiv.2212.06629

Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

引用次数: 3

Abstract

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.

查看原文本刊更多论文

对齐视觉和词汇语义

我们讨论了与计算机视觉(CV)系统相关的两种语义——视觉语义和词汇语义。视觉语义学关注的是人类在使用视觉感知目标现实时如何构建概念，而词汇语义学关注的是人类如何通过使用语言构建相同目标现实的概念。视觉语义和词汇语义之间缺乏一致性，反过来又以语义缺口问题(SGP)的形式对CV系统产生重大影响。本文虽然广泛地举例说明了上述巧合的缺乏，但引入了一种通用的、领域不可知论的方法来强制视觉语义和词汇语义之间的对齐。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)

自引率

0.00%

发文量