“Caption” as a Coherence Relation: Evidence and Implications

Malihe Alikhani, Matthew Stone
{"title":"“Caption” as a Coherence Relation: Evidence and Implications","authors":"Malihe Alikhani, Matthew Stone","doi":"10.18653/v1/W19-1806","DOIUrl":null,"url":null,"abstract":"We study verbs in image–text corpora, contrasting caption corpora, where texts are explicitly written to characterize image content, with depiction corpora, where texts and images may stand in more general relations. Captions show a distinctively limited distribution of verbs, with strong preferences for specific tense, aspect, lexical aspect, and semantic field. These limitations, which appear in data elicited by a range of methods, restrict the utility of caption corpora to inform image retrieval, multimodal document generation, and perceptually-grounded semantic models. We suggest that these limitations reflect the discourse constraints in play when subjects write texts to accompany imagery, so we argue that future development of image–text corpora should work to increase the diversity of event descriptions, while looking explicitly at the different ways text and imagery can be coherently related.","PeriodicalId":254607,"journal":{"name":"Proceedings of the Second Workshop on Shortcomings in Vision and Language","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Second Workshop on Shortcomings in Vision and Language","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/v1/W19-1806","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23

Abstract

We study verbs in image–text corpora, contrasting caption corpora, where texts are explicitly written to characterize image content, with depiction corpora, where texts and images may stand in more general relations. Captions show a distinctively limited distribution of verbs, with strong preferences for specific tense, aspect, lexical aspect, and semantic field. These limitations, which appear in data elicited by a range of methods, restrict the utility of caption corpora to inform image retrieval, multimodal document generation, and perceptually-grounded semantic models. We suggest that these limitations reflect the discourse constraints in play when subjects write texts to accompany imagery, so we argue that future development of image–text corpora should work to increase the diversity of event descriptions, while looking explicitly at the different ways text and imagery can be coherently related.
“标题”作为一种连贯关系:证据与启示
我们研究了图像-文本语料库中的动词,对比了标题语料库和描述语料库,标题语料库中文本被明确地写出来以表征图像内容,而描述语料库中文本和图像可能处于更一般的关系中。标题显示出明显有限的动词分布,对特定的时态、方面、词汇方面和语义领域有强烈的偏好。这些限制出现在一系列方法得出的数据中,限制了标题语料库在为图像检索、多模态文档生成和基于感知的语义模型提供信息方面的效用。我们认为,这些限制反映了当受试者写文本伴随图像时的话语约束,因此我们认为,图像-文本语料库的未来发展应该努力增加事件描述的多样性,同时明确地关注文本和图像连贯相关的不同方式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信