基于相似镜头图的分数传播改进视觉对象检索

J. M. Barrios, J. M. Saavedra
{"title":"基于相似镜头图的分数传播改进视觉对象检索","authors":"J. M. Barrios, J. M. Saavedra","doi":"10.1145/2802558.2814644","DOIUrl":null,"url":null,"abstract":"The Visual Object Retrieval problem consists in locating the occurrences of a specific entity in an image/video dataset. In this work, we focus on discovering new occurrences of an entity by propagating the detection scores of already computed candidates to other video segments. The score propagation follows the edges of a pre-computed Similarity Shot Graph (SSG). The SSG connects video segments that are similar according to some criterion. Four methods for creating the SSG are presented: two based on computing and comparing low-level visual features, one based on comparing text transcriptions, and other based on computing and comparing high-level concepts. The score propagation is evaluated on the INS 2014 dataset. The results show that the detection performance can be slightly improved by the proposed algorithm. However, the performance is variable and depends on the properties of the SSG and the target entity. It is part of the future work to automatically decide the kind of SSG that will be used to propagate scores given a set of detection candidates.","PeriodicalId":115369,"journal":{"name":"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Score Propagation Based on Similarity Shot Graph for Improving Visual Object Retrieval\",\"authors\":\"J. M. Barrios, J. M. Saavedra\",\"doi\":\"10.1145/2802558.2814644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Visual Object Retrieval problem consists in locating the occurrences of a specific entity in an image/video dataset. In this work, we focus on discovering new occurrences of an entity by propagating the detection scores of already computed candidates to other video segments. The score propagation follows the edges of a pre-computed Similarity Shot Graph (SSG). The SSG connects video segments that are similar according to some criterion. Four methods for creating the SSG are presented: two based on computing and comparing low-level visual features, one based on comparing text transcriptions, and other based on computing and comparing high-level concepts. The score propagation is evaluated on the INS 2014 dataset. The results show that the detection performance can be slightly improved by the proposed algorithm. However, the performance is variable and depends on the properties of the SSG and the target entity. It is part of the future work to automatically decide the kind of SSG that will be used to propagate scores given a set of detection candidates.\",\"PeriodicalId\":115369,\"journal\":{\"name\":\"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia\",\"volume\":\"162 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2802558.2814644\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third Edition Workshop on Speech, Language & Audio in Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2802558.2814644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

视觉对象检索问题包括定位图像/视频数据集中出现的特定实体。在这项工作中,我们专注于通过将已经计算的候选对象的检测分数传播到其他视频片段来发现实体的新出现。分数传播遵循预先计算的相似射击图(SSG)的边缘。SSG根据某种标准将相似的视频片段连接起来。提出了创建SSG的四种方法:两种基于计算和比较低级视觉特征,一种基于比较文本转录,另一种基于计算和比较高级概念。在INS 2014数据集上评估分数传播。结果表明,该算法能略微提高检测性能。然而,性能是可变的,取决于SSG和目标实体的属性。这是未来工作的一部分,自动决定将用于传播给定一组检测候选分数的SSG类型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Score Propagation Based on Similarity Shot Graph for Improving Visual Object Retrieval
The Visual Object Retrieval problem consists in locating the occurrences of a specific entity in an image/video dataset. In this work, we focus on discovering new occurrences of an entity by propagating the detection scores of already computed candidates to other video segments. The score propagation follows the edges of a pre-computed Similarity Shot Graph (SSG). The SSG connects video segments that are similar according to some criterion. Four methods for creating the SSG are presented: two based on computing and comparing low-level visual features, one based on comparing text transcriptions, and other based on computing and comparing high-level concepts. The score propagation is evaluated on the INS 2014 dataset. The results show that the detection performance can be slightly improved by the proposed algorithm. However, the performance is variable and depends on the properties of the SSG and the target entity. It is part of the future work to automatically decide the kind of SSG that will be used to propagate scores given a set of detection candidates.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信