Comparing retrieval effectiveness of alternative content segmentation methods for Internet video search

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI) Pub Date : 2012-06-27 DOI:10.1109/CBMI.2012.6269810

Maria Eskevich, G. Jones, Christian Wartena, M. Larson, Robin Aly, T. Verschoor, R. Ordelman

{"title":"Comparing retrieval effectiveness of alternative content segmentation methods for Internet video search","authors":"Maria Eskevich, G. Jones, Christian Wartena, M. Larson, Robin Aly, T. Verschoor, R. Ordelman","doi":"10.1109/CBMI.2012.6269810","DOIUrl":null,"url":null,"abstract":"We present an exploratory study of the retrieval of semiprofessional user-generated Internet video. The study is based on the MediaEval 2011 Rich Speech Retrieval (RSR) task for which the dataset was taken from the Internet sharing platform blip.tv, and search queries associated with specific speech acts occurring in the video. We compare results from three participant groups using: automatic speech recognition system transcript (ASR), metadata manually assigned to each video by the user who uploaded it, and their combination. RSR 2011 was a known-item search for a single manually identified ideal jump-in point in the video for each query where playback should begin. Retrieval effectiveness is measured using the MRR and mGAP metrics. Using different transcript segmentation methods the participants tried to maximize the rank of the relevant item and to locate the nearest match to the ideal jump-in point. Results indicate that best overall results are obtained for topically homogeneous segments which have a strong overlap with the relevant region associated with the jump-in point, and that use of metadata can be beneficial when segments are unfocused or cover more than one topic.","PeriodicalId":120769,"journal":{"name":"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2012.6269810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

We present an exploratory study of the retrieval of semiprofessional user-generated Internet video. The study is based on the MediaEval 2011 Rich Speech Retrieval (RSR) task for which the dataset was taken from the Internet sharing platform blip.tv, and search queries associated with specific speech acts occurring in the video. We compare results from three participant groups using: automatic speech recognition system transcript (ASR), metadata manually assigned to each video by the user who uploaded it, and their combination. RSR 2011 was a known-item search for a single manually identified ideal jump-in point in the video for each query where playback should begin. Retrieval effectiveness is measured using the MRR and mGAP metrics. Using different transcript segmentation methods the participants tried to maximize the rank of the relevant item and to locate the nearest match to the ideal jump-in point. Results indicate that best overall results are obtained for topically homogeneous segments which have a strong overlap with the relevant region associated with the jump-in point, and that use of metadata can be beneficial when segments are unfocused or cover more than one topic.

查看原文本刊更多论文

网络视频搜索中不同内容分割方法的检索效果比较

我们提出了半专业用户生成的互联网视频检索的探索性研究。该研究基于MediaEval 2011富语音检索(RSR)任务，该任务的数据集取自互联网共享平台blip。电视，以及与视频中出现的特定言语行为相关的搜索查询。我们使用自动语音识别系统记录(ASR)、上传视频的用户手动分配给每个视频的元数据以及它们的组合来比较三个参与者组的结果。RSR 2011是一个已知项搜索，为每个查询在视频中应该开始播放的地方手动确定一个理想的跳入点。使用MRR和mGAP度量度量检索效率。使用不同的转录片段分割方法，参与者试图最大化相关项目的排名，并找到最接近理想跳入点的匹配。结果表明，对于与跳跃点相关区域有很强重叠的主题同质片段，可以获得最佳的总体结果，并且当片段不集中或覆盖多个主题时，元数据的使用可能是有益的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI)

自引率

0.00%

发文量