Automatic Story Segmentation for TV News Video Using Multiple Modalities

Emilie Dumont, G. Quénot
{"title":"Automatic Story Segmentation for TV News Video Using Multiple Modalities","authors":"Emilie Dumont, G. Quénot","doi":"10.1155/2012/732514","DOIUrl":null,"url":null,"abstract":"While video content is often stored in rather large files or broadcasted in continuous streams, users are often interested in retrieving only a particular passage on a topic of interest to them. It is, therefore, necessary to split video documents or streams into shorter segments corresponding to appropriate retrieval units. We propose here a method for the automatic segmentation of TV news videos into stories. A-multiple-descriptor based segmentation approach is proposed. The selected multimodal features are complementary and give good insights about story boundaries. Once extracted, these features are expanded with a local temporal context and combined by an early fusion process. The story boundaries are then predicted using machine learning techniques. We investigate the system by experiments conducted using TRECVID 2003 data and protocol of the story boundary detection task, and we show that the proposed approach outperforms the state-of-the-art methods while requiring a very small amount of manual annotation.","PeriodicalId":204253,"journal":{"name":"Int. J. Digit. Multim. Broadcast.","volume":"54 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Digit. Multim. Broadcast.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1155/2012/732514","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 31

Abstract

While video content is often stored in rather large files or broadcasted in continuous streams, users are often interested in retrieving only a particular passage on a topic of interest to them. It is, therefore, necessary to split video documents or streams into shorter segments corresponding to appropriate retrieval units. We propose here a method for the automatic segmentation of TV news videos into stories. A-multiple-descriptor based segmentation approach is proposed. The selected multimodal features are complementary and give good insights about story boundaries. Once extracted, these features are expanded with a local temporal context and combined by an early fusion process. The story boundaries are then predicted using machine learning techniques. We investigate the system by experiments conducted using TRECVID 2003 data and protocol of the story boundary detection task, and we show that the proposed approach outperforms the state-of-the-art methods while requiring a very small amount of manual annotation.
基于多模态的电视新闻视频故事自动分割
虽然视频内容通常存储在相当大的文件中或以连续流的形式播放,但用户通常只对检索他们感兴趣的主题的特定段落感兴趣。因此,有必要将视像文件或视像流分成与适当检索单元相对应的较短片段。本文提出了一种将电视新闻视频自动分割成故事的方法。提出了基于a -多描述符的分割方法。所选择的多模式特征是互补的,可以很好地了解故事边界。一旦提取出来,这些特征就会被扩展到局部时间背景中,并通过早期融合过程进行组合。然后使用机器学习技术预测故事边界。我们使用TRECVID 2003数据和故事边界检测任务协议对该系统进行了实验研究,结果表明,该方法在需要非常少的手工注释的同时,优于当前最先进的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信