Multi-modal Interview Concept Detection for Rushes Exploitation

Anan Liu, Jintao Li, Yongdong Zhang, Sheng Tang, Zhaoxuan Yang
RIAO Conference, 2007-05-30. DOI: 10.5555/1931390.1931407 (https://doi.org/10.5555/1931390.1931407)

Abstract

According to the Large-Scale Concept Ontology for Multimedia (LSCOM) and the requirements of the fourth task of TRECVID 2006, rushes exploitation, "interview" is an important semantic concept for rushes content analysis. This paper presents a shot-level "interview" concept detection method. Face detection and audio classification are applied to detect the "face" and "speech" concepts in each shot. The "interview" concept is then detected by integrating this audiovisual information. The method is intended to support video editing. Large-scale experiments demonstrate the accuracy and effectiveness of the proposed method.
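
The abstract does not specify how the "face" and "speech" concepts are combined. As a minimal sketch only, the following assumes a simple rule in which a shot is labelled "interview" when both concepts are present. The `Shot` structure, the ratio fields, the thresholds, and the AND rule are all hypothetical and are not taken from the paper; the per-shot scores are assumed to come from a separate face detector and audio classifier.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Shot:
    """Per-shot detector outputs (hypothetical structure, not from the paper)."""
    shot_id: str
    face_frame_ratio: float    # fraction of sampled frames containing a face
    speech_frame_ratio: float  # fraction of audio frames classified as speech


def is_interview(shot: Shot, face_thr: float = 0.5, speech_thr: float = 0.5) -> bool:
    """Label a shot "interview" when both the face and speech concepts are present.

    The AND rule and both thresholds are illustrative assumptions.
    """
    has_face = shot.face_frame_ratio >= face_thr
    has_speech = shot.speech_frame_ratio >= speech_thr
    return has_face and has_speech


def detect_interviews(shots: List[Shot]) -> List[str]:
    """Return the ids of shots labelled "interview", in input order."""
    return [s.shot_id for s in shots if is_interview(s)]


# Made-up example values, for illustration only.
shots = [
    Shot("shot_001", face_frame_ratio=0.9, speech_frame_ratio=0.8),
    Shot("shot_002", face_frame_ratio=0.9, speech_frame_ratio=0.1),
    Shot("shot_003", face_frame_ratio=0.0, speech_frame_ratio=0.7),
]
print(detect_interviews(shots))  # prints ['shot_001']
```

Only the first example shot passes both thresholds; the second has a face but little speech, and the third has speech but no face. A weighted or learned fusion could replace the AND rule without changing the shot-level interface.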