Multi-modal Interview Concept Detection for Rushes Exploitation

RIAO Conference Pub Date : 2007-05-30 DOI:10.5555/1931390.1931407

Anan Liu, Jintao Li, Yongdong Zhang, Sheng Tang, Zhaoxuan Yang

引用次数: 0

Abstract

According to the concepts of Large-Scale Concept Ontology for Multimedia (LSCOM) and requirement of the 4th task in the 2006 TRECVID, i.e., rushes exploitation, the "interview" concept is an important semantic concept for rushes content analysis. The paper presents the shot-level "interview" concept detection method. Face detection and audio classification are implemented to detect "face" and "speech" concepts for each shot. By integrating audiovisual information, "interview" concept is finally detected. The utilization of the method will definitely benefit the video edit. Large-scale experimental results strongly demonstrate the accuracy and effectiveness of the proposed method.

查看原文本刊更多论文

rush开发中的多模态访谈概念检测

根据多媒体大规模概念本体(Large-Scale Concept Ontology for Multimedia, LSCOM)的概念和2006 TRECVID中第4个任务(rush exploitation)的要求，“采访”概念是rush内容分析的一个重要语义概念。本文提出了镜头级“采访”概念检测方法。实现人脸检测和音频分类，检测每个镜头的“人脸”和“语音”概念。通过整合视听信息，最终检测出“采访”概念。该方法的应用将为视频编辑带来一定的好处。大规模实验结果有力地证明了该方法的准确性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

RIAO Conference

自引率

0.00%

发文量