Anan Liu, Jintao Li, Yongdong Zhang, Sheng Tang, Zhaoxuan Yang
{"title":"rush开发中的多模态访谈概念检测","authors":"Anan Liu, Jintao Li, Yongdong Zhang, Sheng Tang, Zhaoxuan Yang","doi":"10.5555/1931390.1931407","DOIUrl":null,"url":null,"abstract":"According to the concepts of Large-Scale Concept Ontology for Multimedia (LSCOM) and requirement of the 4th task in the 2006 TRECVID, i.e., rushes exploitation, the \"interview\" concept is an important semantic concept for rushes content analysis. The paper presents the shot-level \"interview\" concept detection method. Face detection and audio classification are implemented to detect \"face\" and \"speech\" concepts for each shot. By integrating audiovisual information, \"interview\" concept is finally detected. The utilization of the method will definitely benefit the video edit. Large-scale experimental results strongly demonstrate the accuracy and effectiveness of the proposed method.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"198 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-modal Interview Concept Detection for Rushes Exploitation\",\"authors\":\"Anan Liu, Jintao Li, Yongdong Zhang, Sheng Tang, Zhaoxuan Yang\",\"doi\":\"10.5555/1931390.1931407\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"According to the concepts of Large-Scale Concept Ontology for Multimedia (LSCOM) and requirement of the 4th task in the 2006 TRECVID, i.e., rushes exploitation, the \\\"interview\\\" concept is an important semantic concept for rushes content analysis. The paper presents the shot-level \\\"interview\\\" concept detection method. Face detection and audio classification are implemented to detect \\\"face\\\" and \\\"speech\\\" concepts for each shot. By integrating audiovisual information, \\\"interview\\\" concept is finally detected. The utilization of the method will definitely benefit the video edit. Large-scale experimental results strongly demonstrate the accuracy and effectiveness of the proposed method.\",\"PeriodicalId\":120472,\"journal\":{\"name\":\"RIAO Conference\",\"volume\":\"198 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RIAO Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5555/1931390.1931407\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RIAO Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5555/1931390.1931407","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
根据多媒体大规模概念本体(Large-Scale Concept Ontology for Multimedia, LSCOM)的概念和2006 TRECVID中第4个任务(rush exploitation)的要求,“采访”概念是rush内容分析的一个重要语义概念。本文提出了镜头级“采访”概念检测方法。实现人脸检测和音频分类,检测每个镜头的“人脸”和“语音”概念。通过整合视听信息,最终检测出“采访”概念。该方法的应用将为视频编辑带来一定的好处。大规模实验结果有力地证明了该方法的准确性和有效性。
Multi-modal Interview Concept Detection for Rushes Exploitation
According to the concepts of Large-Scale Concept Ontology for Multimedia (LSCOM) and requirement of the 4th task in the 2006 TRECVID, i.e., rushes exploitation, the "interview" concept is an important semantic concept for rushes content analysis. The paper presents the shot-level "interview" concept detection method. Face detection and audio classification are implemented to detect "face" and "speech" concepts for each shot. By integrating audiovisual information, "interview" concept is finally detected. The utilization of the method will definitely benefit the video edit. Large-scale experimental results strongly demonstrate the accuracy and effectiveness of the proposed method.