Complex Event Detection via Multi-source Video Attributes

2013 IEEE Conference on Computer Vision and Pattern Recognition Pub Date : 2013-06-23 DOI:10.1109/CVPR.2013.339

Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, N. Sebe, Alexander Hauptmann

{"title":"Complex Event Detection via Multi-source Video Attributes","authors":"Zhigang Ma, Yi Yang, Zhongwen Xu, Shuicheng Yan, N. Sebe, Alexander Hauptmann","doi":"10.1109/CVPR.2013.339","DOIUrl":null,"url":null,"abstract":"Complex events essentially include human, scenes, objects and actions that can be summarized by visual attributes, so leveraging relevant attributes properly could be helpful for event detection. Many works have exploited attributes at image level for various applications. However, attributes at image level are possibly insufficient for complex event detection in videos due to their limited capability in characterizing the dynamic properties of video data. Hence, we propose to leverage attributes at video level (named as video attributes in this work), i.e., the semantic labels of external videos are used as attributes. Compared to complex event videos, these external videos contain simple contents such as objects, scenes and actions which are the basic elements of complex events. Specifically, building upon a correlation vector which correlates the attributes and the complex event, we incorporate video attributes latently as extra informative cues into the event detector learnt from complex event videos. Extensive experiments on a real-world large-scale dataset validate the efficacy of the proposed approach.","PeriodicalId":6343,"journal":{"name":"2013 IEEE Conference on Computer Vision and Pattern Recognition","volume":"27 1","pages":"2627-2633"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"73","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Conference on Computer Vision and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2013.339","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 73

Abstract

Complex events essentially include human, scenes, objects and actions that can be summarized by visual attributes, so leveraging relevant attributes properly could be helpful for event detection. Many works have exploited attributes at image level for various applications. However, attributes at image level are possibly insufficient for complex event detection in videos due to their limited capability in characterizing the dynamic properties of video data. Hence, we propose to leverage attributes at video level (named as video attributes in this work), i.e., the semantic labels of external videos are used as attributes. Compared to complex event videos, these external videos contain simple contents such as objects, scenes and actions which are the basic elements of complex events. Specifically, building upon a correlation vector which correlates the attributes and the complex event, we incorporate video attributes latently as extra informative cues into the event detector learnt from complex event videos. Extensive experiments on a real-world large-scale dataset validate the efficacy of the proposed approach.

查看原文本刊更多论文

基于多源视频属性的复杂事件检测

复杂事件本质上包括人、场景、对象和动作，这些都可以通过视觉属性进行总结，因此适当地利用相关属性可能有助于事件检测。许多作品都利用了图像级别的属性用于各种应用。然而，由于图像级属性在表征视频数据动态特性方面的能力有限，它们可能不足以用于视频中的复杂事件检测。因此，我们建议在视频级别利用属性(在本工作中称为视频属性)，即使用外部视频的语义标签作为属性。与复杂事件视频相比，这些外部视频包含简单的内容，如对象、场景和动作，这些是复杂事件的基本元素。具体来说，我们在关联属性和复杂事件的相关向量的基础上，将视频属性作为额外的信息线索潜在地整合到从复杂事件视频中学习的事件检测器中。在真实世界的大规模数据集上进行的大量实验验证了所提出方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Conference on Computer Vision and Pattern Recognition

自引率

0.00%

发文量