图像中文化事件识别的混合融合

Shivansh Srivastava, Bappaditya Mandal, Anirban Chakraborty
{"title":"图像中文化事件识别的混合融合","authors":"Shivansh Srivastava, Bappaditya Mandal, Anirban Chakraborty","doi":"10.1049/icp.2021.1455","DOIUrl":null,"url":null,"abstract":"Understanding high level semantic concepts in images requires information from various modalities of visual concepts. One such task is recognition of events based on still images, which requires simultaneous reasoning about high level semantic concepts like objects, people, scenes and their interactions. In this work, we explore different strategies to fuse object and scene information in images to aid the task of cultural event recognition. We start with early and late fusion strategies to combine object and scene level information to reason about event classes. To support our hypothesis that early fused models are able to extract complementary object and scene information, we propose the use of guided backpropagation method to visualize image activations. Inspection of image activations gives an essence of object-scene complementarity in case of early fusion which is not observed in the case of late fused models. As extensions to early and late fusion techniques, we propose HFCER, a hybrid fusion framework along with an alternating training scheme. The proposed technique shows improvement over its late and early fusion counterparts. Late fusion of different fusion techniques namely late, early and hybrid fusion shows state of the art results on Chalearn LAP cultural event recognition dataset.","PeriodicalId":431144,"journal":{"name":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HFCER : Hybrid Fusion for Cultural Event Recognition in Images\",\"authors\":\"Shivansh Srivastava, Bappaditya Mandal, Anirban Chakraborty\",\"doi\":\"10.1049/icp.2021.1455\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Understanding high level semantic concepts in images requires information from various modalities of visual concepts. One such task is recognition of events based on still images, which requires simultaneous reasoning about high level semantic concepts like objects, people, scenes and their interactions. In this work, we explore different strategies to fuse object and scene information in images to aid the task of cultural event recognition. We start with early and late fusion strategies to combine object and scene level information to reason about event classes. To support our hypothesis that early fused models are able to extract complementary object and scene information, we propose the use of guided backpropagation method to visualize image activations. Inspection of image activations gives an essence of object-scene complementarity in case of early fusion which is not observed in the case of late fused models. As extensions to early and late fusion techniques, we propose HFCER, a hybrid fusion framework along with an alternating training scheme. The proposed technique shows improvement over its late and early fusion counterparts. Late fusion of different fusion techniques namely late, early and hybrid fusion shows state of the art results on Chalearn LAP cultural event recognition dataset.\",\"PeriodicalId\":431144,\"journal\":{\"name\":\"11th International Conference of Pattern Recognition Systems (ICPRS 2021)\",\"volume\":\"57 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"11th International Conference of Pattern Recognition Systems (ICPRS 2021)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1049/icp.2021.1455\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th International Conference of Pattern Recognition Systems (ICPRS 2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/icp.2021.1455","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

理解图像中的高级语义概念需要来自视觉概念的各种形态的信息。其中一项任务是基于静止图像的事件识别,这需要同时对物体、人物、场景及其相互作用等高级语义概念进行推理。在这项工作中,我们探索了不同的策略来融合图像中的物体和场景信息,以帮助完成文化事件识别的任务。我们从早期和晚期融合策略开始,结合对象和场景级别信息来推断事件类别。为了支持我们的假设,即早期融合模型能够提取互补的目标和场景信息,我们提出使用引导反向传播方法来可视化图像激活。图像激活检测在早期融合的情况下提供了物体-场景互补性的本质,而在后期融合模型的情况下则没有观察到。作为早期和晚期融合技术的扩展,我们提出了HFCER,一种混合融合框架以及交替训练方案。与早期和晚期的融合技术相比,所提出的技术有了改进。不同融合技术的后期融合,即晚期融合、早期融合和混合融合,显示了Chalearn LAP文化事件识别数据集上的最新结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
HFCER : Hybrid Fusion for Cultural Event Recognition in Images
Understanding high level semantic concepts in images requires information from various modalities of visual concepts. One such task is recognition of events based on still images, which requires simultaneous reasoning about high level semantic concepts like objects, people, scenes and their interactions. In this work, we explore different strategies to fuse object and scene information in images to aid the task of cultural event recognition. We start with early and late fusion strategies to combine object and scene level information to reason about event classes. To support our hypothesis that early fused models are able to extract complementary object and scene information, we propose the use of guided backpropagation method to visualize image activations. Inspection of image activations gives an essence of object-scene complementarity in case of early fusion which is not observed in the case of late fused models. As extensions to early and late fusion techniques, we propose HFCER, a hybrid fusion framework along with an alternating training scheme. The proposed technique shows improvement over its late and early fusion counterparts. Late fusion of different fusion techniques namely late, early and hybrid fusion shows state of the art results on Chalearn LAP cultural event recognition dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信