Fishy Action Recognition from Cross-Event Integrated Video Dataset Based on Deep 3D and Mixed CNN models

2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT) Pub Date : 2021-09-15 DOI:10.1109/icecct52121.2021.9616808

Raisa Begum, Md. Sajjatul Islam

{"title":"Fishy Action Recognition from Cross-Event Integrated Video Dataset Based on Deep 3D and Mixed CNN models","authors":"Raisa Begum, Md. Sajjatul Islam","doi":"10.1109/icecct52121.2021.9616808","DOIUrl":null,"url":null,"abstract":"As two crucial aspects- 1) temporal features and 2) motion characteristics are to be scrutinized apart from spatial content in analyzing video actions, therefore applying the two-dimensional CNNs alone entail fundamental problems: They do not consider the temporal ordering and eventually collapse the temporal correlations. In this paper, an automatic deep fishy action recognition system has been developed. We have implemented two spatiotemporal convolutional algorithms (pre-trained on Kinetics-400) that are capable of handling information regarding temporal, spatial, and motion in video sequences via residual learning techniques. Six categories of suspicious videos where there are only fine-grained atomic actions from five different datasets have been considered and a composite dataset has been created. To reduce over-fitting and making the dataset more robust and generalized, synthetic data have been procured through the offline augmentation processes. To the best of our knowledge this is the first attempt to recognize malicious actions across an integrated dataset comprising of five cross-event and heterogeneous datasets of actions. Therefore, we had to combat with multifarious challenges of domain shifting. We have achieved the higher level of accuracy.","PeriodicalId":155129,"journal":{"name":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","volume":"41 4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icecct52121.2021.9616808","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

As two crucial aspects- 1) temporal features and 2) motion characteristics are to be scrutinized apart from spatial content in analyzing video actions, therefore applying the two-dimensional CNNs alone entail fundamental problems: They do not consider the temporal ordering and eventually collapse the temporal correlations. In this paper, an automatic deep fishy action recognition system has been developed. We have implemented two spatiotemporal convolutional algorithms (pre-trained on Kinetics-400) that are capable of handling information regarding temporal, spatial, and motion in video sequences via residual learning techniques. Six categories of suspicious videos where there are only fine-grained atomic actions from five different datasets have been considered and a composite dataset has been created. To reduce over-fitting and making the dataset more robust and generalized, synthetic data have been procured through the offline augmentation processes. To the best of our knowledge this is the first attempt to recognize malicious actions across an integrated dataset comprising of five cross-event and heterogeneous datasets of actions. Therefore, we had to combat with multifarious challenges of domain shifting. We have achieved the higher level of accuracy.

查看原文本刊更多论文

基于深度3D和混合CNN模型的跨事件集成视频数据集的可疑动作识别

在分析视频动作时，除了空间内容外，还需要仔细研究两个关键方面- 1)时间特征和2)运动特征，因此单独应用二维cnn会带来根本问题:它们不考虑时间顺序，最终会破坏时间相关性。本文开发了一种深海鱼类动作自动识别系统。我们已经实现了两种时空卷积算法(在Kinetics-400上进行了预训练)，它们能够通过残差学习技术处理视频序列中有关时间、空间和运动的信息。考虑了来自五个不同数据集的六类可疑视频，其中只有细粒度的原子动作，并创建了一个复合数据集。为了减少过度拟合，提高数据集的鲁棒性和泛化性，通过离线增强过程获取合成数据。据我们所知，这是第一次尝试识别跨集成数据集的恶意行为，该数据集由五个跨事件和异构的行为数据集组成。因此，我们必须与领域转移的各种挑战作斗争。我们已经达到了更高的精确度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT)

自引率

0.00%

发文量