BrutNet：使用带有 GRU 的 DCNN 进行暴力检测和分类的新方法

The Journal of Engineering Pub Date : 2024-04-01 DOI:10.1049/tje2.12375

M. Haque, Hussain Nyeem, Syma Afsha

{"title":"BrutNet：使用带有 GRU 的 DCNN 进行暴力检测和分类的新方法","authors":"M. Haque, Hussain Nyeem, Syma Afsha","doi":"10.1049/tje2.12375","DOIUrl":null,"url":null,"abstract":"Automatic Violence Detection and Classification (AVDC) with deep learning has garnered significant attention in computer vision research. This paper presents a novel approach for combining a custom Deep Convolutional Neural Network (DCNN) with a Gated Recurrent Unit (GRU) in developing a new AVDC model called BrutNet. Specifically, a time‐distributed DCNN (TD‐DCNN) is developed to generate a compact 2D representation with 512 spatial features per frame from a set of equally‐spaced frames of dimension 16090 in short video segments. Further to leverage the temporal information, a GRU layer is utilised, generating a condensed 1D vector that enables binary classification of violent or non‐violent content through multiple dense layers. Overfitting is addressed by incorporating dropout layers with a rate of 0.5, while the hidden and output layers employ rectified linear unit (ReLU) and sigmoid activations, respectively. The model is trained on the NVIDIA Tesla K80 GPU through Google Colab, demonstrating superior performance compared to existing models across various video datasets, including hockey fights, movie fights, AVD, and RWF‐2000. Notably, the model stands out by requiring only 3.416 million parameters and achieving impressive test accuracies of 97.62%, 100%, 97.22%, and 86.43% on the respective datasets. Thus, BrutNet exhibits the potential to emerge as a highly efficient and robust AVDC model in support of greater public safety, content moderation and censorship, computer‐aided investigations, and law enforcement.","PeriodicalId":22858,"journal":{"name":"The Journal of Engineering","volume":"37 10","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"BrutNet: A novel approach for violence detection and classification using DCNN with GRU\",\"authors\":\"M. Haque, Hussain Nyeem, Syma Afsha\",\"doi\":\"10.1049/tje2.12375\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic Violence Detection and Classification (AVDC) with deep learning has garnered significant attention in computer vision research. This paper presents a novel approach for combining a custom Deep Convolutional Neural Network (DCNN) with a Gated Recurrent Unit (GRU) in developing a new AVDC model called BrutNet. Specifically, a time‐distributed DCNN (TD‐DCNN) is developed to generate a compact 2D representation with 512 spatial features per frame from a set of equally‐spaced frames of dimension 16090 in short video segments. Further to leverage the temporal information, a GRU layer is utilised, generating a condensed 1D vector that enables binary classification of violent or non‐violent content through multiple dense layers. Overfitting is addressed by incorporating dropout layers with a rate of 0.5, while the hidden and output layers employ rectified linear unit (ReLU) and sigmoid activations, respectively. The model is trained on the NVIDIA Tesla K80 GPU through Google Colab, demonstrating superior performance compared to existing models across various video datasets, including hockey fights, movie fights, AVD, and RWF‐2000. Notably, the model stands out by requiring only 3.416 million parameters and achieving impressive test accuracies of 97.62%, 100%, 97.22%, and 86.43% on the respective datasets. Thus, BrutNet exhibits the potential to emerge as a highly efficient and robust AVDC model in support of greater public safety, content moderation and censorship, computer‐aided investigations, and law enforcement.\",\"PeriodicalId\":22858,\"journal\":{\"name\":\"The Journal of Engineering\",\"volume\":\"37 10\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1049/tje2.12375\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1049/tje2.12375","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

利用深度学习进行自动暴力检测与分类（AVDC）在计算机视觉研究领域备受关注。本文介绍了一种结合定制深度卷积神经网络（DCNN）和门控递归单元（GRU）的新方法，用于开发一种名为 BrutNet 的新型 AVDC 模型。具体来说，我们开发了一种时间分布式 DCNN（TD-DCNN），用于从短视频片段中维度为 16090 的等间距帧集生成紧凑的二维表示，每帧具有 512 个空间特征。此外，为了充分利用时间信息，还利用了 GRU 层，生成了一个浓缩的一维向量，通过多个密集层对暴力或非暴力内容进行二元分类。通过加入比率为 0.5 的剔除层来解决过拟合问题，而隐藏层和输出层则分别采用整流线性单元（ReLU）和 sigmoid 激活。该模型通过 Google Colab 在 NVIDIA Tesla K80 GPU 上进行了训练，在各种视频数据集（包括曲棍球格斗、电影格斗、AVD 和 RWF-2000）中与现有模型相比表现出了卓越的性能。值得注意的是，该模型仅需 341.6 万个参数，在各个数据集上的测试准确率分别达到 97.62%、100%、97.22% 和 86.43%，令人印象深刻。因此，BrutNet 有可能成为一种高效、稳健的 AVDC 模型，为加强公共安全、内容控制和审查、计算机辅助调查和执法提供支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

BrutNet: A novel approach for violence detection and classification using DCNN with GRU

Automatic Violence Detection and Classification (AVDC) with deep learning has garnered significant attention in computer vision research. This paper presents a novel approach for combining a custom Deep Convolutional Neural Network (DCNN) with a Gated Recurrent Unit (GRU) in developing a new AVDC model called BrutNet. Specifically, a time‐distributed DCNN (TD‐DCNN) is developed to generate a compact 2D representation with 512 spatial features per frame from a set of equally‐spaced frames of dimension 16090 in short video segments. Further to leverage the temporal information, a GRU layer is utilised, generating a condensed 1D vector that enables binary classification of violent or non‐violent content through multiple dense layers. Overfitting is addressed by incorporating dropout layers with a rate of 0.5, while the hidden and output layers employ rectified linear unit (ReLU) and sigmoid activations, respectively. The model is trained on the NVIDIA Tesla K80 GPU through Google Colab, demonstrating superior performance compared to existing models across various video datasets, including hockey fights, movie fights, AVD, and RWF‐2000. Notably, the model stands out by requiring only 3.416 million parameters and achieving impressive test accuracies of 97.62%, 100%, 97.22%, and 86.43% on the respective datasets. Thus, BrutNet exhibits the potential to emerge as a highly efficient and robust AVDC model in support of greater public safety, content moderation and censorship, computer‐aided investigations, and law enforcement.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

The Journal of Engineering

自引率

0.00%

发文量