SqueezeNet-ImpLinknet Architecture for Crowd Anomaly Detection With Improved R-CNN-Based Segmentation

IF 1.7 4区计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Computer Animation and Virtual Worlds Pub Date : 2026-03-03 DOI:10.1002/cav.70100

Jyoti Ambadas Kendule, Kailash J. Karande

{"title":"SqueezeNet-ImpLinknet Architecture for Crowd Anomaly Detection With Improved R-CNN-Based Segmentation","authors":"Jyoti Ambadas Kendule, Kailash J. Karande","doi":"10.1002/cav.70100","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Crowd anomaly detection is a critical aspect of ensuring public safety in various domains such as surveillance and security. Ensuring public safety in crowded environments requires accurate and efficient crowd anomaly detection. This research proposes an innovative approach to crowd anomaly detection using the SqueezeNet-ImpLinknet architecture. The input images are first preprocessed using a median filtering technique. Then, object segmentation takes place using an Improved Mask Region-based CNN. It incorporates batch normalization, ReLU activation, and an advanced Scale Dot Product attention mechanism to improve segmentation accuracy and computational efficiency. Subsequently, features such as the Improved SLBT feature, capturing shape and texture information, color features, and LGTrP features are extracted. Then, anomaly detection is performed using a hybrid model that integrates SqueezeNet and Improved Linknet models. The Improved LinkNet model enhances feature representation by integrating an attention mechanism in the encoder and a novel ReLUSignmax activation function in the decoder, overcoming limitations of conventional architectures. The approach is evaluated on the widely used UCSD Anomaly Detection Dataset, achieving superior performance with accuracy ranging from 0.939 to 0.975 and a specificity of 0.987 at 90% training data. The proposed approach offers a robust solution for intelligent surveillance in crowded environments.</p>\n </div>","PeriodicalId":50645,"journal":{"name":"Computer Animation and Virtual Worlds","volume":"37 2","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2026-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Animation and Virtual Worlds","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cav.70100","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Crowd anomaly detection is a critical aspect of ensuring public safety in various domains such as surveillance and security. Ensuring public safety in crowded environments requires accurate and efficient crowd anomaly detection. This research proposes an innovative approach to crowd anomaly detection using the SqueezeNet-ImpLinknet architecture. The input images are first preprocessed using a median filtering technique. Then, object segmentation takes place using an Improved Mask Region-based CNN. It incorporates batch normalization, ReLU activation, and an advanced Scale Dot Product attention mechanism to improve segmentation accuracy and computational efficiency. Subsequently, features such as the Improved SLBT feature, capturing shape and texture information, color features, and LGTrP features are extracted. Then, anomaly detection is performed using a hybrid model that integrates SqueezeNet and Improved Linknet models. The Improved LinkNet model enhances feature representation by integrating an attention mechanism in the encoder and a novel ReLUSignmax activation function in the decoder, overcoming limitations of conventional architectures. The approach is evaluated on the widely used UCSD Anomaly Detection Dataset, achieving superior performance with accuracy ranging from 0.939 to 0.975 and a specificity of 0.987 at 90% training data. The proposed approach offers a robust solution for intelligent surveillance in crowded environments.

Abstract Image

查看原文本刊更多论文

基于改进r - cnn分割的人群异常检测的SqueezeNet-ImpLinknet架构

人群异常检测在监控、安防等各个领域都是保障公共安全的重要方面。为了保证拥挤环境下的公共安全，需要准确、高效的人群异常检测。本研究提出了一种利用SqueezeNet-ImpLinknet架构进行人群异常检测的创新方法。首先使用中值滤波技术对输入图像进行预处理。然后，使用改进的基于掩码区域的CNN进行对象分割。它结合了批归一化，ReLU激活和先进的尺度点积注意机制，以提高分割精度和计算效率。随后，提取改进的SLBT特征、捕获形状和纹理信息特征、颜色特征、LGTrP特征等特征。然后，使用结合了SqueezeNet和Improved Linknet模型的混合模型进行异常检测。改进的LinkNet模型通过在编码器中集成注意机制和在解码器中集成新颖的ReLUSignmax激活函数来增强特征表示，克服了传统架构的局限性。在广泛使用的UCSD异常检测数据集上对该方法进行了评估，在90%的训练数据下，准确率为0.939 ~ 0.975，特异性为0.987，取得了优异的性能。该方法为拥挤环境下的智能监控提供了一种鲁棒的解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer Animation and Virtual Worlds 工程技术-计算机：软件工程

CiteScore

2.20

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： With the advent of very powerful PCs and high-end graphics cards, there has been an incredible development in Virtual Worlds, real-time computer animation and simulation, games. But at the same time, new and cheaper Virtual Reality devices have appeared allowing an interaction with these real-time Virtual Worlds and even with real worlds through Augmented Reality. Three-dimensional characters, especially Virtual Humans are now of an exceptional quality, which allows to use them in the movie industry. But this is only a beginning, as with the development of Artificial Intelligence and Agent technology, these characters will become more and more autonomous and even intelligent. They will inhabit the Virtual Worlds in a Virtual Life together with animals and plants.