Human Activity Recognition with Noise-Injected Time-Distributed AlexNet.

IF 3.9 3区医学 Q1 ENGINEERING, MULTIDISCIPLINARY

Biomimetics Pub Date : 2025-09-11 DOI:10.3390/biomimetics10090613

Sanjay Dutta, Tossapon Boongoen, Reyer Zwiggelaar

{"title":"Human Activity Recognition with Noise-Injected Time-Distributed AlexNet.","authors":"Sanjay Dutta, Tossapon Boongoen, Reyer Zwiggelaar","doi":"10.3390/biomimetics10090613","DOIUrl":null,"url":null,"abstract":"<p><p>This study investigates the integration of biologically inspired noise injection with a time-distributed adaptation of the AlexNet architecture to enhance the performance and robustness of human activity recognition (HAR) systems. It is a critical field in computer vision which involves identifying and interpreting human actions from video sequences and has applications in healthcare, security and smart environments. The proposed model is based on an adaptation of AlexNet, originally developed for static image classification and not inherently suited for modelling temporal sequences for video action classification tasks. While our time-distributed AlexNet efficiently captures spatial and temporal features and suitable for video classification. However, its performance can be limited by overfitting and poor generalisation to unseen scenarios, to address these challenges, Gaussian noise was introduced at the input level during training, inspired by neural mechanisms observed in biological sensory processing to handle variability and uncertainty. Experiments were conducted on the EduNet, UCF50 and UCF101 datasets. The EduNet dataset was specifically designed for educational environments and we evaluate the impact of noise injection on model accuracy, stability and overall performance. The proposed bio-inspired noise-injected time-distributed AlexNet achieved an overall accuracy of 91.40% and an F1 score of 92.77%, outperforming other state-of-the-art models. Hyperparameter tuning, particularly optimising the learning rate, further enhanced model stability, reflected in lower standard deviation values across multiple experimental runs. These findings demonstrate that the strategic combination of noise injection with time-distributed architectures improves generalisation and robustness in HAR, paving the way for resource-efficient and real-world-deployable deep learning systems.</p>","PeriodicalId":8907,"journal":{"name":"Biomimetics","volume":"10 9","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12467874/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomimetics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/biomimetics10090613","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

This study investigates the integration of biologically inspired noise injection with a time-distributed adaptation of the AlexNet architecture to enhance the performance and robustness of human activity recognition (HAR) systems. It is a critical field in computer vision which involves identifying and interpreting human actions from video sequences and has applications in healthcare, security and smart environments. The proposed model is based on an adaptation of AlexNet, originally developed for static image classification and not inherently suited for modelling temporal sequences for video action classification tasks. While our time-distributed AlexNet efficiently captures spatial and temporal features and suitable for video classification. However, its performance can be limited by overfitting and poor generalisation to unseen scenarios, to address these challenges, Gaussian noise was introduced at the input level during training, inspired by neural mechanisms observed in biological sensory processing to handle variability and uncertainty. Experiments were conducted on the EduNet, UCF50 and UCF101 datasets. The EduNet dataset was specifically designed for educational environments and we evaluate the impact of noise injection on model accuracy, stability and overall performance. The proposed bio-inspired noise-injected time-distributed AlexNet achieved an overall accuracy of 91.40% and an F1 score of 92.77%, outperforming other state-of-the-art models. Hyperparameter tuning, particularly optimising the learning rate, further enhanced model stability, reflected in lower standard deviation values across multiple experimental runs. These findings demonstrate that the strategic combination of noise injection with time-distributed architectures improves generalisation and robustness in HAR, paving the way for resource-efficient and real-world-deployable deep learning systems.

Abstract Image

查看原文本刊更多论文

基于噪声注入时间分布AlexNet的人类活动识别。

本研究探讨了生物启发的噪声注入与AlexNet架构的时间分布适应的集成，以提高人类活动识别（HAR）系统的性能和鲁棒性。它是计算机视觉的一个关键领域，涉及从视频序列中识别和解释人类行为，并在医疗保健，安全和智能环境中有应用。所提出的模型是基于AlexNet的改编，AlexNet最初是为静态图像分类而开发的，并不适合为视频动作分类任务建模时间序列。而我们的时间分布式AlexNet可以有效地捕捉空间和时间特征，适合视频分类。然而，它的性能可能会受到过度拟合和对未知场景的不良泛化的限制，为了解决这些挑战，在训练过程中在输入层面引入高斯噪声，灵感来自于在生物感觉处理中观察到的神经机制，以处理可变性和不确定性。实验在EduNet、UCF50和UCF101数据集上进行。EduNet数据集是专门为教育环境设计的，我们评估了噪声注入对模型准确性、稳定性和整体性能的影响。提出的仿生噪声注入时间分布式AlexNet的总体准确率为91.40%，F1得分为92.77%，优于其他最先进的模型。超参数调整，特别是优化学习率，进一步增强了模型的稳定性，反映在多个实验运行的较低标准差值上。这些发现表明，噪声注入与时间分布式架构的战略组合提高了HAR的泛化和鲁棒性，为资源高效和现实世界可部署的深度学习系统铺平了道路。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊