Lightweight Human Behavior Recognition Method for Visual Communication AGV Based on CNN-LSTM
Authors: Shuhua Zhao; Jianxin Zhu; Jiang Lu; Zhibo Ju; Dong Wu
Journal: International Journal of Crowd Science, vol. 9, no. 2, pp. 133-138
Published: 2025-03-13
DOI: 10.26599/IJCS.2024.9100014
URL: https://ieeexplore.ieee.org/document/11003456/
Citations: 0
Abstract
Deep learning approaches to behavior recognition extract deep features from data automatically, whereas traditional machine learning algorithms rely on manual feature extraction and produce models that generalize poorly. This paper proposes S-MobileNet for human behavior recognition. First, features extracted by 3D convolution are used to build a time-series model that learns the long-term temporal dependencies of human behavior characteristics. Second, Long Short-Term Memory (LSTM) serves as the input to a multi-layer recurrent neural network time-series model to obtain individual dynamic features, and these individual features are then aggregated by an attention pooling mechanism to obtain the corresponding group behavior features. Finally, individual and group behaviors are recognized from the individual and group behavior features. Experiments show that the proposed network achieves high recognition accuracy on the UCF101 and HMDB51 datasets, and that its overall recognition rate across 13 kinds of human behavior is 95.3%.
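
The aggregation step described in the abstract, where individual features are combined into a group feature by attention pooling, can be illustrated with a minimal sketch. The abstract does not specify the form of the attention mechanism, so the dot-product scoring against a learned `query` vector, the function names, and the toy feature values below are all assumptions made for illustration, not the authors' implementation.

```python
import math
from typing import List, Sequence


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(individual_feats: Sequence[Sequence[float]],
                   query: Sequence[float]) -> List[float]:
    """Aggregate per-person feature vectors into one group feature vector.

    Assumption: each person's attention score is the dot product of their
    feature vector with `query`, a hypothetical learned parameter. The group
    feature is the softmax-weighted sum of the individual features.
    """
    scores = [sum(f * q for f, q in zip(feat, query))
              for feat in individual_feats]
    weights = softmax(scores)
    dim = len(query)
    return [sum(w * feat[d] for w, feat in zip(weights, individual_feats))
            for d in range(dim)]


# Toy input: three people, each with a 2-dimensional dynamic feature
# (standing in for the per-person output of the recurrent model).
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# A zero query gives every person the same score, so the weights are
# uniform and the result reduces to plain average pooling.
group = attention_pool(feats, query=[0.0, 0.0])
```

With a non-zero `query`, people whose features align with it receive larger weights, which is what lets the pooled group feature emphasize the individuals most relevant to the group behavior.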