Xin Zhang, Yantao Zhu, Li-Juan Deng, Long Qi, Zhang Tao, Jiali Hu
{"title":"A SlowFast behavior recognition algorithm incorporating motion saliency","authors":"Xin Zhang, Yantao Zhu, Li-Juan Deng, Long Qi, Zhang Tao, Jiali Hu","doi":"10.1117/12.2674969","DOIUrl":null,"url":null,"abstract":"This paper first analyzes three major problems that can be encountered in video behavior recognition tasks: sampled blocks cannot be focused on motion regions, global motion affects recognition results, and the network's spatio-temporal modeling capability is weak. To address these three problems, we propose the SlowFast behavior recognition algorithm (MASlowFast) that incorporates motion saliency as an application scenario for mine personnel safety behavior recognition. The specific solutions are the sampling method based on motion saliency, the extraction of motion boundary features, and the spatio-temporal segmentation strategy of fast and slow channels. Finally, we validated the effectiveness and accuracy of the algorithm in this paper by ablation experiments on UCF101 dataset and HMDB51 dataset.","PeriodicalId":286364,"journal":{"name":"Conference on Computer Graphics, Artificial Intelligence, and Data Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Computer Graphics, Artificial Intelligence, and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2674969","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper first analyzes three major problems that can be encountered in video behavior recognition tasks: sampled blocks cannot be focused on motion regions, global motion affects recognition results, and the network's spatio-temporal modeling capability is weak. To address these three problems, we propose the SlowFast behavior recognition algorithm (MASlowFast) that incorporates motion saliency as an application scenario for mine personnel safety behavior recognition. The specific solutions are the sampling method based on motion saliency, the extraction of motion boundary features, and the spatio-temporal segmentation strategy of fast and slow channels. Finally, we validated the effectiveness and accuracy of the algorithm in this paper by ablation experiments on UCF101 dataset and HMDB51 dataset.