Miao Jin, Jun Zhang, Tianfu Huang, Zhiwei Guo, Xiwen Chen
{"title":"基于视频全局-局部特征的人体动作识别研究","authors":"Miao Jin, Jun Zhang, Tianfu Huang, Zhiwei Guo, Xiwen Chen","doi":"10.1109/PRML52754.2021.9520743","DOIUrl":null,"url":null,"abstract":"In the research of human behavior recognition, the two-stream network structure shows excellent results. Aiming at the branch feature of two-stream networks, this paper proposes a two-stream human behavior research method based on global-local features. This method first uses a mixture of Gaussian background modeling methods to extract silhouette features as global contour features, and then uses an end-to-end learnable unsupervised network TV-Net to generate optical flow motion features, which are used as the network input, and the Xception network is used as The feature generation network which does not change the model scale while improving the accuracy, and performs fusion classification on the output of the two-stream branch network to obtain the behavior recognition result. This method refines the motion information contained in the global and local features for classification, reduces the computational complexity, and shows a good level of recognition on both public and internal data sets.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Research on Human Action Recognition Based on Global-Local Features of Video\",\"authors\":\"Miao Jin, Jun Zhang, Tianfu Huang, Zhiwei Guo, Xiwen Chen\",\"doi\":\"10.1109/PRML52754.2021.9520743\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the research of human behavior recognition, the two-stream network structure shows excellent results. Aiming at the branch feature of two-stream networks, this paper proposes a two-stream human behavior research method based on global-local features. This method first uses a mixture of Gaussian background modeling methods to extract silhouette features as global contour features, and then uses an end-to-end learnable unsupervised network TV-Net to generate optical flow motion features, which are used as the network input, and the Xception network is used as The feature generation network which does not change the model scale while improving the accuracy, and performs fusion classification on the output of the two-stream branch network to obtain the behavior recognition result. This method refines the motion information contained in the global and local features for classification, reduces the computational complexity, and shows a good level of recognition on both public and internal data sets.\",\"PeriodicalId\":429603,\"journal\":{\"name\":\"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PRML52754.2021.9520743\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRML52754.2021.9520743","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on Human Action Recognition Based on Global-Local Features of Video
In the research of human behavior recognition, the two-stream network structure shows excellent results. Aiming at the branch feature of two-stream networks, this paper proposes a two-stream human behavior research method based on global-local features. This method first uses a mixture of Gaussian background modeling methods to extract silhouette features as global contour features, and then uses an end-to-end learnable unsupervised network TV-Net to generate optical flow motion features, which are used as the network input, and the Xception network is used as The feature generation network which does not change the model scale while improving the accuracy, and performs fusion classification on the output of the two-stream branch network to obtain the behavior recognition result. This method refines the motion information contained in the global and local features for classification, reduces the computational complexity, and shows a good level of recognition on both public and internal data sets.