{"title":"从具有外观和运动特征的自我中心图像中识别活动","authors":"Yanhua Chen, Mingtao Pei, Z. Nie","doi":"10.1109/mlsp52302.2021.9596178","DOIUrl":null,"url":null,"abstract":"With the development of wearable cameras, recognizing activities from egocentric images has attracted the interest of many researchers. The motion of the camera wearer is an important cue for the activity recognition, and is either explicitly used by optical flow for videos or implicitly used by fusing several images for images. In this paper, based on the observation that the two consecutive images captured by the wearable camera contain the motion information of the camera wearer, we propose to use the camera wearer's rotation and translation computed from the two consecutive images as the motion features. The motion features are combined with appearance features extracted by a CNN as the activity features, and the activity is classified by a random decision forest. We test our method on two egocentric image datasets. The experimental results show that by adding the motion information, the accuracy of activity recognition has been significantly improved.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"28 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Recognizing Activities from Egocentric Images with Appearance and Motion Features\",\"authors\":\"Yanhua Chen, Mingtao Pei, Z. Nie\",\"doi\":\"10.1109/mlsp52302.2021.9596178\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of wearable cameras, recognizing activities from egocentric images has attracted the interest of many researchers. The motion of the camera wearer is an important cue for the activity recognition, and is either explicitly used by optical flow for videos or implicitly used by fusing several images for images. In this paper, based on the observation that the two consecutive images captured by the wearable camera contain the motion information of the camera wearer, we propose to use the camera wearer's rotation and translation computed from the two consecutive images as the motion features. The motion features are combined with appearance features extracted by a CNN as the activity features, and the activity is classified by a random decision forest. We test our method on two egocentric image datasets. The experimental results show that by adding the motion information, the accuracy of activity recognition has been significantly improved.\",\"PeriodicalId\":156116,\"journal\":{\"name\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"volume\":\"28 1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/mlsp52302.2021.9596178\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/mlsp52302.2021.9596178","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recognizing Activities from Egocentric Images with Appearance and Motion Features
With the development of wearable cameras, recognizing activities from egocentric images has attracted the interest of many researchers. The motion of the camera wearer is an important cue for the activity recognition, and is either explicitly used by optical flow for videos or implicitly used by fusing several images for images. In this paper, based on the observation that the two consecutive images captured by the wearable camera contain the motion information of the camera wearer, we propose to use the camera wearer's rotation and translation computed from the two consecutive images as the motion features. The motion features are combined with appearance features extracted by a CNN as the activity features, and the activity is classified by a random decision forest. We test our method on two egocentric image datasets. The experimental results show that by adding the motion information, the accuracy of activity recognition has been significantly improved.