MPEG CDVS Feature Trajectories for Action Recognition in Videos
Authors: R. Dasari, Chang Wen Chen
Venue: 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)
Published: 2018-04-10
DOI: https://doi.org/10.1109/MIPR.2018.00069
Citations: 7
Abstract
Visual action recognition on mobile phones is a challenging problem. Mobile and wearable devices operate under power, memory, computational, and hardware constraints, which call for robust and lightweight implementations of sophisticated vision applications such as action recognition. Compact Descriptors for Visual Search (CDVS) is an MPEG-7 standard for accelerated visual search on mobile devices. In this work, we propose a mobile action recognition framework that classifies actions by tracking CDVS feature trajectories of human subjects. The proposed method capitalizes on the sparse, salient, and memory-efficient properties of CDVS features. Although our recognition accuracies on the standard action datasets KTH, UCF50, and HMDB are not superior to those of CNN-based methods, our work explores and demonstrates the feasibility of using CDVS features for action recognition.