{"title":"MPEG CDVS Feature Trajectories for Action Recognition in Videos","authors":"R. Dasari, Chang Wen Chen","doi":"10.1109/MIPR.2018.00069","DOIUrl":null,"url":null,"abstract":"Visual Action Recognition on mobile phones is a challenging problem. Mobile and wearable devices deal with power, memory, computational and hardware constraints, which mandate robust and lightweight algorithmic implementations for sophisticated vision applications, like action recognition. Compact Descriptors for Visual Search (CDVS) is an MPEG7 standard for an accelerated visual search on mobiles. In our work, we propose a mobile action recognition framework which classifies actions by tracking CDVS feature trajectories of human subjects. The proposed method capitalizes on the sparse, salient and memory efficient properties of CDVS features. Although our recognition accuracies on standard action datasets KTH, UCF50, and HMDB is not superior to the CNN based methods, our work explores and proves the feasibility of using CDVS features for action recognition.","PeriodicalId":320000,"journal":{"name":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MIPR.2018.00069","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Visual Action Recognition on mobile phones is a challenging problem. Mobile and wearable devices deal with power, memory, computational and hardware constraints, which mandate robust and lightweight algorithmic implementations for sophisticated vision applications, like action recognition. Compact Descriptors for Visual Search (CDVS) is an MPEG7 standard for an accelerated visual search on mobiles. In our work, we propose a mobile action recognition framework which classifies actions by tracking CDVS feature trajectories of human subjects. The proposed method capitalizes on the sparse, salient and memory efficient properties of CDVS features. Although our recognition accuracies on standard action datasets KTH, UCF50, and HMDB is not superior to the CNN based methods, our work explores and proves the feasibility of using CDVS features for action recognition.