{"title":"基于字典的动作视频分类与动作库","authors":"S. Wilson, M. Srinivas, C. Mohan","doi":"10.1109/ICDSP.2014.6900734","DOIUrl":null,"url":null,"abstract":"Classifying action videos became challenging problem in computer vision community. In this work, action videos are represented by dictionaries which are learned by online dictionary learning (ODL). Here, we have used two simple measures to classify action videos, reconstruction error and projection. Sparse approximation algorithm LASSO is used to reconstruct test video and reconstruction error is calculated for each of the dictionaries. To get another discriminative measure projection, the test vector is projected onto the atoms in the dictionary. Minimum reconstruction error and maximum projection give information regarding the action category of the test vector. With action bank as a feature vector, our best performance is 59.3% on UCF50 (benchmark is 57.9%), 97.7% on KTH (benchmark is 98.2%)and 23.63% on HMDB51 (benchmark is 26.9%).","PeriodicalId":301856,"journal":{"name":"2014 19th International Conference on Digital Signal Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Dictionary based action video classification with action bank\",\"authors\":\"S. Wilson, M. Srinivas, C. Mohan\",\"doi\":\"10.1109/ICDSP.2014.6900734\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Classifying action videos became challenging problem in computer vision community. In this work, action videos are represented by dictionaries which are learned by online dictionary learning (ODL). Here, we have used two simple measures to classify action videos, reconstruction error and projection. Sparse approximation algorithm LASSO is used to reconstruct test video and reconstruction error is calculated for each of the dictionaries. To get another discriminative measure projection, the test vector is projected onto the atoms in the dictionary. Minimum reconstruction error and maximum projection give information regarding the action category of the test vector. With action bank as a feature vector, our best performance is 59.3% on UCF50 (benchmark is 57.9%), 97.7% on KTH (benchmark is 98.2%)and 23.63% on HMDB51 (benchmark is 26.9%).\",\"PeriodicalId\":301856,\"journal\":{\"name\":\"2014 19th International Conference on Digital Signal Processing\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 19th International Conference on Digital Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2014.6900734\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 19th International Conference on Digital Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2014.6900734","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Dictionary based action video classification with action bank
Classifying action videos became challenging problem in computer vision community. In this work, action videos are represented by dictionaries which are learned by online dictionary learning (ODL). Here, we have used two simple measures to classify action videos, reconstruction error and projection. Sparse approximation algorithm LASSO is used to reconstruct test video and reconstruction error is calculated for each of the dictionaries. To get another discriminative measure projection, the test vector is projected onto the atoms in the dictionary. Minimum reconstruction error and maximum projection give information regarding the action category of the test vector. With action bank as a feature vector, our best performance is 59.3% on UCF50 (benchmark is 57.9%), 97.7% on KTH (benchmark is 98.2%)and 23.63% on HMDB51 (benchmark is 26.9%).