{"title":"改进时空特征提取技术及其在动作分类中的应用","authors":"Maral Mesmakhosroshahi, Joohee Kim","doi":"10.1109/VCIP.2012.6410811","DOIUrl":null,"url":null,"abstract":"Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Improving spatio-temporal feature extraction techniques and their applications in action classification\",\"authors\":\"Maral Mesmakhosroshahi, Joohee Kim\",\"doi\":\"10.1109/VCIP.2012.6410811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.\",\"PeriodicalId\":103073,\"journal\":{\"name\":\"2012 Visual Communications and Image Processing\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Visual Communications and Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/VCIP.2012.6410811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Visual Communications and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP.2012.6410811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving spatio-temporal feature extraction techniques and their applications in action classification
Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.