改进时空特征提取技术及其在动作分类中的应用

2012 Visual Communications and Image Processing Pub Date : 2012-11-01 DOI:10.1109/VCIP.2012.6410811

Maral Mesmakhosroshahi, Joohee Kim

{"title":"改进时空特征提取技术及其在动作分类中的应用","authors":"Maral Mesmakhosroshahi, Joohee Kim","doi":"10.1109/VCIP.2012.6410811","DOIUrl":null,"url":null,"abstract":"Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.","PeriodicalId":103073,"journal":{"name":"2012 Visual Communications and Image Processing","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Improving spatio-temporal feature extraction techniques and their applications in action classification\",\"authors\":\"Maral Mesmakhosroshahi, Joohee Kim\",\"doi\":\"10.1109/VCIP.2012.6410811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.\",\"PeriodicalId\":103073,\"journal\":{\"name\":\"2012 Visual Communications and Image Processing\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Visual Communications and Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/VCIP.2012.6410811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Visual Communications and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VCIP.2012.6410811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

时空特征提取是一种近年来比较流行的动作识别方法。为了提高时空特征提取技术对光照和尺度变化的鲁棒性，提出了一种新的算法。大多数兴趣点检测器对光照变化很敏感，这可能会导致错误的关键点在动作识别中出现严重的问题。提出了一种使三维哈里斯角点探测器对光照变化具有鲁棒性的方法。照明不变性是通过对视频应用对比度拉伸函数来找到每个像素的适当强度级别来实现的。提出了一种非均匀分束方法，通过形成集中在兴趣点附近区域的方向直方图，使众所周知的SIFT描述符的三维扩展更加可靠和鲁棒。将特征袋技术用于KTH数据集提供的动作分类，结果表明我们提出的方法优于原始的三维角点检测器和SIFT描述子。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Improving spatio-temporal feature extraction techniques and their applications in action classification

Space-time feature extraction is a recent and popular method used for action recognition. This paper presents a new algorithm to improve the robustness of spatio-temporal feature extraction techniques against the illumination and scale variations. Most of the interest point detectors are sensitive to illumination variations that may cause serious problems in action recognition by finding wrong keypoints. A method is proposed to make the 3-D Harris corner detector robust to illumination changes. Illumination invariance is achieved by applying a contrast stretching function to the video to find the proper intensity level for each pixel. A non-uniform binning method is also proposed to make the 3-D extension of the well-known SIFT descriptor more reliable and robust to scale changes by forming orientation histograms which concentrate on the regions near the interest points. Bag of features technique is used for classifying actions provided by the KTH dataset and the results demonstrate that our proposed method outperforms the original 3-D corner detector and SIFT descriptor.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 Visual Communications and Image Processing

自引率

0.00%

发文量