{"title":"Action Recognition with Trajectory and Scene","authors":"Jiqing Liu, Hui Xiang, Yibo Shi, D. Yu","doi":"10.1109/ICDH.2012.55","DOIUrl":null,"url":null,"abstract":"Trajectory features have recently shown promising results to action recognition in video. Typically, they are extracted by tracking feature points with the KLT tracker or matching SIFT descriptors between frames. However, trajectory can be due to the action of interest, but also be caused by background or the camera motion. To overcome the problem, human detection is applied to roughly estimate of the location of the human in the video and segment video into Foreground/Background regions. In many cases, human actions can be identified not only by observing human body in motion, but also properties of the surrounding scene. In our work, we addresse the problem and propose an approach that integrates multiple features from scene and people. We evaluate our video description with a bag of-features model. We also present experimental results on two datasets with an increasing degree of difficulty and demonstrate significant improvements.","PeriodicalId":308799,"journal":{"name":"2012 Fourth International Conference on Digital Home","volume":"129 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Fourth International Conference on Digital Home","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDH.2012.55","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Trajectory features have recently shown promising results to action recognition in video. Typically, they are extracted by tracking feature points with the KLT tracker or matching SIFT descriptors between frames. However, trajectory can be due to the action of interest, but also be caused by background or the camera motion. To overcome the problem, human detection is applied to roughly estimate of the location of the human in the video and segment video into Foreground/Background regions. In many cases, human actions can be identified not only by observing human body in motion, but also properties of the surrounding scene. In our work, we addresse the problem and propose an approach that integrates multiple features from scene and people. We evaluate our video description with a bag of-features model. We also present experimental results on two datasets with an increasing degree of difficulty and demonstrate significant improvements.