{"title":"Overview: Video recognition from handcrafted method to deep learning method","authors":"Xiao Xiao, Dan Xu, W. Wan","doi":"10.1109/ICALIP.2016.7846652","DOIUrl":null,"url":null,"abstract":"With the development of information technology, the automatic recognition of human action from video becomes a very popular research topic. In this paper, we review recent state-of-the-art of human action recognition methods in videos. First, we compare several notable handcrafted methods. Then we introduce some deep learning action recognition models. As deep learning becomes hot spot of research in recent years, more and more papers have utilized this method to explore the spatiotemporal features representation. We find that the deep learning methods outperform handcrafted methods at large scale recognition especially in cluttered background. But the networks still have much disadvantage. We expect our overview provides a fairly clear guidance for future research in this domain.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
With the development of information technology, the automatic recognition of human action from video becomes a very popular research topic. In this paper, we review recent state-of-the-art of human action recognition methods in videos. First, we compare several notable handcrafted methods. Then we introduce some deep learning action recognition models. As deep learning becomes hot spot of research in recent years, more and more papers have utilized this method to explore the spatiotemporal features representation. We find that the deep learning methods outperform handcrafted methods at large scale recognition especially in cluttered background. But the networks still have much disadvantage. We expect our overview provides a fairly clear guidance for future research in this domain.