概述:视频识别从手工方法到深度学习方法

2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI:10.1109/ICALIP.2016.7846652

Xiao Xiao, Dan Xu, W. Wan

{"title":"概述:视频识别从手工方法到深度学习方法","authors":"Xiao Xiao, Dan Xu, W. Wan","doi":"10.1109/ICALIP.2016.7846652","DOIUrl":null,"url":null,"abstract":"With the development of information technology, the automatic recognition of human action from video becomes a very popular research topic. In this paper, we review recent state-of-the-art of human action recognition methods in videos. First, we compare several notable handcrafted methods. Then we introduce some deep learning action recognition models. As deep learning becomes hot spot of research in recent years, more and more papers have utilized this method to explore the spatiotemporal features representation. We find that the deep learning methods outperform handcrafted methods at large scale recognition especially in cluttered background. But the networks still have much disadvantage. We expect our overview provides a fairly clear guidance for future research in this domain.","PeriodicalId":184170,"journal":{"name":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Overview: Video recognition from handcrafted method to deep learning method\",\"authors\":\"Xiao Xiao, Dan Xu, W. Wan\",\"doi\":\"10.1109/ICALIP.2016.7846652\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of information technology, the automatic recognition of human action from video becomes a very popular research topic. In this paper, we review recent state-of-the-art of human action recognition methods in videos. First, we compare several notable handcrafted methods. Then we introduce some deep learning action recognition models. As deep learning becomes hot spot of research in recent years, more and more papers have utilized this method to explore the spatiotemporal features representation. We find that the deep learning methods outperform handcrafted methods at large scale recognition especially in cluttered background. But the networks still have much disadvantage. We expect our overview provides a fairly clear guidance for future research in this domain.\",\"PeriodicalId\":184170,\"journal\":{\"name\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Audio, Language and Image Processing (ICALIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICALIP.2016.7846652\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Audio, Language and Image Processing (ICALIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICALIP.2016.7846652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

摘要

随着信息技术的发展，从视频中自动识别人的动作已成为一个非常热门的研究课题。本文综述了视频中人类动作识别方法的最新进展。首先，我们比较几种著名的手工制作方法。然后介绍了一些深度学习动作识别模型。随着深度学习成为近年来的研究热点，越来越多的论文利用该方法来探索时空特征表示。我们发现深度学习方法在大规模识别中优于手工方法，特别是在杂乱的背景下。但是网络仍然有很多劣势。我们希望我们的概述为该领域的未来研究提供一个相当明确的指导。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Overview: Video recognition from handcrafted method to deep learning method

With the development of information technology, the automatic recognition of human action from video becomes a very popular research topic. In this paper, we review recent state-of-the-art of human action recognition methods in videos. First, we compare several notable handcrafted methods. Then we introduce some deep learning action recognition models. As deep learning becomes hot spot of research in recent years, more and more papers have utilized this method to explore the spatiotemporal features representation. We find that the deep learning methods outperform handcrafted methods at large scale recognition especially in cluttered background. But the networks still have much disadvantage. We expect our overview provides a fairly clear guidance for future research in this domain.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Audio, Language and Image Processing (ICALIP)

自引率

0.00%

发文量