Human action recognition with 3D convolutional neural network

2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI) Pub Date : 2017-11-01 DOI:10.1109/LA-CCI.2017.8285700

Tiago Lima, Bruno José Torres Fernandes, Pablo V. A. Barros

{"title":"Human action recognition with 3D convolutional neural network","authors":"Tiago Lima, Bruno José Torres Fernandes, Pablo V. A. Barros","doi":"10.1109/LA-CCI.2017.8285700","DOIUrl":null,"url":null,"abstract":"In the last decade, there was a development of technologies that allowed the possibility of storing and processing large amounts of data. Due to this, there was a considerable increase in the use of video cameras. Areas such as surveillance, traffic control, and entertainment, presented a greater demand for the development of techniques for analysis and automatic classification of videos. Within those areas of application, human activities recognition is considered one of the major problems and is discussed in the scientific environment due to related challenges, such as blurred images, point view changed confusion with background and low resolution. Recently, the Convolutional Neural Networks (CNN) have made considerable advances in several areas of research, improving state of the art in many cases, including images and videos classification problems. Thus, this work aims to develop a 3D CNN for the human actions recognition, as well as a study of the influence of the resolutions of entries in the network. After choosing the model are compared with other works in the area. The results obtained by the model surpassed the state-of-the-art in the bases evaluated and are discussed in this document.","PeriodicalId":144567,"journal":{"name":"2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI.2017.8285700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

In the last decade, there was a development of technologies that allowed the possibility of storing and processing large amounts of data. Due to this, there was a considerable increase in the use of video cameras. Areas such as surveillance, traffic control, and entertainment, presented a greater demand for the development of techniques for analysis and automatic classification of videos. Within those areas of application, human activities recognition is considered one of the major problems and is discussed in the scientific environment due to related challenges, such as blurred images, point view changed confusion with background and low resolution. Recently, the Convolutional Neural Networks (CNN) have made considerable advances in several areas of research, improving state of the art in many cases, including images and videos classification problems. Thus, this work aims to develop a 3D CNN for the human actions recognition, as well as a study of the influence of the resolutions of entries in the network. After choosing the model are compared with other works in the area. The results obtained by the model surpassed the state-of-the-art in the bases evaluated and are discussed in this document.

查看原文本刊更多论文

基于三维卷积神经网络的人体动作识别

在过去的十年里，技术的发展使得存储和处理大量数据成为可能。因此，摄影机的使用有了相当大的增加。监控、交通控制和娱乐等领域对视频分析和自动分类技术的发展提出了更大的需求。在这些应用领域中，人类活动识别被认为是主要问题之一，并在科学环境中进行了讨论，因为相关的挑战，如图像模糊，视点变化与背景混淆和低分辨率。最近，卷积神经网络(CNN)在几个研究领域取得了相当大的进展，在许多情况下提高了技术水平，包括图像和视频分类问题。因此，本工作旨在开发一个用于人类动作识别的3D CNN，并研究网络中条目分辨率的影响。选择后的模型与该地区其他作品进行了比较。该模型得到的结果超过了目前评价基地的水平，并在本文中进行了讨论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI)

自引率

0.00%

发文量