Tiago Lima, Bruno José Torres Fernandes, Pablo V. A. Barros
{"title":"Human action recognition with 3D convolutional neural network","authors":"Tiago Lima, Bruno José Torres Fernandes, Pablo V. A. Barros","doi":"10.1109/LA-CCI.2017.8285700","DOIUrl":null,"url":null,"abstract":"In the last decade, there was a development of technologies that allowed the possibility of storing and processing large amounts of data. Due to this, there was a considerable increase in the use of video cameras. Areas such as surveillance, traffic control, and entertainment, presented a greater demand for the development of techniques for analysis and automatic classification of videos. Within those areas of application, human activities recognition is considered one of the major problems and is discussed in the scientific environment due to related challenges, such as blurred images, point view changed confusion with background and low resolution. Recently, the Convolutional Neural Networks (CNN) have made considerable advances in several areas of research, improving state of the art in many cases, including images and videos classification problems. Thus, this work aims to develop a 3D CNN for the human actions recognition, as well as a study of the influence of the resolutions of entries in the network. After choosing the model are compared with other works in the area. The results obtained by the model surpassed the state-of-the-art in the bases evaluated and are discussed in this document.","PeriodicalId":144567,"journal":{"name":"2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Latin American Conference on Computational Intelligence (LA-CCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/LA-CCI.2017.8285700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
In the last decade, there was a development of technologies that allowed the possibility of storing and processing large amounts of data. Due to this, there was a considerable increase in the use of video cameras. Areas such as surveillance, traffic control, and entertainment, presented a greater demand for the development of techniques for analysis and automatic classification of videos. Within those areas of application, human activities recognition is considered one of the major problems and is discussed in the scientific environment due to related challenges, such as blurred images, point view changed confusion with background and low resolution. Recently, the Convolutional Neural Networks (CNN) have made considerable advances in several areas of research, improving state of the art in many cases, including images and videos classification problems. Thus, this work aims to develop a 3D CNN for the human actions recognition, as well as a study of the influence of the resolutions of entries in the network. After choosing the model are compared with other works in the area. The results obtained by the model surpassed the state-of-the-art in the bases evaluated and are discussed in this document.