Roshan Singh, A. Kushwaha, Rajat Khurana, R. Srivastava
{"title":"Activity Recognition by Delving deeper using CNN and RNN","authors":"Roshan Singh, A. Kushwaha, Rajat Khurana, R. Srivastava","doi":"10.1109/ISCON47742.2019.9036262","DOIUrl":null,"url":null,"abstract":"Video content has a protagonist role in this age of data revolution. These days, computer vision research community is fascinated towards application of convolutional neural networks for various image and video analysis tasks. Recurrent Neural Networks are also used in various computer vision tasks. Introduction of residual connections in traditional CNN model to design very deep architectures known as Residual Networks are very efficient for computer vision tasks. To exploit capabilities of both CNN and RNN the proposed model is based on CRNN which is trained from scratch as well as using ResNet 152 which is pre trained on ImageNet dataset. The architecture is trained and validated on popular UCF-101 dataset on the basis of accuracy and average loss. From results, it can be observed that proposed approach provides better results than state of art methods.","PeriodicalId":124412,"journal":{"name":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 4th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON47742.2019.9036262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Video content has a protagonist role in this age of data revolution. These days, computer vision research community is fascinated towards application of convolutional neural networks for various image and video analysis tasks. Recurrent Neural Networks are also used in various computer vision tasks. Introduction of residual connections in traditional CNN model to design very deep architectures known as Residual Networks are very efficient for computer vision tasks. To exploit capabilities of both CNN and RNN the proposed model is based on CRNN which is trained from scratch as well as using ResNet 152 which is pre trained on ImageNet dataset. The architecture is trained and validated on popular UCF-101 dataset on the basis of accuracy and average loss. From results, it can be observed that proposed approach provides better results than state of art methods.