{"title":"基于序列2D-CNN的监视系统动作识别","authors":"Van-Dung Hoang, D. Hoang, Có hiệu","doi":"10.1109/IECON.2018.8591338","DOIUrl":null,"url":null,"abstract":"Action recognition plays an important task in surveillance systems, robot-human interaction and autonomous, systems. However, there are many challenging problems due to varieties of shape, illumination conditions, and complex of actions. Consuming time and precision are typically the main challenges for action recognition systems. Deep neural network techniques have dramatically become the state of the art in image processing. The high capacity deep learning on the temporal action from video analysis has been impeded because of varieties of classes, similarity of actions. This paper presents a new approach based on sequential deep neural network and data augmentation for improving accuracy. First, the deep neural network is constructed using several inceptions with different parallel convolutional operations which support reducing consuming time. Second, image augmentation of training dataset generates a larger data enough for deep neural network learning. This proposed task is aimed to address the small data problem. It is utilized for enhancing capabilities of deep learning. The proposed approach was evaluated on some benchmark datasets. Experimental evaluation on public benchmark dataset shows that the proposed approach enhances performance to 89.53% accuracy. Comparison results illustrated that our proposed method reaches higher performance than almost state- of- the- art methods.","PeriodicalId":370319,"journal":{"name":"IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Action Recognition Based on Sequential 2D-CNN for Surveillance Systems\",\"authors\":\"Van-Dung Hoang, D. Hoang, Có hiệu\",\"doi\":\"10.1109/IECON.2018.8591338\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Action recognition plays an important task in surveillance systems, robot-human interaction and autonomous, systems. However, there are many challenging problems due to varieties of shape, illumination conditions, and complex of actions. Consuming time and precision are typically the main challenges for action recognition systems. Deep neural network techniques have dramatically become the state of the art in image processing. The high capacity deep learning on the temporal action from video analysis has been impeded because of varieties of classes, similarity of actions. This paper presents a new approach based on sequential deep neural network and data augmentation for improving accuracy. First, the deep neural network is constructed using several inceptions with different parallel convolutional operations which support reducing consuming time. Second, image augmentation of training dataset generates a larger data enough for deep neural network learning. This proposed task is aimed to address the small data problem. It is utilized for enhancing capabilities of deep learning. The proposed approach was evaluated on some benchmark datasets. Experimental evaluation on public benchmark dataset shows that the proposed approach enhances performance to 89.53% accuracy. Comparison results illustrated that our proposed method reaches higher performance than almost state- of- the- art methods.\",\"PeriodicalId\":370319,\"journal\":{\"name\":\"IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IECON.2018.8591338\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IECON 2018 - 44th Annual Conference of the IEEE Industrial Electronics Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IECON.2018.8591338","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Action Recognition Based on Sequential 2D-CNN for Surveillance Systems
Action recognition plays an important task in surveillance systems, robot-human interaction and autonomous, systems. However, there are many challenging problems due to varieties of shape, illumination conditions, and complex of actions. Consuming time and precision are typically the main challenges for action recognition systems. Deep neural network techniques have dramatically become the state of the art in image processing. The high capacity deep learning on the temporal action from video analysis has been impeded because of varieties of classes, similarity of actions. This paper presents a new approach based on sequential deep neural network and data augmentation for improving accuracy. First, the deep neural network is constructed using several inceptions with different parallel convolutional operations which support reducing consuming time. Second, image augmentation of training dataset generates a larger data enough for deep neural network learning. This proposed task is aimed to address the small data problem. It is utilized for enhancing capabilities of deep learning. The proposed approach was evaluated on some benchmark datasets. Experimental evaluation on public benchmark dataset shows that the proposed approach enhances performance to 89.53% accuracy. Comparison results illustrated that our proposed method reaches higher performance than almost state- of- the- art methods.