{"title":"Real-Time Target Detection and Recognition with Deep Convolutional Networks for Intelligent Visual Surveillance","authors":"Wen Xu, Jing He, H. Zhang, B. Mao, Jie Cao","doi":"10.1145/2996890.3007881","DOIUrl":null,"url":null,"abstract":"Moving target detection and tracking, recognition, behaviours analysis are the key issues in the intelligent visual surveillance system (IVSS). The challenge is how to process the real-time video stream in an effective way in case that we could find the interested objects for analysis. However, the traditional video surveillance technology often does not meet the needs of real-time key frame recognition for the on-line intelligent video monitoring system. In our paper, we apply the state-of-the-art Faster R-CNN [7] that takes advantages of convolutional neural networks into our real-time target recognition system - Deep Intelligent Visual Surveillance (DIVS). The key aspects of our DIVS are consisted of four parts: (i) Getting the real-time video image from remote cameras, (ii) Processing the data with the deep learning framework caffe [23] built for Faster R-CNN, (iii) Storing the valuable data with MySQL, (iv) Data presentation on the website. Experiments based on our system validated the effectiveness, stability and accuracy of our proposed solutions.","PeriodicalId":350701,"journal":{"name":"2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2996890.3007881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Moving target detection and tracking, recognition, behaviours analysis are the key issues in the intelligent visual surveillance system (IVSS). The challenge is how to process the real-time video stream in an effective way in case that we could find the interested objects for analysis. However, the traditional video surveillance technology often does not meet the needs of real-time key frame recognition for the on-line intelligent video monitoring system. In our paper, we apply the state-of-the-art Faster R-CNN [7] that takes advantages of convolutional neural networks into our real-time target recognition system - Deep Intelligent Visual Surveillance (DIVS). The key aspects of our DIVS are consisted of four parts: (i) Getting the real-time video image from remote cameras, (ii) Processing the data with the deep learning framework caffe [23] built for Faster R-CNN, (iii) Storing the valuable data with MySQL, (iv) Data presentation on the website. Experiments based on our system validated the effectiveness, stability and accuracy of our proposed solutions.