2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance最新文献

筛选
英文 中文
Action Recognition from Experience 从经验中认识行动
P. Tu, T. Sebastian, Dashan Gao
{"title":"Action Recognition from Experience","authors":"P. Tu, T. Sebastian, Dashan Gao","doi":"10.1109/AVSS.2012.85","DOIUrl":"https://doi.org/10.1109/AVSS.2012.85","url":null,"abstract":"A reinforcement learning model, which allows for an agent to interact with a simulated 3D learning environment under the initial guidance of an all knowing oracle is proposed. Methods are presented that allow the agent to learn how to perform a set of task oriented actions. It is then hypothesized that the ability to recognize an action may in fact be a byproduct of learning how to perform an action. Evidence supporting this conjecture is presented using both simulated and real world imagery.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124694303","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stereo-Based Framework for Pedestrian Detection with Partial Occlusion Handling 基于立体的部分遮挡行人检测框架
Samuele Martelli, M. Cristani, Vittorio Murino
{"title":"Stereo-Based Framework for Pedestrian Detection with Partial Occlusion Handling","authors":"Samuele Martelli, M. Cristani, Vittorio Murino","doi":"10.1109/AVSS.2012.71","DOIUrl":"https://doi.org/10.1109/AVSS.2012.71","url":null,"abstract":"The pedestrian detection literature has been recently extended by the availability of large-scale multisensory datasets, able to capture complementary aspects of the objects of interest, namely, appearance, motion, and depth. In this paper, we exploit this multimodal scenario to propose a new set of composite descriptors dubbed CO2, CO-variances of visual features and CO-occurrences of depth fields. Covariances of visual features allow us to integrate at low-level heterogeneous visual cues related to intensity and texture. Co-occurrences of depth fields are brand new descriptors, which use range information for characterizing the global shape of a pedestrian while being also able to identify its occluded parts. This paper illustrates how these descriptors can be instantiated and combined together, improving detection capabilities taking also benefit from the proper handling of occlusions. Experimental results show that CO2, fed into a standard discriminative classification system, set state-of-the-art performances on recent multi-modal intensity- and stereo-based pedestrian datasets.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123548301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Discovering Activities and Their Temporal Significance 发现活动及其时间意义
Ayesha Choudhary, T. Faruquie, Subhashis Banerjee, S. Chaudhury
{"title":"Discovering Activities and Their Temporal Significance","authors":"Ayesha Choudhary, T. Faruquie, Subhashis Banerjee, S. Chaudhury","doi":"10.1109/AVSS.2012.37","DOIUrl":"https://doi.org/10.1109/AVSS.2012.37","url":null,"abstract":"In this paper, we address the problem of discovering activities and their temporal significance in an area under surveillance. Discovering activities along with its expectation of occurrence at a particular time plays an important role in many surveillance applications. We propose an unsupervised model, called Time pLSA model, that extends the probabilistic Latent Semantic Analysis (pLSA) model to jointly capture the activities and their behaviour over time. We use adaptive background subtraction to detect spatio-temporal patches, which are used as feature representation for activity patterns. Each of these patches are associated with the time slot in which they occur. Multinomial distributions are used to model both activities as distribution over spatio-temporal patches and time significance as distribution over the time-line. We demonstrate the effectiveness of our approach on a real life surveillance feed of an outdoor scene.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124225682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unsupervised Discovery of Activities and Their Temporal Behaviour 活动的无监督发现及其时间行为
T. Faruquie, Subhashis Banerjee, P. Kalra
{"title":"Unsupervised Discovery of Activities and Their Temporal Behaviour","authors":"T. Faruquie, Subhashis Banerjee, P. Kalra","doi":"10.1109/AVSS.2012.79","DOIUrl":"https://doi.org/10.1109/AVSS.2012.79","url":null,"abstract":"This paper addresses the problem of discovering activities and their temporal significance in surveillance videos in an unsupervised manner. We propose a generative model that can jointly capture the activities and their behaviour over time. We use multinomial distribution over local motion features to model activities and a mixture distribution over their time stamps to capture the multi-modal temporal distribution of these activities. We give a Gibbs sampling algorithm to infer the parameters of the model. We demonstrate the effectiveness of our approach on real life surveillance feed of outdoor scenes.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128918014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Interest Point Selection with Spatio-temporal Context for Realistic Action Recognition 基于时空背景的兴趣点选择在现实动作识别中的应用
Yanhu Shan, Z. Zhang, Junge Zhang, Kaiqi Huang, Na Wu, Oh Se Hyun
{"title":"Interest Point Selection with Spatio-temporal Context for Realistic Action Recognition","authors":"Yanhu Shan, Z. Zhang, Junge Zhang, Kaiqi Huang, Na Wu, Oh Se Hyun","doi":"10.1109/AVSS.2012.43","DOIUrl":"https://doi.org/10.1109/AVSS.2012.43","url":null,"abstract":"Spatio-Temporal Interest Point (STIP) has been widely used for human action recognition. However, the performance of the STIP based methods are still limited in realistic datasets which often include large variations in illuminations, viewpoints and camera motions. One reason of the low performance is that the STIPs only reflect the local change in videos, which is not enough to obtain stable informative features for action representation in realistic scene. To tackle the problem, we proposed an approach to selecting the \"stable STIPs\" with the spatio-temporal distribution of STIPs in neighbor region. Then, BoW feature is constructed to represent actions with these selected points. The experimental results on KTH dataset and HMDB (the largest realistic human action dataset) demonstrate that the proposed approach has obvious effect on improving the recognition rates of realistic data.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133396219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Spatio-temporal LBP Based Moving Object Segmentation in Compressed Domain 基于时空LBP的压缩域运动目标分割
Jianwei Yang, Shizheng Wang, Zhen Lei, Yanyun Zhao, S. Li
{"title":"Spatio-temporal LBP Based Moving Object Segmentation in Compressed Domain","authors":"Jianwei Yang, Shizheng Wang, Zhen Lei, Yanyun Zhao, S. Li","doi":"10.1109/AVSS.2012.68","DOIUrl":"https://doi.org/10.1109/AVSS.2012.68","url":null,"abstract":"With the increasing amount of surveillance data, moving object segmentation in the compressed domain has drawn broad attention from both academy and industry. In this paper, we propose a novel moving object segmentation method towards H.264 compressed surveillance videos. First, the motion vectors (MV) are accumulated and filtered to achieve reliable motion information. Second, considering the spatial and temporal correlations among adjacent blocks, spatio-temporal Local Binary Pattern (LBP) features of MVs are extracted to obtain coarse and initial object regions. Finally, a coarse-to-fine segmentation algorithm of boundary modification is conducted based on the DCT coefficients. The experimental results validate that the proposed method not only can extract fairly accurate objects in compressed video, but also has a relatively low computational complexity.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133854225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Automatic Calibration of Multiple Stationary Laser Range Finders Using Trajectories 基于轨迹的多台固定式激光测距仪自动标定
Konrad Schenk, Alexander Kolarow, M. Eisenbach, Klaus Debes, H. Groß
{"title":"Automatic Calibration of Multiple Stationary Laser Range Finders Using Trajectories","authors":"Konrad Schenk, Alexander Kolarow, M. Eisenbach, Klaus Debes, H. Groß","doi":"10.1109/AVSS.2012.14","DOIUrl":"https://doi.org/10.1109/AVSS.2012.14","url":null,"abstract":"Laser based detection and tracking of persons can be used for numerous tasks, like statistical measurements for determining bottlenecks in public buildings, optimizing passenger flow, or planning camera placement. Only a network of multiple LRF is sufficient to fulfill these tasks in larger spaces. Calibrating multiple LRF into a global coordinate system is usually done by hand in a time consuming procedure. In this paper, we address the problem of automatically calibrating such a sensor network. We introduce an automatic calibration mechanism, which is able to obtain the positions and orientations of all LRF in a global coordinate system, without any prior knowledge of the scene. Our approach is based on comparing person tracks, determined by each individual LRF unit and matching them in order to obtain constraints between the LRF units. By resolving these constraints, we are able to estimate the poses of all LRF. We evaluate and compare our method to the current state of the art approach methodically and experimentally. Experiments show that our calibration approach outperforms this approach.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125420524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Clustering Motion for Real-Time Optical Flow Based Tracking 基于实时光流跟踪的聚类运动
T. Senst, Rubén Heras Evangelio, I. Keller, T. Sikora
{"title":"Clustering Motion for Real-Time Optical Flow Based Tracking","authors":"T. Senst, Rubén Heras Evangelio, I. Keller, T. Sikora","doi":"10.1109/AVSS.2012.20","DOIUrl":"https://doi.org/10.1109/AVSS.2012.20","url":null,"abstract":"The selection of regions or sets of points to track is a key task in motion-based video analysis, which has significant performance effects in terms of accuracy and computational efficiency. Computational efficiency is an unavoidable requirement in video surveillance applications. Well established methods, e.g. Good Features to Track, select points to be tracked based on appearance features such as cornerness and therefore neglecting the motion exhibited by the selected points. In this paper, we propose an interest point selection method that takes into account the motion of previously tracked points in order to constrain the number of point trajectories needed. By defining pair-wise temporal affinities between trajectories and representing them in a minimum spanning tree, we achieve a very efficient clustering. The number of trajectories assigned to each motion cluster is adapted by initializing and removing tracked points by means of feed-back. Compared to the KLT tracker, we save up to 65% of the points to track, therefore gaining in efficiency while not scarifying accuracy.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127103461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Online Learning of Activities from Video 视频活动的在线学习
J. L. Patino, F. Brémond, M. Thonnat
{"title":"Online Learning of Activities from Video","authors":"J. L. Patino, F. Brémond, M. Thonnat","doi":"10.1109/AVSS.2012.50","DOIUrl":"https://doi.org/10.1109/AVSS.2012.50","url":null,"abstract":"The present work introduces a new method for activity extraction from video. To achieve this, we focus on the modelling of context by developing an algorithm that automatically learns the main activity zones of the observed scene by taking as input the trajectories of detected mobiles. Automatically learning the context of the scene (activity zones) allows first to extract a knowledge on the occupancy of the different areas of the scene. In a second step, learned zones are employed to extract people activities by relating mobile trajectories to the learned zones, in this way, the activity of a person can be summarised as the series of zones that the person has visited. For the analysis of the trajectory, a multiresolution analysis is set such that a trajectory is segmented into a series of tracklets based on changing speed points thus allowing differentiating when people stop to interact with elements of the scene or other persons. Tracklets allow thus to extract behavioural information. Starting and ending tracklet points are fed to a simple yet advantageous incremental clustering algorithm to create an initial partition of the scene. Similarity relations between resulting clusters are modeled employing fuzzy relations. These can then be aggregated with typical soft-computing algebra. A clustering algorithm based on the transitive closure calculation of the fuzzy relations allows building the final structure of the scene. To allow for incremental learning and update of activity zones (and thus people activities), fuzzy relations are defined with online learning terms. We present results obtained on real videos from different activity domains.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131374263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
SLTP: A Fast Descriptor for People Detection in Depth Images SLTP:深度图像中人检测的快速描述符
Shiqi Yu, Shengyin Wu, Liang Wang
{"title":"SLTP: A Fast Descriptor for People Detection in Depth Images","authors":"Shiqi Yu, Shengyin Wu, Liang Wang","doi":"10.1109/AVSS.2012.67","DOIUrl":"https://doi.org/10.1109/AVSS.2012.67","url":null,"abstract":"This paper presents a new feature descriptor for real-time people detection in depth images. The shape cue in depth images can reduce negative impacts of variations of clothing, lighting conditions and the complexity of backgrounds. The proposed Simplified Local Ternary Patterns (SLTP) can take advantage of depth images to describe human body shape with low computational cost. To evaluate the SLTP feature, we establish a dataset with 7260 positive samples. A series of experiments are carried out on this dataset, and the results show that the SLTP feature can achieve a high detection rate with a low false positive rate. Besides, SLTP is easy to implement, and performs fast (over 80 frames per second) on a standard desktop computer.","PeriodicalId":275325,"journal":{"name":"2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance","volume":"425 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132053842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信