Juan C. Sanmiguel, Marcos Escudero-Viñolo, J. Sanchez, Jesús Bescós
{"title":"受控环境下实时单视点视频事件识别","authors":"Juan C. Sanmiguel, Marcos Escudero-Viñolo, J. Sanchez, Jesús Bescós","doi":"10.1109/CBMI.2011.5972527","DOIUrl":null,"url":null,"abstract":"This paper presents a real-time video event recognition system for controlled environments. It is able to recognize human activities and interactions with the objects of the environment by exploiting different cues like trajectory analysis, skin detection and people recognition of the foreground blobs of the scene. Time variations of these features are studied and combined using Bayesian inference to detect the events. Contextual information, including fixed objects' location, object types and event hierarchical definitions, is formally included in the system. A corpus of video sequences has been designed and recorded considering different complexity levels for object extraction. Experimental results show that our approach can recognize five kinds of events (two activities and three human-object interactions) with high precision operating at real-time.","PeriodicalId":358337,"journal":{"name":"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Real-time single-view video event recognition in controlled environments\",\"authors\":\"Juan C. Sanmiguel, Marcos Escudero-Viñolo, J. Sanchez, Jesús Bescós\",\"doi\":\"10.1109/CBMI.2011.5972527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a real-time video event recognition system for controlled environments. It is able to recognize human activities and interactions with the objects of the environment by exploiting different cues like trajectory analysis, skin detection and people recognition of the foreground blobs of the scene. Time variations of these features are studied and combined using Bayesian inference to detect the events. Contextual information, including fixed objects' location, object types and event hierarchical definitions, is formally included in the system. A corpus of video sequences has been designed and recorded considering different complexity levels for object extraction. Experimental results show that our approach can recognize five kinds of events (two activities and three human-object interactions) with high precision operating at real-time.\",\"PeriodicalId\":358337,\"journal\":{\"name\":\"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMI.2011.5972527\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 9th International Workshop on Content-Based Multimedia Indexing (CBMI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMI.2011.5972527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-time single-view video event recognition in controlled environments
This paper presents a real-time video event recognition system for controlled environments. It is able to recognize human activities and interactions with the objects of the environment by exploiting different cues like trajectory analysis, skin detection and people recognition of the foreground blobs of the scene. Time variations of these features are studied and combined using Bayesian inference to detect the events. Contextual information, including fixed objects' location, object types and event hierarchical definitions, is formally included in the system. A corpus of video sequences has been designed and recorded considering different complexity levels for object extraction. Experimental results show that our approach can recognize five kinds of events (two activities and three human-object interactions) with high precision operating at real-time.