{"title":"Real-time shot segmentation of unedited video stream for maintenance work","authors":"F. Tsutsumi","doi":"10.1109/MMSP.2002.1203281","DOIUrl":null,"url":null,"abstract":"We propose a shot segmentation method of live video stream for wearable assistants. This method can automatically divide an unedited video stream into meaningful shots, even if the stream contains shaking, vibration or blurring. The basic idea of the method is dividing video streams into \"stationary\" shots and \"transitive\" shots based on the tendency of visual changes. Two typical conventional methods were compared with our method on the 55 minute maintenance video recorded in a restricted environment. Analyzing the video and computing the recall and precision rate showed the adequacy of out method. Furthermore, the practical effectiveness of the methods was evaluated by another 54 minute patrol video in an outdoor environment. Nine human subjects tried to find randomly selected shots with different segmentation produced by three methods. The results show that our method supported users to find the shots most effectively (89% success in 5 minutes).","PeriodicalId":398813,"journal":{"name":"2002 IEEE Workshop on Multimedia Signal Processing.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 IEEE Workshop on Multimedia Signal Processing.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MMSP.2002.1203281","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We propose a shot segmentation method of live video stream for wearable assistants. This method can automatically divide an unedited video stream into meaningful shots, even if the stream contains shaking, vibration or blurring. The basic idea of the method is dividing video streams into "stationary" shots and "transitive" shots based on the tendency of visual changes. Two typical conventional methods were compared with our method on the 55 minute maintenance video recorded in a restricted environment. Analyzing the video and computing the recall and precision rate showed the adequacy of out method. Furthermore, the practical effectiveness of the methods was evaluated by another 54 minute patrol video in an outdoor environment. Nine human subjects tried to find randomly selected shots with different segmentation produced by three methods. The results show that our method supported users to find the shots most effectively (89% success in 5 minutes).