{"title":"Video segmentation using BIC and stacked scanning","authors":"King Yiu Tam, J. Lay, D. Levy","doi":"10.1109/ICME.2011.6012019","DOIUrl":null,"url":null,"abstract":"This paper proposes an algorithm for automatic segmentation of video clips into speaker units, with the intention of using the latter as the index units for a video indexing and retrieval system. The algorithm works by pooling together global information of each speaker before detecting the true speaker change locations. The use of stacked scanning can offer better results. The experiment shows that the algorithm is able to boost the discriminating power of BIC resulting in 20% reduction in average detection offset and 5% reduction in average maximum detection offset.","PeriodicalId":433997,"journal":{"name":"2011 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Multimedia and Expo","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICME.2011.6012019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes an algorithm for automatic segmentation of video clips into speaker units, with the intention of using the latter as the index units for a video indexing and retrieval system. The algorithm works by pooling together global information of each speaker before detecting the true speaker change locations. The use of stacked scanning can offer better results. The experiment shows that the algorithm is able to boost the discriminating power of BIC resulting in 20% reduction in average detection offset and 5% reduction in average maximum detection offset.