{"title":"A scene boundary detection method","authors":"Minn Chung, Hyeokman Kim, S. M. Song","doi":"10.1109/ICIP.2000.899610","DOIUrl":null,"url":null,"abstract":"A visual rhythm is a special 2D image reduced from a 3D video data in a way that the pixels along a vertical line of the visual rhythm are the pixels uniformly sampled along the diagonal line of a video frame. Using the distinct visual patterns to appear on the visual rhythm, we propose a video segmentation method to effectively detect both abrupt and gradual shot transitions. Specifically, the algorithms to detect cut, wipe and dissolve transitions are outlined. It is noted that the proposed method operates on a small fraction of the original video data, and thus offers remarkable computational savings.","PeriodicalId":193198,"journal":{"name":"Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIP.2000.899610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 25
Abstract
A visual rhythm is a special 2D image reduced from a 3D video data in a way that the pixels along a vertical line of the visual rhythm are the pixels uniformly sampled along the diagonal line of a video frame. Using the distinct visual patterns to appear on the visual rhythm, we propose a video segmentation method to effectively detect both abrupt and gradual shot transitions. Specifically, the algorithms to detect cut, wipe and dissolve transitions are outlined. It is noted that the proposed method operates on a small fraction of the original video data, and thus offers remarkable computational savings.