{"title":"Pose-based clustering in action sequences","authors":"G. Loy, Josephine Sullivan, S. Carlsson","doi":"10.1109/HLK.2003.1240860","DOIUrl":null,"url":null,"abstract":"A method is presented for automatically extracting key frames from an image sequence. The sequence is divided into clusters of frames with similar appearance, and the most central frame in each cluster defines a key frame. Clustering is done using an extension of the normalized cut segmentation technique based on the inter-frame similarities. The similarity between every pair of frames in the sequence is determined from the spatial image characteristics via a shape matching technique. Our algorithm is demonstrated successfully extracting 20 key frames for a tennis player in action over a 30 second (900 frame) video sequence.","PeriodicalId":265600,"journal":{"name":"First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003.","volume":"347 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis, 2003. HLK 2003.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HLK.2003.1240860","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22
Abstract
A method is presented for automatically extracting key frames from an image sequence. The sequence is divided into clusters of frames with similar appearance, and the most central frame in each cluster defines a key frame. Clustering is done using an extension of the normalized cut segmentation technique based on the inter-frame similarities. The similarity between every pair of frames in the sequence is determined from the spatial image characteristics via a shape matching technique. Our algorithm is demonstrated successfully extracting 20 key frames for a tennis player in action over a 30 second (900 frame) video sequence.