Entropy Based Fuzzy C Means Clustering and Key Frame Extraction for Sports Video Summarization

2014 Fifth International Conference on Signal and Image Processing Pub Date : 2014-01-08 DOI:10.1109/ICSIP.2014.49

S. Angadi, Vilas Naik

{"title":"Entropy Based Fuzzy C Means Clustering and Key Frame Extraction for Sports Video Summarization","authors":"S. Angadi, Vilas Naik","doi":"10.1109/ICSIP.2014.49","DOIUrl":null,"url":null,"abstract":"Recent advances in technology have made tremendous amount of multimedia information available to the general population. To access the needed information in this scenario there is a need for automatic tools to filter and present information summary. Summarization techniques will give a choice to users to browse and select the multimedia documents of their choice for complete viewing later. In this work a new summarization technique to collect frames of importance in a video is presented. The method is based on selection of frames typically different from their immediate neighbors as key frames from group of similar frames. It uses the process of clustering, where visually similar frames are collected into one group using Fuzzy C means clustering algorithm. When clusters are formed, the frames that exhibit a change ratio which is a measure of the content variation, greater than the average value of the cluster are treated as Key frames. The summary is created by merging Key frames on the basis of their timeline. This method ensures that video summary represents the most unique frames of the input video and gives equal attention to preserving continuity of the summarized video. The robustness of the algorithm is validated by average values of performance parameters. The average compression ratio of 92% is indication of higher conciseness. The average fidelity of 95% is an indicative of comprehensive representation of video by the key frames selected using proposed algorithm.","PeriodicalId":111591,"journal":{"name":"2014 Fifth International Conference on Signal and Image Processing","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Fifth International Conference on Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIP.2014.49","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 19

Abstract

Recent advances in technology have made tremendous amount of multimedia information available to the general population. To access the needed information in this scenario there is a need for automatic tools to filter and present information summary. Summarization techniques will give a choice to users to browse and select the multimedia documents of their choice for complete viewing later. In this work a new summarization technique to collect frames of importance in a video is presented. The method is based on selection of frames typically different from their immediate neighbors as key frames from group of similar frames. It uses the process of clustering, where visually similar frames are collected into one group using Fuzzy C means clustering algorithm. When clusters are formed, the frames that exhibit a change ratio which is a measure of the content variation, greater than the average value of the cluster are treated as Key frames. The summary is created by merging Key frames on the basis of their timeline. This method ensures that video summary represents the most unique frames of the input video and gives equal attention to preserving continuity of the summarized video. The robustness of the algorithm is validated by average values of performance parameters. The average compression ratio of 92% is indication of higher conciseness. The average fidelity of 95% is an indicative of comprehensive representation of video by the key frames selected using proposed algorithm.

查看原文本刊更多论文

基于熵的模糊C均值聚类和关键帧提取运动视频摘要

最近技术的进步使大众可以获得大量的多媒体信息。为了在此场景中访问所需的信息，需要使用自动工具来过滤和显示信息摘要。摘要技术将为用户提供浏览和选择他们选择的多媒体文档的选择，以便以后完整地查看。本文提出了一种新的视频重要帧的汇总技术。该方法基于从一组相似帧中选择通常与其近邻不同的帧作为关键帧。它使用聚类的过程，其中视觉上相似的帧收集成一组使用模糊C均值聚类算法。当集群形成时，表现出变化比率(衡量内容变化的指标)大于集群平均值的框架被视为关键框架。摘要是通过在时间轴的基础上合并关键帧来创建的。该方法既保证了视频摘要代表了输入视频中最独特的帧，又保证了视频摘要的连续性。通过性能参数的平均值验证了算法的鲁棒性。平均压缩比为92%，表明简洁性较高。95%的平均保真度表明，使用所提出的算法选择的关键帧对视频进行了全面的表示。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 Fifth International Conference on Signal and Image Processing

自引率

0.00%

发文量