Entropy Based Fuzzy C Means Clustering and Key Frame Extraction for Sports Video Summarization

S. Angadi, Vilas Naik
{"title":"Entropy Based Fuzzy C Means Clustering and Key Frame Extraction for Sports Video Summarization","authors":"S. Angadi, Vilas Naik","doi":"10.1109/ICSIP.2014.49","DOIUrl":null,"url":null,"abstract":"Recent advances in technology have made tremendous amount of multimedia information available to the general population. To access the needed information in this scenario there is a need for automatic tools to filter and present information summary. Summarization techniques will give a choice to users to browse and select the multimedia documents of their choice for complete viewing later. In this work a new summarization technique to collect frames of importance in a video is presented. The method is based on selection of frames typically different from their immediate neighbors as key frames from group of similar frames. It uses the process of clustering, where visually similar frames are collected into one group using Fuzzy C means clustering algorithm. When clusters are formed, the frames that exhibit a change ratio which is a measure of the content variation, greater than the average value of the cluster are treated as Key frames. The summary is created by merging Key frames on the basis of their timeline. This method ensures that video summary represents the most unique frames of the input video and gives equal attention to preserving continuity of the summarized video. The robustness of the algorithm is validated by average values of performance parameters. The average compression ratio of 92% is indication of higher conciseness. The average fidelity of 95% is an indicative of comprehensive representation of video by the key frames selected using proposed algorithm.","PeriodicalId":111591,"journal":{"name":"2014 Fifth International Conference on Signal and Image Processing","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Fifth International Conference on Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIP.2014.49","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19

Abstract

Recent advances in technology have made tremendous amount of multimedia information available to the general population. To access the needed information in this scenario there is a need for automatic tools to filter and present information summary. Summarization techniques will give a choice to users to browse and select the multimedia documents of their choice for complete viewing later. In this work a new summarization technique to collect frames of importance in a video is presented. The method is based on selection of frames typically different from their immediate neighbors as key frames from group of similar frames. It uses the process of clustering, where visually similar frames are collected into one group using Fuzzy C means clustering algorithm. When clusters are formed, the frames that exhibit a change ratio which is a measure of the content variation, greater than the average value of the cluster are treated as Key frames. The summary is created by merging Key frames on the basis of their timeline. This method ensures that video summary represents the most unique frames of the input video and gives equal attention to preserving continuity of the summarized video. The robustness of the algorithm is validated by average values of performance parameters. The average compression ratio of 92% is indication of higher conciseness. The average fidelity of 95% is an indicative of comprehensive representation of video by the key frames selected using proposed algorithm.
基于熵的模糊C均值聚类和关键帧提取运动视频摘要
最近技术的进步使大众可以获得大量的多媒体信息。为了在此场景中访问所需的信息,需要使用自动工具来过滤和显示信息摘要。摘要技术将为用户提供浏览和选择他们选择的多媒体文档的选择,以便以后完整地查看。本文提出了一种新的视频重要帧的汇总技术。该方法基于从一组相似帧中选择通常与其近邻不同的帧作为关键帧。它使用聚类的过程,其中视觉上相似的帧收集成一组使用模糊C均值聚类算法。当集群形成时,表现出变化比率(衡量内容变化的指标)大于集群平均值的框架被视为关键框架。摘要是通过在时间轴的基础上合并关键帧来创建的。该方法既保证了视频摘要代表了输入视频中最独特的帧,又保证了视频摘要的连续性。通过性能参数的平均值验证了算法的鲁棒性。平均压缩比为92%,表明简洁性较高。95%的平均保真度表明,使用所提出的算法选择的关键帧对视频进行了全面的表示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信