讲座视频片段的视觉总结,以增强导航功能

Mohammad Rajiur Rahman, J. Subhlok, Shishir K. Shah
{"title":"讲座视频片段的视觉总结,以增强导航功能","authors":"Mohammad Rajiur Rahman, J. Subhlok, Shishir K. Shah","doi":"10.1109/ISM.2020.00033","DOIUrl":null,"url":null,"abstract":"Lecture video is an increasingly important learning resource. However, the challenge of quickly finding the content of interest in a long lecture video is a critical limitation of this format. This paper introduces visual summarization of lecture video segments to improve navigation. A lecture video is divided into segments based on the frame-to-frame similarity of content. The user navigates a lecture video assisted by single frame visual and textual summaries of segments. The paper presents a novel methodology to generate the visual summary of a lecture video segment by estimating the importance of each image in the segment, computing similarities between the images, and employing a graph-based algorithm to identify the most representative images. The summarization framework developed is integrated into a real-world lecture video management portal called Videopoints. An evaluation with ground truth from human experts established that the algorithms presented are significantly superior to random selection as well as clustering based selection, and only modestly inferior to human selection. Over 65% of automatically generated summaries were rated at Good or better by the users. Overall, the methodology introduced in this paper was shown to produce good quality visual summaries that are practically useful for lecture video navigation.","PeriodicalId":120972,"journal":{"name":"2020 IEEE International Symposium on Multimedia (ISM)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Visual Summarization of Lecture Video Segments for Enhanced Navigation\",\"authors\":\"Mohammad Rajiur Rahman, J. Subhlok, Shishir K. Shah\",\"doi\":\"10.1109/ISM.2020.00033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Lecture video is an increasingly important learning resource. However, the challenge of quickly finding the content of interest in a long lecture video is a critical limitation of this format. This paper introduces visual summarization of lecture video segments to improve navigation. A lecture video is divided into segments based on the frame-to-frame similarity of content. The user navigates a lecture video assisted by single frame visual and textual summaries of segments. The paper presents a novel methodology to generate the visual summary of a lecture video segment by estimating the importance of each image in the segment, computing similarities between the images, and employing a graph-based algorithm to identify the most representative images. The summarization framework developed is integrated into a real-world lecture video management portal called Videopoints. An evaluation with ground truth from human experts established that the algorithms presented are significantly superior to random selection as well as clustering based selection, and only modestly inferior to human selection. Over 65% of automatically generated summaries were rated at Good or better by the users. Overall, the methodology introduced in this paper was shown to produce good quality visual summaries that are practically useful for lecture video navigation.\",\"PeriodicalId\":120972,\"journal\":{\"name\":\"2020 IEEE International Symposium on Multimedia (ISM)\",\"volume\":\"125 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Symposium on Multimedia (ISM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISM.2020.00033\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Symposium on Multimedia (ISM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM.2020.00033","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

摘要

讲座视频是一种越来越重要的学习资源。然而,在一个很长的讲座视频中快速找到感兴趣的内容是这种格式的一个关键限制。本文介绍了讲座视频片段的可视化摘要,以改善导航。讲座视频是根据内容的帧与帧之间的相似度来划分的。用户通过单帧视觉和文本摘要的辅助来浏览讲座视频。本文提出了一种新的方法,通过估计片段中每个图像的重要性,计算图像之间的相似性,并采用基于图的算法来识别最具代表性的图像,从而生成视频片段的视觉摘要。开发的摘要框架被集成到一个叫做videpoints的真实世界的讲座视频管理门户中。来自人类专家的基于事实的评估表明,所提出的算法明显优于随机选择和基于聚类的选择,仅略逊于人类选择。超过65%的自动生成的摘要被用户评为好或更好。总的来说,本文中介绍的方法被证明可以产生高质量的视觉摘要,这对于讲座视频导航非常有用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Visual Summarization of Lecture Video Segments for Enhanced Navigation
Lecture video is an increasingly important learning resource. However, the challenge of quickly finding the content of interest in a long lecture video is a critical limitation of this format. This paper introduces visual summarization of lecture video segments to improve navigation. A lecture video is divided into segments based on the frame-to-frame similarity of content. The user navigates a lecture video assisted by single frame visual and textual summaries of segments. The paper presents a novel methodology to generate the visual summary of a lecture video segment by estimating the importance of each image in the segment, computing similarities between the images, and employing a graph-based algorithm to identify the most representative images. The summarization framework developed is integrated into a real-world lecture video management portal called Videopoints. An evaluation with ground truth from human experts established that the algorithms presented are significantly superior to random selection as well as clustering based selection, and only modestly inferior to human selection. Over 65% of automatically generated summaries were rated at Good or better by the users. Overall, the methodology introduced in this paper was shown to produce good quality visual summaries that are practically useful for lecture video navigation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信