Expanding the point: automatic enlargement of presentation video elements

Q. Tung, R. Swaminathan, A. Efrat, Kobus Barnard
{"title":"Expanding the point: automatic enlargement of presentation video elements","authors":"Q. Tung, R. Swaminathan, A. Efrat, Kobus Barnard","doi":"10.1145/2072298.2071913","DOIUrl":null,"url":null,"abstract":"We present a system that assists users in viewing videos of lectures on small screen devices, such as cell phones. It automatically identifies semantic units on the slides, such as bullets, groups of bullets, and images. As the participant views the lecture, the system magnifies the appropriate semantic unit while it is the focus of the discussion. The system makes this decision based on cues from laser pointer gestures and spoken words that are read off the slide. It then magnifies the semantic element using the slide image and the homography between the slide image and the video frame. Experiments suggest that the semantic units of laser-based events identified by our algorithm closely match those identified by humans. In the case of identifying bullets through spoken words, results are more limited but are a good starting point for more complex methods. Finally, we show that this kind of magnification has potential for improving learning of technical content from video lectures when the resolution of the video is limited, such as when being viewed on hand held devices.","PeriodicalId":318758,"journal":{"name":"Proceedings of the 19th ACM international conference on Multimedia","volume":"125 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th ACM international conference on Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2072298.2071913","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

We present a system that assists users in viewing videos of lectures on small screen devices, such as cell phones. It automatically identifies semantic units on the slides, such as bullets, groups of bullets, and images. As the participant views the lecture, the system magnifies the appropriate semantic unit while it is the focus of the discussion. The system makes this decision based on cues from laser pointer gestures and spoken words that are read off the slide. It then magnifies the semantic element using the slide image and the homography between the slide image and the video frame. Experiments suggest that the semantic units of laser-based events identified by our algorithm closely match those identified by humans. In the case of identifying bullets through spoken words, results are more limited but are a good starting point for more complex methods. Finally, we show that this kind of magnification has potential for improving learning of technical content from video lectures when the resolution of the video is limited, such as when being viewed on handheld devices.
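The magnification step described above depends on a homography relating the slide image to the video frame. As an illustration only, not the authors' implementation, the sketch below shows how such a homography could be used to project a semantic unit's bounding box from slide coordinates into video-frame coordinates with OpenCV; the function name `map_unit_to_frame` and the assumption that matched point correspondences are already available are ours.

```python
# Illustrative sketch (not the paper's code): given matched points between the
# slide image and a video frame, estimate the slide-to-frame homography and
# project a semantic unit's bounding box into the frame for magnification.
import numpy as np
import cv2


def map_unit_to_frame(slide_pts, frame_pts, unit_bbox):
    """Map a bounding box (x0, y0, x1, y1) in slide-image coordinates
    into the video frame, given Nx2 arrays of matched points."""
    # Estimate the slide-to-frame homography robustly from the correspondences.
    H, _ = cv2.findHomography(slide_pts, frame_pts, cv2.RANSAC)

    # Corners of the semantic unit (e.g. a bullet) in slide coordinates.
    x0, y0, x1, y1 = unit_bbox
    corners = np.float32([[x0, y0], [x1, y0], [x1, y1], [x0, y1]]).reshape(-1, 1, 2)

    # Project the corners into the frame; the enclosing axis-aligned rectangle
    # is the region a player would enlarge for the viewer.
    projected = cv2.perspectiveTransform(corners, H).reshape(-1, 2)
    x_min, y_min = projected.min(axis=0)
    x_max, y_max = projected.max(axis=0)
    return int(x_min), int(y_min), int(x_max), int(y_max)
```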