Personalizing video recorders using multimedia processing and integration

MULTIMEDIA '01 Pub Date : 2001-10-01 DOI:10.1145/500141.500243

N. Dimitrova, R. Jasinschi, L. Agnihotri, J. Zimmerman, T. McGee, Dongge Li

Citations: 5

Abstract

Current personal video recorders make it very easy for consumers to record whole TV programs. Our research, however, focuses on personalizing TV at a sub-program level. We use a traditional Content-Based Information Retrieval system architecture consisting of archiving and retrieval modules. The archiving module employs a three-layered, multimodal integration framework to segment, analyze, characterize, and classify segments. The retrieval module relies on users' personal preferences to deliver both full programs and video segments of interest. We tested retrieval concepts with real users and discovered that they see more value in segmenting non-narrative programs (e.g., news) than narrative programs (e.g., movies). We benchmarked individual algorithms and segment classification for celebrity and financial segments as instances of non-narrative content. For celebrity segments we obtained a total precision of 94.1% and a recall of 85.7%, and for financial segments a total precision of 81.1% and a recall of 86.9%.
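For readers less familiar with the reported metrics, the following is a minimal sketch of how precision and recall are conventionally computed from segment-level counts, and how the two figures can be combined into an F1 score. The function names are illustrative assumptions, and the F1 values are derived here from the percentages quoted in the abstract; the abstract itself reports neither F1 nor the underlying counts.

```python
def precision_recall(tp: int, fp: int, fn: int) -> tuple[float, float]:
    """Standard definitions: precision = TP / (TP + FP), recall = TP / (TP + FN).

    tp: segments correctly classified as the target category
    fp: segments wrongly classified as the target category
    fn: target-category segments that were missed
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (not reported in the abstract)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as quoted in the abstract, expressed as fractions.
reported = {
    "celebrity": (0.941, 0.857),
    "financial": (0.811, 0.869),
}

for category, (p, r) in reported.items():
    # F1 is a derived summary, shown only to make the two figures comparable.
    print(f"{category}: precision={p:.3f} recall={r:.3f} F1={f1_score(p, r):.3f}")
```

Read this way, the celebrity classifier trades some recall for higher precision, while the financial classifier is more balanced between the two.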



