Associating video with related documents

MULTIMEDIA '99 Pub Date : 1999-10-01 DOI:10.1145/319878.319883

R. Hamada, I. Ide, S. Sakai, Hidehiko Tanaka

{"title":"Associating video with related documents","authors":"R. Hamada, I. Ide, S. Sakai, Hidehiko Tanaka","doi":"10.1145/319878.319883","DOIUrl":null,"url":null,"abstract":"Reflecting the increasing importance of handling multimedia data, many studies are made on indexing to TV broadcast video. Multimedia data consist of image, audio, and text, and various research on analysis of each individual medium has been made. Especially, image processing has been the main issue when handling multimedia for a long time. But recently, it has started to be considered that image processing alone is insufficient for thorough understanding of multimedia data. In the 1990’s, integrated processing that supplements the incompleteness of information from each medium has become a trend. Following this trend, we are trying to integrate TV programs with related documents, taking advantage of the relative easiness of extracting semantic structures from text media. Among various programs, cultural programs are considered as appropriate sources since (1)supplementary documents are available and (2)the video contains a lot of implicit information that integration could be helpful to thorough understanding of supplementary texts. Many attempts have been made to index video by means of multimedia integration. But sufficient accuracy for practical use is not necessarily achieved since their subjects are too general to achieve accuracy from elemental technoiogies by making use of domain specific characteristics. In our method, we examine and construct a practical system using relatively simple elemental technologies by reflecting the result of one medium’s process to another. We will focus on cooking programs, so that we can take advantage of domain specific constraints and knowledge. Through the examination in this specific domain, and the usage of a supplementary document and its analysis, we aim for proposing a novel advanced multimedia integration method. Using the result of this method, we also propose an integrative restructuring method of the multimedia data provided both from the video and the supplementary document.","PeriodicalId":265329,"journal":{"name":"MULTIMEDIA '99","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MULTIMEDIA '99","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/319878.319883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Reflecting the increasing importance of handling multimedia data, many studies are made on indexing to TV broadcast video. Multimedia data consist of image, audio, and text, and various research on analysis of each individual medium has been made. Especially, image processing has been the main issue when handling multimedia for a long time. But recently, it has started to be considered that image processing alone is insufficient for thorough understanding of multimedia data. In the 1990’s, integrated processing that supplements the incompleteness of information from each medium has become a trend. Following this trend, we are trying to integrate TV programs with related documents, taking advantage of the relative easiness of extracting semantic structures from text media. Among various programs, cultural programs are considered as appropriate sources since (1)supplementary documents are available and (2)the video contains a lot of implicit information that integration could be helpful to thorough understanding of supplementary texts. Many attempts have been made to index video by means of multimedia integration. But sufficient accuracy for practical use is not necessarily achieved since their subjects are too general to achieve accuracy from elemental technoiogies by making use of domain specific characteristics. In our method, we examine and construct a practical system using relatively simple elemental technologies by reflecting the result of one medium’s process to another. We will focus on cooking programs, so that we can take advantage of domain specific constraints and knowledge. Through the examination in this specific domain, and the usage of a supplementary document and its analysis, we aim for proposing a novel advanced multimedia integration method. Using the result of this method, we also propose an integrative restructuring method of the multimedia data provided both from the video and the supplementary document.

查看原文本刊更多论文

关联视频和相关文档

随着多媒体数据处理的日益重要，对电视播放视频的索引进行了大量的研究。多媒体数据由图像、音频和文本组成，对每种单独的媒体进行了各种研究分析。长期以来，图像处理一直是多媒体处理的主要问题。但最近，人们开始认为仅靠图像处理不足以彻底理解多媒体数据。在20世纪90年代，补充各种媒介信息的不完整性的综合处理已成为一种趋势。顺应这一趋势，我们正在尝试将电视节目与相关文档结合起来，利用从文本媒体中提取语义结构相对容易的优势。在各种节目中，文化节目被认为是合适的来源，因为(1)可以获得补充文件;(2)视频包含许多隐含信息，整合这些信息有助于彻底理解补充文本。利用多媒体集成的方法对视频进行索引已经做了很多尝试。但是对于实际应用来说，并不一定能达到足够的准确性，因为它们的主题过于笼统，无法通过利用领域特定的特征从基本技术中获得准确性。在我们的方法中，我们通过将一种介质过程的结果反映到另一种介质过程中，使用相对简单的基本技术来检验和构建一个实用的系统。我们将专注于烹饪程序，这样我们就可以利用特定领域的约束和知识。通过对这一特定领域的研究，以及对补充文献的使用和分析，提出了一种新颖的先进的多媒体集成方法。利用该方法的结果，我们还提出了一种对视频和补充文档提供的多媒体数据进行综合重组的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

MULTIMEDIA '99

自引率

0.00%

发文量