2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services最新文献

筛选
英文 中文
A Semantic Multimedia Analysis Approach Utilizing a Region Thesaurus and LSA 一种利用区域叙词表和LSA的语义多媒体分析方法
E. Spyrou, Giorgos Tolias, Phivos Mylonas, Yannis Avrithis
{"title":"A Semantic Multimedia Analysis Approach Utilizing a Region Thesaurus and LSA","authors":"E. Spyrou, Giorgos Tolias, Phivos Mylonas, Yannis Avrithis","doi":"10.1109/WIAMIS.2008.49","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.49","url":null,"abstract":"This paper presents an approach on high-level feature detection within video documents, using a region thesaurus and latent semantic analysis. A video shot is represented by a single keyframe. MPEG-7 features are extracted from coarse regions of it. A clustering algorithm is applied on all extracted regions and a region thesaurus is constructed. Its use is to assist to the mapping of low- to high-level features by a model vector representation. Latent semantic analysis is then applied on the model vectors to exploit the latent relations among region types aiming to improve detection performance. The proposed approach is thoroughly examined using TRECVID 2007 development data.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130559921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Distributed Cross-Modal Search within the MPEG Query Format 分布跨模态搜索在MPEG查询格式
M. Gruhne, P. Dunker, M. Döller, R. Tous
{"title":"Distributed Cross-Modal Search within the MPEG Query Format","authors":"M. Gruhne, P. Dunker, M. Döller, R. Tous","doi":"10.1109/WIAMIS.2008.61","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.61","url":null,"abstract":"One of the latest developments of the MPEG committee is the Query Format for search and retrieval of multimedia content. This language constitutes the interface between a client and a search engine for searching multimedia data. Another possible scenario is the use of a service provider, which accepts and understands a query from a client and forwards parts of the query to one or more specific databases. Furthermore the service provider is able to retrieve the reply from these databases and post processes this result in order to send it to this client. During the last years, the cross-modal search of video and audio signals became more and more important, since using both, the video and the audio signal together turned out to be much more robust for identification of video streams, than the image part of the video alone.This paper describes a cross-modal search based on the MPEG Query Format, using a service provider for splitting and aggregating the query and two different types of databases.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"564 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127916736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Recent, Current and Future Developments in Video Coding 视频编码的最新、当前和未来发展
J. Ohm
{"title":"Recent, Current and Future Developments in Video Coding","authors":"J. Ohm","doi":"10.1109/WIAMIS.2008.65","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.65","url":null,"abstract":"Abstract form only given. Most recent attention in development of video coding algorithms has been devoted to the ITU-T Rec.H.264 | ISO/IEC 14496-10 advanced video coding standard. Recent and current extensions to this standard include developments for professional applications, highly efficient scalable video coding and multi-view video coding. Finally, digital video over various networks, going for higher and higher resolutions, is becoming reality. While this technology is progressing and further optimizations are sought, new challenges appear at the horizon. New types of displays include 3D capabilities, requiring the generation of additional view perspectives beyond available camera positions. Cameras and displays are coming up with permanently increasing frame rates and sizes. The tremendous amount of different applications for digital video requires additional flexibility and reconfigurability of devices. And last not least, increased compression efficiency (meaning rate reduction versus processing cost) is again becoming more important with ever increasing numbers of pixels to be transmitted. The talk will focus on possible solutions to these challenges and discuss the maturity they currently have.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"165 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125391236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An MPEG-7 Extension for Describing Visual Impairments 描述视觉障碍的MPEG-7扩展
W. Bailer, P. Schallauer
{"title":"An MPEG-7 Extension for Describing Visual Impairments","authors":"W. Bailer, P. Schallauer","doi":"10.1109/WIAMIS.2008.22","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.22","url":null,"abstract":"Analysing the condition of audiovisual essence is an important step in audiovisual production and preservation. Standardised impairment description of audiovisual media is a pre-requisite for system interoperability between content digitisation, documentation, management, restoration, production and delivery systems.We analyse existing capabilities for describing impairment in audiovisual metadata standards. Because of its unique detailed spatiotemporally structured description capabilities we have selected MPEG-7 as the basis for visual impairment description. Following the approach for audio quality description, we define a general description scheme for visual impairments, which allows representing defect events and statistical quality measures. For certain defects, more specialised descriptors are proposed. In addition, we define a comprehensive classification scheme for visual impairments.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126521942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Using MPEG-7 for Generic Audiovisual Content Automatic Summarization MPEG-7在通用视听内容自动摘要中的应用
Nuno Matos, F. Pereira
{"title":"Using MPEG-7 for Generic Audiovisual Content Automatic Summarization","authors":"Nuno Matos, F. Pereira","doi":"10.1109/WIAMIS.2008.18","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.18","url":null,"abstract":"This paper proposes and evaluates a fully automatic summarization application for generic audiovisual based on MPEG-7 compliant hierarchical summary descriptions, which allows providing flexibility, low complexity, and interoperability. The novelty of this paper regards the exploitation of a three features, low-level arousal model to generate the summary metadata needed to instantiate MPEG-7 compliant summary descriptions with the advantages this brings in terms of interoperability. Moreover, a novel, solid performance evaluation methodology has been proposed and its application has been performed.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116673999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Combined Adaptation and Caching of MPEG-4 SVC in Streaming Scenarios 流媒体场景下MPEG-4 SVC的组合适配与缓存
M. Mackay, D. Hutchison, Michael Ransburg, H. Hellwagner
{"title":"Combined Adaptation and Caching of MPEG-4 SVC in Streaming Scenarios","authors":"M. Mackay, D. Hutchison, Michael Ransburg, H. Hellwagner","doi":"10.1109/WIAMIS.2008.44","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.44","url":null,"abstract":"A key objective of the ENTHRONE II Project is the ability to optimise the delivery of multimedia content to a wide group of heterogeneous users. One example of this is in the cooperative deployment of adaptation and caching functionality in the edge network. This hybrid approach makes it possible not only to store content locally, thus minimising the cost incurred through subsequent requests, but also to better serve heterogeneous groups of users by dynamically adapting the content to suit a wide range of terminal devices. In this paper, we describe and evaluate how the cooperative deployment of MPEG-21-based adaptation and caching of MPEG-4 SVC can result in improvements both in the quality of the content received at the user terminal and the resources consumed during the delivery.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"18 791 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126041190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Digital Rights Metadata Management and Retrieval on Structured Overlay Networks 基于结构化覆盖网络的数字版权元数据管理与检索
W. Allasia, F. Gallo, M. Milanesio, R. Schifanella, F. Chiariglione, A. Difino
{"title":"Digital Rights Metadata Management and Retrieval on Structured Overlay Networks","authors":"W. Allasia, F. Gallo, M. Milanesio, R. Schifanella, F. Chiariglione, A. Difino","doi":"10.1109/WIAMIS.2008.33","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.33","url":null,"abstract":"This paper introduces a suitable way for indexing multimedia metadata on a structured peer-to-peer overlay network, with special care to the management of rights metadata expressed by MPEG-21. We have selected a suitable subset of MPEG-21 rights expression language elements to be indexed, in order to map governed contents into a flat space and allow insertion and retrieval of digital contents. Furthermore, we present a distributed application built on a structured overlay network enabling the search of multimedia items using rights related information. Our solution is completely decentralized and can be exploited in any MPEG-21 compliant metadata representation.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128529302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using Neighborhood Distributions of Wavelet Coefficients for On-the-Fly, Multiscale-Based Image Retrieval 基于小波系数邻域分布的动态多尺度图像检索
S. Anthoine, E. Debreuve, Paolo Piro, M. Barlaud
{"title":"Using Neighborhood Distributions of Wavelet Coefficients for On-the-Fly, Multiscale-Based Image Retrieval","authors":"S. Anthoine, E. Debreuve, Paolo Piro, M. Barlaud","doi":"10.1109/WIAMIS.2008.46","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.46","url":null,"abstract":"In this paper, we define a similarity measure to compare images in the context of (indexing and) retrieval. We use the Kullback-Leibler (KL) divergence to compare sparse multiscale image descriptions in a wavelet domain. The KL divergence between wavelet coefficient distributions has already been used as a similarity measure between images. The novelty here is twofold. Firstly, we consider the dependencies between the coefficients by means of distributions of mixed intra/interscale neighborhoods. Secondly, to cope with the high-dimensionality of the resulting description space, we estimate the KL divergences in the k-th nearest neighbor framework, instead of using classical fixed size kernel methods. Query-by-example experiments are presented.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115935464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Interest Based Selection of User Generated Content for Rich Multimedia Services 基于兴趣的富多媒体服务用户生成内容选择
O. Laere, M. Strobbe, S. Dauwe, B. Dhoedt, F. Turck, P. Demeester, Orlando Verde, Frank Hülsken
{"title":"Interest Based Selection of User Generated Content for Rich Multimedia Services","authors":"O. Laere, M. Strobbe, S. Dauwe, B. Dhoedt, F. Turck, P. Demeester, Orlando Verde, Frank Hülsken","doi":"10.1109/WIAMIS.2008.21","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.21","url":null,"abstract":"In view of the overwhelming popularity of user generated content, both in terms of production and consumption, new intelligent services are needed to help users finding the content they need and enhance existing services with suitably selected content. In this paper we present a set of algorithms for retrieving content, based on dynamic user profiles and learning capabilities (e.g. based on user feedback). The profile information is used in content searches as well as for assisting the user input analysis process (i.e. speech recognition). To illustrate the approach taken, a rich communication service is presented. Here, the basic service (i.e. voice/video conferencing) is enhanced by showing pictures in real time to the users based on the topic of their conversation and their specific interests.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126838418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Virtual Camera Tools for an Image2Video Application 用于Image2Video应用程序的虚拟摄像机工具
Fernando Barreiro-Megino, J. Sanchez, Víctor Valdés
{"title":"Virtual Camera Tools for an Image2Video Application","authors":"Fernando Barreiro-Megino, J. Sanchez, Víctor Valdés","doi":"10.1109/WIAMIS.2008.25","DOIUrl":"https://doi.org/10.1109/WIAMIS.2008.25","url":null,"abstract":"This paper proposes a set of virtual camera tools developed as a part of an image to video system, oriented to the adaptation of large images to be viewed on small displays without a significant loss of information. This transmoding system automates the process of scrolling and zooming through an image with a minimal user interaction by simulating a virtual camera movement through the picture. The process is automatic and the user interaction will be limited to establish some preferences on the video generation. The focus of this article is the presentation of the algorithms designed to obtain smooth, user-customizable camera movements.","PeriodicalId":325635,"journal":{"name":"2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2008-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114885336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信