2012 13th International Workshop on Image Analysis for Multimedia Interactive Services最新文献

筛选
英文 中文
Reliability measure for propagation-based stereo matching 基于传播的立体匹配可靠性测度
Guillaume Gales, S. Chambon, Alain Crouzil, J. McDonald
{"title":"Reliability measure for propagation-based stereo matching","authors":"Guillaume Gales, S. Chambon, Alain Crouzil, J. McDonald","doi":"10.1109/WIAMIS.2012.6226761","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226761","url":null,"abstract":"Seed propagation-based stereo matching can help to reduce ambiguity occuring when a pixel from one image has different putative correspondents in the other one due to difficult areas (repetitive patterns, homogeneous areas, occlusions and depth discontinuities). They rely on previously computed matches (seeds) to reduce the size of the search area, and thus the number of candidates. One approach of these iterative methods selects the “best” seed at each iteration to prevent the propagation of errors. However, little attention has been brought to this best-first selection criterion for which a correlation score is usually employed. This value itself does not consider any ambiguity and is not well-suited to select the most reliable seed. Therefore, in this paper we introduce a reliability measure. It has the advantage of taking into account information from the other candidates, and leads, according to the provided experimental evaluation, to better results than the correlation score alone.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125162585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Exploiting gaze movements for automatic video annotation 利用凝视运动进行自动视频注释
S. Vrochidis, I. Patras, Y. Kompatsiaris
{"title":"Exploiting gaze movements for automatic video annotation","authors":"S. Vrochidis, I. Patras, Y. Kompatsiaris","doi":"10.1109/WIAMIS.2012.6226766","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226766","url":null,"abstract":"This paper proposes a framework for automatic video annotation by exploiting gaze movements during interactive video retrieval. In this context, we use a content-based video search engine to perform video retrieval, during which, we capture the user eye movements with an eye-tracker. We exploit these data by generating feature vectors, which are used to train a classifier that could identify shots of interest for new users. The queries submitted by new users are clustered in search topics and the viewed shots are annotated as relevant or non-relevant to the topics by the classifier. The evaluation shows that the use of aggregated gaze data can be utilized effectively for video annotation purposes.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121499102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Recovering quasi-real occlusion-free textures for facade models by exploiting fusion of image and laser street data and image inpainting 利用图像与激光街道数据融合和图像绘图技术恢复立面模型的准真实无遮挡纹理
K. Hammoudi, F. Dornaika, B. Soheilian, B. Vallet, J. McDonald, N. Paparoditis
{"title":"Recovering quasi-real occlusion-free textures for facade models by exploiting fusion of image and laser street data and image inpainting","authors":"K. Hammoudi, F. Dornaika, B. Soheilian, B. Vallet, J. McDonald, N. Paparoditis","doi":"10.1109/WIAMIS.2012.6226763","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226763","url":null,"abstract":"In this paper we present relevant results for the texturing of 3D urban facade models by exploiting the fusion of terrestrial multi-source data acquired by a Mobile Mapping System (MMS) and image inpainting. Current 3D urban facade models are often textured by using images that contain parts of urban objects that belong to the street. These urban objects represent in this case occlusions since they are located between the acquisition system and the facades. We show the potential use of georeferenced images and 3D point clouds that are acquired at street level by the MMS in generating occlusion-free facade textures. We describe a methodology for reconstructing quasi-real textures of facades that are highly occluded by wide frontal objects.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"291 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134607469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Structural based side information creation with improved matching criteria for Wyner-Ziv video coding 基于结构的侧信息创建与改进的匹配标准的Wyner-Ziv视频编码
Catarina Brites, J. Ascenso, F. Pereira
{"title":"Structural based side information creation with improved matching criteria for Wyner-Ziv video coding","authors":"Catarina Brites, J. Ascenso, F. Pereira","doi":"10.1109/WIAMIS.2012.6226756","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226756","url":null,"abstract":"The Wyner-Ziv video coding (WZVC) efficiency is highly dependent on the quality of the side information (SI) created at the decoder, typically through motion compensated frame interpolation (MCFI) techniques. Since the decoder only has available some reference decoded frames, SI creation turns out to be a difficult problem in WZVC. In most the MCFI techniques available, the matching criterion used for motion estimation only takes into account the (mean of absolute) pixel differences within a block which is limiting for some content types. This paper proposes a structural based SI creation approach with improved matching criteria combining local image features, obtained from the histogram of oriented gradients (HOG), with a boundary continuity criterion and the typical pixel differences criterion. With the structural based SI creation, the motion estimation process becomes more robust, e.g. to illumination changes, thus improving the SI quality (up to 0.4 dB) and reducing the bitrate (up to 6 % in terms of the Bjontegaard metric) regarding a state-of-the-art MCFI solution.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115423315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Field test of SkyMedia HD/3D content augmentation system for immersive media experiences SkyMedia高清/3D内容增强系统的现场测试,用于沉浸式媒体体验
A. Campi, J. Maillard, Marc Leny, R. Suffritti, Massimo Neri
{"title":"Field test of SkyMedia HD/3D content augmentation system for immersive media experiences","authors":"A. Campi, J. Maillard, Marc Leny, R. Suffritti, Massimo Neri","doi":"10.1109/WIAMIS.2012.6226772","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226772","url":null,"abstract":"This paper presents the field test results of the novel SkyMedia 3D/HD system in a marathon race setup. The augmentation system is tailored for immersive media experiences such as public live events in which people can interact together in order to improve user's experience. The paper reports the first public demonstration of the HD/3D content augmentation system conducted in the Turin Marathon race setup. A large set of SkyMedia Multimedia Service Platform (MSP) building blocks has been tested and validated in a real environment.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115541619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced multi-view dancing videos synchronisation 增强多视图舞蹈视频同步
Xinyu Lin, V. Kitanovski, Qianni Zhang, E. Izquierdo
{"title":"Enhanced multi-view dancing videos synchronisation","authors":"Xinyu Lin, V. Kitanovski, Qianni Zhang, E. Izquierdo","doi":"10.1109/WIAMIS.2012.6226773","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226773","url":null,"abstract":"This paper describes a system for automatically synchronising multi-view video sequences of Salsa dancing recorded with multimodal capturing platform. The multimodal capturing setup consists of audiovisual streams along with depth maps and inertial measurements. Part of the dataset was video sequences captured from machine vision cameras and Microsoft Kinect sensor that were not temporal synchronised during the capturing stage. As an essential step, we proposed efficient solutions for synchronisation of these data based on co-occurrence appearance changes. In order to improve the accuracy, the proposed system employed state-of-art body detection and tracking algorithm to obtain Region of Interest, within which the appearance changes are analysed. The accurately synchronised video set can then be further analysed and augmented for visualisation and evaluation of dancing performance.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128548782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
An empirical study on the combination of surf features with VLAD vectors for image search 结合冲浪特征与VLAD矢量进行图像搜索的实证研究
Eleftherios Spyromitros Xioufis, S. Papadopoulos, Y. Kompatsiaris, Grigorios Tsoumakas, I. Vlahavas
{"title":"An empirical study on the combination of surf features with VLAD vectors for image search","authors":"Eleftherios Spyromitros Xioufis, S. Papadopoulos, Y. Kompatsiaris, Grigorios Tsoumakas, I. Vlahavas","doi":"10.1109/WIAMIS.2012.6226771","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226771","url":null,"abstract":"The study of efficient image representations has attracted significant interest due to the computational needs of large-scale applications. In this paper we study the performance of the recently proposed VLAD method for aggregating local image descriptors when combined with SURF features, in the domain of image search. The experiments show that when SURF features are used as local image descriptors, VLAD attains better performance compared to using SIFT features. We also study how the average number of local image descriptors extracted per image affects the performance and show that by controlling this number we are able to adjust the trade off between feature extraction time and search accuracy. Finally, we examine the retrieval performance of the proposed scheme with varying levels of distractor images.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"3 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114044263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Student-t background modeling for persons' fall detection through visual cues 基于视觉线索的人跌倒检测的学生背景建模
Konstantinos Makantasis, A. Doulamis, N. Matsatsinis
{"title":"Student-t background modeling for persons' fall detection through visual cues","authors":"Konstantinos Makantasis, A. Doulamis, N. Matsatsinis","doi":"10.1109/WIAMIS.2012.6226767","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226767","url":null,"abstract":"This article presents a robust, real-time background subtraction algorithm able to operate properly in complex dynamically changing visual conditions and indoor/outdoor environments, based on a single, cheap monocular camera, like a webcam. This algorithm uses an image grid and models each pixel of the grid as a mixture of adaptive Student-t distributions. This approach makes this algorithm robust and efficient, in terms of computational cost and memory requirements, and thus suitable for large scale implementations. The proposed algorithm is applied in the problem of humans' fall detection that presents high complexity of visual content. Finally, the performances of this scheme and the scheme proposed in [1] by the same authors, are compared.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125676868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Reconstruction for 3D immersive virtual environments 三维沉浸式虚拟环境重建
D. Alexiadis, G. Kordelas, K. C. Apostolakis, J. D. Agapito, Jesús Vegas, E. Izquierdo, P. Daras
{"title":"Reconstruction for 3D immersive virtual environments","authors":"D. Alexiadis, G. Kordelas, K. C. Apostolakis, J. D. Agapito, Jesús Vegas, E. Izquierdo, P. Daras","doi":"10.1109/WIAMIS.2012.6226760","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226760","url":null,"abstract":"The future of tele-conferencing is towards multi-party 3D Tele-Immersion (TI) and TI environments that can support realistic inter-personal communications and virtual interaction among participants. In this paper, we address two important issues, pertinent to TI environments. The paper focuses on techniques for the real-time, 3D reconstruction of moving humans from multiple Kinect devices. The off-line generation of real-life 3D scenes from visual data, captured by non-professional users is also addressed. Experimental results are provided that demonstrate the efficiency of the methods, along with an example of mixing real with virtual in a shared space.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127620925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Face perception: Influence of location and number in videos 人脸感知:视频中位置和数量的影响
A. Rahman, D. Pellerin, D. Houzet
{"title":"Face perception: Influence of location and number in videos","authors":"A. Rahman, D. Pellerin, D. Houzet","doi":"10.1109/WIAMIS.2012.6226779","DOIUrl":"https://doi.org/10.1109/WIAMIS.2012.6226779","url":null,"abstract":"The study is about the influence of face in videos. In the experiment, the participants were instructed free viewing of various videos. The resulting eye positions are compared to the hand-labeled faces to evaluate the impact of location and number of faces in the visual field. Here, we defined three regions - Inside (I), Periphery (P), and Outside (O) - to categorize video frames with one or two faces based on the location of faces. Then we perform the evaluation of all these categories to get resulting scores. The scores indicate that the impact of face is a function of its eccentricity such that it falls as the face is far from the center of the visual scene. Similarly, the number of faces also limits face attraction.","PeriodicalId":346777,"journal":{"name":"2012 13th International Workshop on Image Analysis for Multimedia Interactive Services","volume":"306 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122726218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信