2011 IEEE International Conference on Multimedia and Expo: Latest Papers (Page 2)

Multi-modality likelihood based particle filtering for 2-D direction of arrival tracking using a single acoustic vector sensor

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6011965

X. Zhong, A. Premkumar, A. Madhukumar, C. Lau

Abstract: The general problem addressed in this paper is tracking the 2-D direction of arrival (DOA) of an acoustic source signal by using a single acoustic vector sensor (AVS). A Bayesian framework and its particle filtering implementation are introduced to adapt to the underwater ambient noise environment, in which both interference and background noise exist. Several innovations are explored here: 1) a particle filtering based acoustic source tracking algorithm for AVS is developed; and 2) by using a multi-modality likelihood model to model the source detection and false alarm separately, the algorithm is able to alleviate the effect of noise and interference. In particular, by employing additional acoustic information, the proposed approach is able to track the 2-D DOA by using a single AVS. The performance of the proposed approach is fully investigated under different simulated ambient noise environments. Experimental results show that the proposed algorithm outperforms the traditional Capon beamforming approach and is able to lock on to the 2-D DOA of the source even in a very challenging environment.

Citations: 13
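
The idea of a likelihood with separate detection and false-alarm modes, as described in the abstract above, can be illustrated with a small bootstrap particle filter. This is a minimal sketch under stated assumptions, not the authors' algorithm: it assumes each measurement is already a noisy (azimuth, elevation) reading rather than raw AVS signals, and the names `likelihood`, `pf_step`, `run_demo` and the values of `sigma`, `p_false` and `q` are hypothetical choices made for illustration.

```python
import math
import random


def wrap(angle):
    """Wrap an angle in radians to [-pi, pi)."""
    return (angle + math.pi) % (2 * math.pi) - math.pi


def likelihood(z, particle, sigma=0.1, p_false=0.3):
    """Two-mode likelihood of a measurement z = (azimuth, elevation).

    Detection mode: z is a noisy reading of the particle's DOA (Gaussian).
    False-alarm mode: z is clutter, uniform over the whole angular range.
    sigma and p_false are illustrative values, not taken from the paper.
    """
    d_az = wrap(z[0] - particle[0])
    d_el = z[1] - particle[1]
    gauss = math.exp(-(d_az ** 2 + d_el ** 2) / (2 * sigma ** 2)) / (2 * math.pi * sigma ** 2)
    uniform = 1.0 / (2 * math.pi * math.pi)  # azimuth spans 2*pi, elevation spans pi
    return p_false * uniform + (1.0 - p_false) * gauss


def pf_step(particles, z, rng, q=0.05):
    """One predict / weight / resample cycle of a bootstrap particle filter."""
    half_pi = math.pi / 2
    # Predict: random-walk motion model (an assumption of this sketch).
    moved = [
        (wrap(az + rng.gauss(0.0, q)), min(half_pi, max(-half_pi, el + rng.gauss(0.0, q))))
        for az, el in particles
    ]
    weights = [likelihood(z, p) for p in moved]
    # Multinomial resampling; random.choices accepts unnormalized weights.
    return rng.choices(moved, weights=weights, k=len(moved))


def estimate(particles):
    """Point estimate: circular mean for azimuth, arithmetic mean for elevation."""
    s = sum(math.sin(az) for az, _ in particles)
    c = sum(math.cos(az) for az, _ in particles)
    el = sum(e for _, e in particles) / len(particles)
    return math.atan2(s, c), el


def run_demo(seed=0, n_particles=1000, n_steps=40):
    """Track a slowly drifting synthetic source; every fourth measurement is clutter."""
    rng = random.Random(seed)
    particles = [
        (rng.uniform(-math.pi, math.pi), rng.uniform(-math.pi / 2, math.pi / 2))
        for _ in range(n_particles)
    ]
    truth = (0.5, 0.2)
    for t in range(n_steps):
        truth = (wrap(truth[0] + 0.01), truth[1])
        if t % 4 == 2:  # false alarm: measurement unrelated to the source
            z = (rng.uniform(-math.pi, math.pi), rng.uniform(-math.pi / 2, math.pi / 2))
        else:  # detection: noisy DOA reading
            z = (wrap(truth[0] + rng.gauss(0.0, 0.05)), truth[1] + rng.gauss(0.0, 0.05))
        particles = pf_step(particles, z, rng)
    return estimate(particles), truth


if __name__ == "__main__":
    est, truth = run_demo()
    print("estimate:", est, "truth:", truth)
```

The uniform term keeps every particle's weight above a floor, so a clutter measurement far from the particle cloud leaves the weights nearly equal instead of collapsing the cloud onto the outlier. A single-mode Gaussian likelihood lacks that floor.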

Image compression algorithm based on Hilbert Scanning of Embedded quadTrees: An introduction of the Hi-SET coder

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6011870

Jesús Jaime Moreno Escobar, X. Otazu

Abstract: In this work we present an effective and computationally simple algorithm for image compression based on Hilbert Scanning of Embedded quadTrees (Hi-SET). It allows an image to be represented as an embedded bitstream along a fractal function. Embedding is an important feature of modern image compression algorithms; Salomon [1, p. 614] notes that another feature, and perhaps a unique one, is achieving the best quality for the number of bits input by the decoder at any point during decoding. Hi-SET also possesses this feature. Furthermore, the coder is based on a quadtree partition strategy which, applied to image transform structures such as the discrete cosine or wavelet transform, yields energy clustering in both frequency and space. The coding algorithm is composed of three general steps, using just a list of significant pixels. The proposed coder is implemented for gray-scale and color image compression. Hi-SET compressed images are, on average, 6.20 dB better than those obtained by other compression techniques based on Hilbert scanning. Moreover, Hi-SET improves the image quality by 1.39 dB and 1.00 dB in gray-scale and color compression, respectively, when compared with the JPEG2000 coder.

Citations: 5
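
The Hilbert scan referred to in the abstract above visits every cell of a square grid along a space-filling curve, so that consecutive samples in the 1-D stream are always spatial neighbours. The sketch below uses the widely known iterative index-to-coordinate conversion for a generic Hilbert curve. It is an illustration only: the abstract does not specify Hi-SET's curve orientation or how the scan is combined with the quadtree partition, and the helper names `hilbert_d2xy` and `hilbert_scan` are hypothetical.

```python
def hilbert_d2xy(order, d):
    """Map distance d along a Hilbert curve to (x, y) on a 2**order x 2**order grid."""
    n = 1 << order
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:
            # Rotate/flip the current sub-square so the curve stays connected.
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def hilbert_scan(block):
    """Flatten a square block (side length a power of two) along the Hilbert curve.

    block is indexed as block[y][x].
    """
    side = len(block)
    order = side.bit_length() - 1
    if side == 0 or (1 << order) != side or any(len(row) != side for row in block):
        raise ValueError("block must be square with a power-of-two side")
    out = []
    for d in range(side * side):
        x, y = hilbert_d2xy(order, d)
        out.append(block[y][x])
    return out


if __name__ == "__main__":
    demo = [[4 * r + c for c in range(4)] for r in range(4)]
    print(hilbert_scan(demo))
```

Because each step of the curve moves to a 4-connected neighbour, coefficients that cluster spatially also end up clustered in the 1-D stream, which is the property a scan-based embedded coder can exploit.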

Real-time local stereo matching using guided image filtering

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012131

A. Hosni, M. Bleyer, Christoph Rhemann, M. Gelautz, C. Rother

Abstract: Adaptive support weight algorithms represent the state of the art in local stereo matching. Their limitation is a high computational demand, which makes them unattractive for many (real-time) applications. To our knowledge, the algorithm proposed in this paper is the first local method which is both fast (real-time) and produces results comparable to global algorithms. A key insight is that the aggregation step of adaptive support weight algorithms is equivalent to smoothing the stereo cost volume with an edge-preserving filter. From this perspective, the original adaptive support weight algorithm [1] applies bilateral filtering on cost volume slices, and the reason for its poor computational behavior is that bilateral filtering is a relatively slow process. We suggest using the recently proposed guided filter [2] to overcome this limitation. Like the bilateral filter, this filter has edge-preserving properties, but it can be implemented in a very fast way, which makes our stereo algorithm independent of the size of the match window. The GPU implementation of our stereo algorithm can process stereo images with a resolution of 640 × 480 pixels and a disparity range of 26 pixels at 25 fps. According to the Middlebury online ranking, our algorithm achieves rank 14 out of over 100 submissions and is not only the best performing local stereo matching method, but also the best performing real-time method.

Citations: 87
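
The pipeline described in the abstract above (build a cost volume, smooth each disparity slice with an edge-preserving guided filter, then pick the lowest-cost disparity per pixel) can be sketched in plain Python. This is a simplified illustration, not the authors' GPU implementation: it assumes gray-scale images stored as nested lists, an absolute-difference matching cost, a fixed `border_cost`, and illustrative `r` and `eps` values. The function names are hypothetical. The box means use a summed-area table, so the per-pixel work does not grow with the window radius.

```python
import random


def box_mean(img, r):
    """Mean over a (2r+1) x (2r+1) window (clipped at borders) via a summed-area table."""
    h, w = len(img), len(img[0])
    sat = [[0.0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0.0
        for x in range(w):
            row_sum += img[y][x]
            sat[y + 1][x + 1] = sat[y][x + 1] + row_sum
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        y0, y1 = max(0, y - r), min(h, y + r + 1)
        for x in range(w):
            x0, x1 = max(0, x - r), min(w, x + r + 1)
            total = (sat[y1][x1] - sat[y1][x0]) - (sat[y0][x1] - sat[y0][x0])
            out[y][x] = total / ((y1 - y0) * (x1 - x0))
    return out


def guided_filter(guide, src, r=2, eps=1e-2):
    """Smooth src while following edges of guide: q = mean(a) * guide + mean(b)."""
    h, w = len(guide), len(guide[0])

    def prod(u, v):
        return [[u[y][x] * v[y][x] for x in range(w)] for y in range(h)]

    mean_i, mean_p = box_mean(guide, r), box_mean(src, r)
    corr_ii, corr_ip = box_mean(prod(guide, guide), r), box_mean(prod(guide, src), r)
    a = [[0.0] * w for _ in range(h)]
    b = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            var_i = corr_ii[y][x] - mean_i[y][x] ** 2
            cov_ip = corr_ip[y][x] - mean_i[y][x] * mean_p[y][x]
            a[y][x] = cov_ip / (var_i + eps)  # local linear coefficient
            b[y][x] = mean_p[y][x] - a[y][x] * mean_i[y][x]
    mean_a, mean_b = box_mean(a, r), box_mean(b, r)
    return [[mean_a[y][x] * guide[y][x] + mean_b[y][x] for x in range(w)] for y in range(h)]


def disparity_map(left, right, max_disp, r=2, eps=1e-2, border_cost=1.0):
    """Filter each cost-volume slice with the left image as guide, then take the argmin."""
    h, w = len(left), len(left[0])
    volume = []
    for d in range(max_disp + 1):
        cost = [
            [abs(left[y][x] - right[y][x - d]) if x >= d else border_cost for x in range(w)]
            for y in range(h)
        ]
        volume.append(guided_filter(left, cost, r, eps))
    return [
        [min(range(max_disp + 1), key=lambda d: volume[d][y][x]) for x in range(w)]
        for y in range(h)
    ]


def synthetic_pair(seed=0, h=12, w=24, disp=2):
    """Random-texture right image and a left image shifted by a constant disparity."""
    rng = random.Random(seed)
    right = [[rng.random() for _ in range(w)] for _ in range(h)]
    left = [[row[x - disp] if x >= disp else rng.random() for x in range(w)] for row in right]
    return left, right


if __name__ == "__main__":
    left, right = synthetic_pair()
    print(disparity_map(left, right, max_disp=4)[6])
```

A pure-Python loop like this is far from real-time. It only shows why the aggregation cost is independent of the match-window size once box sums are computed from a summed-area table.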

Application of recommendation methods for TV programs

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012112

H. Kosch, Günther Hölbling

Citations: 4

Novel cross-layer scheme for video transmission over LTE-based wireless systems

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012174

Sotirios Karachontzitis, T. Dagiuklas, Lampros Dounis

Citations: 23

Social focus of attention as a time function derived from multimodal signals

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012241

D. Korchagin, H. R. Abutalebi

Citations: 0

An efficient angle-based shape matching approach towards object recognition

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012233

Zhiyuan Zhang, Aixin Zhang, Jianhua Li, Shenghong Li

Citations: 0

Block merging for quadtree-based video coding

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012010

S. Oudin, Philipp Helle, J. Stegemann, Christian Bartnik, B. Bross, D. Marpe, H. Schwarz, T. Wiegand

Citations: 20

Real 3D interaction behind mobile phones for augmented environments

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6012155

Farid Abedan Kondori, Shahrouz Yousefi, Haibo Li

Citations: 9

Prediction Signal Aided Spatially Varying Transform

2011 IEEE International Conference on Multimedia and Expo Pub Date : 2011-07-11 DOI: 10.1109/ICME.2011.6011911

Cixun Zhang, K. Ugur, J. Lainema, A. Hallapuro, M. Gabbouj

Abstract: Spatially Varying Transform (SVT) is a technique introduced earlier to improve the coding efficiency of video coders [1][2]. SVT allows the position of the transform block within the macroblock to vary in order to better localize the underlying residual signal. The coding gains of SVT come with increased encoding complexity, because the encoder must search for the best Location Parameter (LP), which indicates the position of the transform. In this paper, a new technique called Prediction Signal Aided Spatially Varying Transform (PSASVT) is proposed that utilizes the gradient of the prediction signal to eliminate unlikely LPs. As the number of candidate LPs is reduced, fewer LPs are searched by the encoder, which reduces the encoding complexity. In addition, fewer overhead bits are needed to code the selected LP, and thus the coding efficiency can be improved. Experimental results show that the number of LPs to be tested in RDO is reduced on average by more than 20%. This reduction in encoding complexity is achieved with a slight increase in coding efficiency, as the number of candidate LPs is reduced. The increase in decoding complexity is small.

Citations: 0