{"title":"Postprocessing of block-coded videos for deflicker and deblocking","authors":"J. Ren, Jiaying Liu, Mading Li, Zongming Guo","doi":"10.1109/ICASSP.2013.6637928","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6637928","url":null,"abstract":"In this paper, we propose a novel postprocessing method to suppress both the flickering and blocking artifacts in block-coded videos. For reducing the flickering effect between adjacent frames, we propose an adaptive multi-scale motion filtering method to maintain the motion coherence of processed video. For blocking artifacts suppression, we adopt a patch-based scheme in which similar patches are grouped in a spatio-temporal domain and each patch group is recovered by solving a low rank matrix completion problem. Experimental results show that the proposed method can significantly reduce the flickering and blocking artifacts in the decoded videos.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"297 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121822475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pingyang Dai, Yanlong Luo, Weisheng Liu, Cuihua Li, Yi Xie
{"title":"Robust visual tracking via part-based sparsity model","authors":"Pingyang Dai, Yanlong Luo, Weisheng Liu, Cuihua Li, Yi Xie","doi":"10.1109/ICASSP.2013.6637963","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6637963","url":null,"abstract":"The sparse representation has been widely used in many areas including visual tracking. The part-based representation performs outstandingly by using non-holistic templates to against occlusion. This paper combined them and proposed a robust object tracking method using part-based sparsity model for tracking an object in a video sequence. In the proposed model, one object is represented by image patches. The candidates of these patches are sparsely represented in the space which is spanned by the patch templates and trivial templates. The part-based method takes the spatial information of each patch into consideration, where the vote maps of multiple patches are used. Furthermore, the update scheme keeps the representative templates of each part dynamically. Therefore, trackers can effectively deal with the changes of appearances and heavy occlusion. On various public benchmark videos, the abundant results of experiments demonstrate that the proposed tracking method outperforms many existing state-of-the-arts algorithms.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116083635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Mohseni, M. Kringelbach, M. Woolrich, T. Aziz, P. P. Smith
{"title":"A new approach to the fusion of EEG and MEG signals using the LCMV beamformer","authors":"H. Mohseni, M. Kringelbach, M. Woolrich, T. Aziz, P. P. Smith","doi":"10.1109/ICASSP.2013.6637841","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6637841","url":null,"abstract":"In this paper, we demonstrate a new approach for the fusion of multichannel signals. We show how this method can be used to combine signals from magnetometer and gradiometer sensors used in magnetoencephalography (MEG). This approach works by assuming that the lead-fields have multiplicative errors which in turn leads to an under-determined problem. To solve this problem, we impose two constraints that result in closed-from solutions: i) one set of sensors is error-free, ii) the norm of the multiplicative error is bounded. These prior assumptions to estimate the error are used in the linearly constraint minimum variance (LCMV) spatial filter to improve the optimisation. Although we focus on the fusion of MEG sensors, this approach can be employed for multimodal fusion of other multichannel signals such as MEG and EEG signals.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121999566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
H. Mohseni, M. Kringelbach, M. Woolrich, T. Aziz, P. P. Smith
{"title":"A non-Gaussian LCMV beamformer for MEG source reconstruction","authors":"H. Mohseni, M. Kringelbach, M. Woolrich, T. Aziz, P. P. Smith","doi":"10.1109/ICASSP.2013.6637850","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6637850","url":null,"abstract":"Evidence suggests that magnetoencephalogram (MEG) data have characteristics with non-Gaussian distribution, however, standard methods for source localisation assume Gaussian behaviour. We present a new general method for non-Gaussian source estimation of stationary signals for localising brain activity in the MEG data. By providing a Bayesian formulation for linearly constraint minimum variance (LCMV) beamformer, we extend this approach and show that how the source probability density function (pdf), which is not necessarily Gaussian, can be estimated. The proposed non-Gaussian beamformer is shown to give better spatial estimates than the LCMV beamformer, in both simulations incorporating non-Gaussian signal and in real MEG measurements.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"520 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131920265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis and reduction of estimation bias for an iterative frequency estimator of complex sinusoid","authors":"Jan-Ray Liao, Chun-Ming Chen","doi":"10.1109/ICASSP.2013.6638844","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6638844","url":null,"abstract":"Frequency estimation of a complex sinusoidal signal is a fundamental problem in signal processing. In this regard, Aboutanios and Mulgrew (A&M) proposed an iterative frequency estimator which can approach the theoretical bound in two iterations, thus, made it one of the best iterative estimators. In this paper, we theoretically analyze the two versions of the A&M estimator and show that the estimation biases of the two versions are not equivalent. The results of the theoretical analysis indicate that the bias of the first iteration can be accurately predicted by a polynomial equation. We then propose to use the roots of the polynomial equation to improve the estimation and reduce the bias. Experiments show that the proposed new estimator can significantly reduce the bias.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"379 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115307402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable image representation using improved retargeting pyramid","authors":"Yuichi Tanaka, K. Shirai","doi":"10.1109/ICASSP.2013.6638051","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6638051","url":null,"abstract":"The retargeting pyramid (RP) method is a good alternative to the well-known Laplacian pyramid (LP) approach for multiscale image decomposition. RP can be obtained by replacing the low-pass filtering and downsampling processes in LP with content-aware image resizing (a.k.a. retargeting), which is a technique being developed in computer vision research. In this paper, we improve RP so that it obtains good scalable image representation. The improved RP is then integrated with a well-known multiscale-multidirection (MSMD) transform, contourlet transform, to construct a saliency-oriented MSMD image representation. In the experiment, our decomposition outperforms the conventional pyramid structures.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115603650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stability of Residual Acoustic Noise Variance in active control of stochastic noise","authors":"Iman Tabatabaei Ardekani, W. Abdulla","doi":"10.1109/ICASSP.2013.6637673","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6637673","url":null,"abstract":"This paper concerns about the theoretical stability of the adaptation process performed by the Filtered-x Least Mean Square (FxLMS) algorithm in active control of acoustic noise. A dynamic model for the Variance of Residual Acoustic Noise (VRAN) is developed and it is shown that the stability of this model is a sufficient condition for the stability of the adaptation process. The basic rules governing the VRAN root locus are developed, based on which an upper-bound for the adaptation step-size is derived. This upper-bound can apply to a general case with an arbitrary secondary path, unlike the traditional upper-bound used in adaptive filter theory, which was derived only for pure delay secondary paths.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115609055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A maximum likelihood approach for underdetermined TDOA estimation","authors":"Janghoon Cho, C. Yoo","doi":"10.1109/ICASSP.2013.6638410","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6638410","url":null,"abstract":"This paper considers the estimation of time difference of arrival (TDOA) of multiple sparse sources when the number of sources is larger than that of the microphones. White Gaussian noise is assumed present at the microphone in addition to the instantaneously mixed sources. The TDOA estimate is obtained based on a maximum likelihood (ML) criteria, and the likelihood is obtained by marginalizing the joint probability over the sources. Explicit marginalization is mathematically intractable, thus the joint probability is approximated as a summation of several Dirac delta functions by assuming the time-frequency component of the source distribution to be a complex-valued super Gaussian, and the global maximum point of the marginalized joint probability is found by Markov chain Monte Carlo sampling. Experimental results show that the proposed algorithm outperforms TDOA estimation using a well-known Gaussian based approximation method in terms of root-mean-square error (RMSE).","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"95 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124401329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Map-assisted Kalman filtering","authors":"M. Mansour, D. Waters","doi":"10.1109/ICASSP.2013.6638250","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6638250","url":null,"abstract":"We describe a method for incorporating map information to the Kalman filter that is commonly used in indoor and outdoor navigation systems. The map information is provided as a measurement to the Kalman filter to ensure the consistency of the Kalman estimate. The proposed method provides huge computational saving over common map matching algorithms that use the more computationally expensive particle filter. We show indoor navigation examples that highlight the efficiency of the proposed algorithm.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124418569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pooria Pakrooh, L. Scharf, A. Pezeshki, Yuejie Chi
{"title":"Analysis of fisher information and the Cramer-Rao bound for nonlinear parameter estimation after compressed sensing","authors":"Pooria Pakrooh, L. Scharf, A. Pezeshki, Yuejie Chi","doi":"10.1109/ICASSP.2013.6638944","DOIUrl":"https://doi.org/10.1109/ICASSP.2013.6638944","url":null,"abstract":"In this paper, we analyze the impact of compressed sensing with random matrices on Fisher information and the CRB for estimating unknown parameters in the mean value function of a multivariate normal distribution. We consider the class of random compression matrices that satisfy a version of the Johnson-Lindenstrauss lemma, and we derive analytical lower and upper bounds on the CRB for estimating parameters from randomly compressed data. These bounds quantify the potential loss in CRB as a function of Fisher information of the non-compressed data. In our numerical examples, we consider a direction of arrival estimation problem and compare the actual loss in CRB with our bounds.","PeriodicalId":183968,"journal":{"name":"2013 IEEE International Conference on Acoustics, Speech and Signal Processing","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124438208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}