{"title":"Widely linear kernel-based adaptive filters","authors":"P. Bouboulis, S. Theodoridis, M. Mavroforakis","doi":"10.5281/ZENODO.42313","DOIUrl":"https://doi.org/10.5281/ZENODO.42313","url":null,"abstract":"Widely linear estimation for complex-valued signal processing is growing in popularity, especially in the cases where the involved signals exhibit non-circular characteristics. In this paper, the extended Wirtinger's calculus in complex Reproducing Kernel Hilbert Spaces (RKHS), presented in [1], is adopted to derive complex kernel-based widely-linear estimation filters. Furthermore, we illuminate several important characteristics of widely linear filters, which, to our knowledge, haven't been considered before. Our results indicate that, in contrast to many cases where the gains from adopting widely linear estimation filters, instead of ordinary linear filters, are rudimentary, for the case of kernel-based widely linear filters significant performance improvements can be obtained.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127769658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hidden conditional random fields for classification of imaginary motor tasks from EEG data","authors":"J. D. Saa, M. Çetin","doi":"10.5281/ZENODO.42672","DOIUrl":"https://doi.org/10.5281/ZENODO.42672","url":null,"abstract":"Brain-computer interfaces (BCIs) are systems that allow the control of external devices using information extracted from brain signals. Such systems find application in rehabilitation of patients with limited or no muscular control. One mechanism used in BCIs is the imagination of motor activity, which produces variations on the power of the electroencephalography (EEG) signals recorded over the motor cortex. In this paper, we propose a new approach for classification of imaginary motor tasks based on hidden conditional random fields (HCRFs). HCRFs are discriminative graphical models that are attractive for this problem because they involve learned statistical models matched to the classification problem; they do not suffer from some of the limitations of generative models; and they include latent variables that can be used to model different brain states in the signal. Our approach involves auto-regressive modeling of the EEG signals, followed by the computation of the power spectrum. Frequency band selection is performed on the resulting time-frequency representation through feature selection methods. These selected features constitute the data that are fed to the HCRF, parameters of which are learned from training data. Inference algorithms on the HCRFs are used for classification of motor tasks. We experimentally compare this approach to the best performing methods in BCI competition IV and the results show that our approach overperforms all methods proposed in the competition. In addition, we present a comparison with an HMM-based method, and observe that the proposed method produces better classification accuracy.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129049396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speaker localization and speech separation using Phase Difference versus Frequency distribution","authors":"Ning Ding, N. Hamada","doi":"10.5281/ZENODO.42372","DOIUrl":"https://doi.org/10.5281/ZENODO.42372","url":null,"abstract":"This paper proposes a novel sparse source separation method using a pair of microphones. The method is based on time-frequency (T-F) decomposition, applies the weighted Hough transform to the Phase Difference (PD) versus Frequency (PD-F) distribution of received mixture signals, and estimates source directions. Then, the estimated source directions and harmonic structure are used to separate the mixture signals. The effectiveness of the proposed method is shown through experiments in real acoustic circumstances.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"171 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115642633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Predictive visual saliency model for surveillance video","authors":"Fahad Fazal Elahi Guraya, F. A. Cheikh","doi":"10.5281/ZENODO.42675","DOIUrl":"https://doi.org/10.5281/ZENODO.42675","url":null,"abstract":"Visual saliency models(VSM) mimic the human visual system to distinguish the salient regions from the non-salient ones in an image or video. Most of the visual saliency model in the literature are static hence they can only be used for images. Motion is important information in case of videos that is not present in still images and thus not used in most of VSMs. There are very few saliency models which take into account both static and motion information. And there is no saliency model in the literature which uses static features, motion, prediction and face feature. In this paper we propose a predictive visual saliency model for video that uses static features, motion feature and face detection to predict the evolution in time of the human attention or the saliency. We introduce a new approach to compute saliency map for videos using salient motion information and prediction. The proposed model is tested and validated for surveillance videos.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125427136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Lorente, G. Piñero, A. Vidal, J. A. Belloch, Alberto González
{"title":"Parallel implementations of beamforming design and filtering for microphone array applications","authors":"J. Lorente, G. Piñero, A. Vidal, J. A. Belloch, Alberto González","doi":"10.5281/ZENODO.42595","DOIUrl":"https://doi.org/10.5281/ZENODO.42595","url":null,"abstract":"One of the main limitations of microphone array algorithms for audio applications has been their high computational cost in real acoustic environments when real-time signal processing is absolutely required. Regarding audio/speech signal processing, beamforming algorithms have been used for the recovery of acoustic signals from their observations when they are corrupted by noise, reverberation and other interfering signals. In order to reduce their high computational load, frequency-based filtering have been used to achieve a real time application. Our research focuses on the use of different multicore/manycore platforms in order to achieve a real time beamforming application in the time domain. Efficient algorithms has been proposed and tested in several devices and results have shown that GPU implementation of beamforming design and filtering outperforms multicore implementation in computational cost terms. The performance obtained suggests that GPU implementation paves the way for low-cost real-time audio beamforming applications.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126620869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Waveform processing-domain diversity and ATR","authors":"C. Baker, M. Inggs, A. Mishra","doi":"10.5281/ZENODO.42743","DOIUrl":"https://doi.org/10.5281/ZENODO.42743","url":null,"abstract":"Classification of targets by radar has proved to be notoriously difficult with the best systems still yet to attain sufficiently high levels of performance and reliability. In this paper we take cues from nature to propose and examine a novel approach to target classification, based on diversity, as applied in the waveform processing domain. In the new approach, data is processed in multiple, different, forms, in parallel. The two forms that we have exploited in this work are the time and space domains. Most classification and Radar image analysis algorithms handle Radar data in the space domain only. Using simulation studies, we first show that phase or k-space data contains additional information. It is also shown that, counter-intuitively, having a sharp spatial Radar image (with reduced side-lobes) in fact worsens classification performance. Lastly, the proposed architecture is validated against a traditional, unitary based classification scheme.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127052103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Person specific activity recognition using fuzzy learning and Discriminant Analysis","authors":"Alexandros Iosifidis, A. Tefas, I. Pitas","doi":"10.5281/ZENODO.42545","DOIUrl":"https://doi.org/10.5281/ZENODO.42545","url":null,"abstract":"One of the major issues that activity recognition methods should be able to face is the style variations observed in the execution of activities performed by different humans. In order to address this issue we propose a person-specific activity recognition framework in which human identification proceeds activity recognition. After recognizing the ID of the human depicted in a video stream, a person-specific activity classifier is responsible to recognize the activity performed by the human. Exploiting the enriched human body information captured by a multi-camera setup, view-invariant person and activity representations are obtained. The classification procedure involves Fuzzy Vector Quantization and Linear Discriminant Analysis. The proposed method is applied on drinking and eating activity recognition as well as on other activity recognition tasks. Experiments show that the person-specific approach outperforms the person-independent one.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126031248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust video watermarking using maximum likelihood decoder","authors":"A. Diyanat, M. Akhaee, S. Ghaemmaghami","doi":"10.5281/ZENODO.42533","DOIUrl":"https://doi.org/10.5281/ZENODO.42533","url":null,"abstract":"In this paper, a robust multiplicative video watermarking scheme is presented. We segment the video signal into 3-D blocks like cubes, and then apply 3-D wavelet transform to each block. The watermark is inserted through multiplying the low frequency wavelet coefficients by a constant parameter that controls the power of the watermark. The proposed watermark extraction procedure is based on the maximum likelihood rule applied to the watermarked wavelet coefficients.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122596545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jean-Charles Naud, Quentin L. Meunier, D. Ménard, O. Sentieys
{"title":"Fixed-point accuracy evaluation in the context of conditional structures","authors":"Jean-Charles Naud, Quentin L. Meunier, D. Ménard, O. Sentieys","doi":"10.5281/ZENODO.42638","DOIUrl":"https://doi.org/10.5281/ZENODO.42638","url":null,"abstract":"The automation of fixed-point conversion requires generic methods to study accuracy degradation. Accuracy evaluation is often based on simulation approaches, at the cost of an important execution time. This paper proposes a new approach using fast analytical noise power propagation considering conditional structures. These structures are generated from programming language statements such as if-then-else or switch. The proposed model takes two key points into account in fixed-point design: first, an alternative processing of noise depending on the condition; second, decision errors generated by quantization noise affecting the condition. This method is integrated in the fixed-point conversion process and uses path probabilities of execution alternatives. This work extends existing analytical approaches for fixed-point conversion. Experiments of our analytical method show that it has a fairly accurate noise power estimation compared to the real accuracy degradation.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121534843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Perfect reconstruction DFT modulated oversampled filter bank transceivers","authors":"Siavash Rahimi, B. Champagne","doi":"10.5281/ZENODO.42330","DOIUrl":"https://doi.org/10.5281/ZENODO.42330","url":null,"abstract":"This paper proposes a novel method for the design of perfect reconstruction (PR), discrete Fourier transform (DFT) modulated oversampled filter banks (FB) for application in multi-carrier transceiver systems. The PR property is guaranteed by enforcing a paraunitary constraint on the polyphase matrix of the transmit or receive sub-systems. The desired polyphase matrix is obtained via embedding of lower dimensional paraunitary building blocks, each expressed in terms of a limited set of design parameters through a factorization based on Givens rotations. These parameters are adjusted to minimize the stop-band energy of the subband filters and thus improve their spectral containment. The performance of the proposed FB is investigated in a multi-carrier transceiver application, where it is compared with OFDM and other recently proposed FB structures. Numerical results show that the proposed approach can lead to significant reduction in the bit error rate (BER), as compared to the benchmark approaches, when used in the presence of narrowband interference or frequency offset.","PeriodicalId":331889,"journal":{"name":"2011 19th European Signal Processing Conference","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124749518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}