{"title":"Mixed Guassian and uniform impulse noise analysis using robust estimation for digital images","authors":"Jie Xiang Yang, H. Wu","doi":"10.1109/ICDSP.2009.5201092","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201092","url":null,"abstract":"Previous work on mixed Gaussian and impulse noise (MGIN) reduction has impressive quantitative results. However, the estimation of the statistical properties of the MGIN model that varies within a wide range has not been fully investigated. In this paper, statistical properties of the MGIN model are analyzed in detail with a robust estimation. The paper also proposes a two-stage impulse-then-Gaussian filter for MGIN suppression. which makes use of the estimated statistical properties of MGIN. The proposed filtering scheme applies a impulse proportion adaptive median filter (IPAMF) to impulse noise suppression, and a state-of-the-art discrete cosine transform (DCT) domain filter to Gaussian noise reduction. Numerical results, in terms of the peak signal-to-noise ratio (PSNR), and visual samples demonstrate that the proposed filtering scheme achieves better performance of noise reduction than two existing MGIN filtering schemes.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115880363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A new approach for spoken language identification based on sequence kernel SVMs","authors":"A. Ziaei, S. Ahadi, H. Yeganeh, S. M. Mirrezaie","doi":"10.1109/ICDSP.2009.5201071","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201071","url":null,"abstract":"A new back-end classifier for GMM-LM based language identification systems is proposed in this paper. The proposed system consists of a mapping matrix and a back-end classifier of SVMs as its main parts, located in series after the GMM-LM system. While the mapping matrix maps the language model's output vectors to a new space in which the languages are more separable than before, each SVM in the SVM bank-end classifier separates one language from the others. A new sequence kernel is used for each SVM in the bank-end classifier. As a final stage, a fusion block carries out the task of fusing the SVM bank-end scores with those of the GMM-based LID to achieve higher accuracies. We show that not only our new sequence kernel-based SVMs separate languages more efficiently than common Gaussian mixture and GLDS SVM back-end classifiers, but also our new mapping matrix outperforms common linear discriminant matrix in separating classes from each other and finally the introduction of fusion block leads to even superior performance. The overall accuracy of the LID is noticeably increased in comparison with the other LDA-GMM and LDAGLDS SVM back-end classifiers. Our experiments on 5 languages from OGI-TS Multilanguage task prove our claim.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131975628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data processing in studying the temporomandibular joint, using MR imaging and sonographic techniques","authors":"O. Liberda, K. Bartusek, Z. Smékal, J. Mikulka","doi":"10.1109/ICDSP.2009.5201218","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201218","url":null,"abstract":"The temporomandibular joint is one of the most complicated joints in the human body. Diagnosing its disorders is difficult because the pain is mistakenly taken for toothache, pain in the jaw bones, etc. The paper deals with a post-processing tomographic examination of temporomandibular joint. An interesting post-processing method was used to increase the contrast related to relaxation time T2. Both increasing the contrast and enhancing the arthritic region were realized by processing two MR images in different echo-times. The enhancement rate is realized on the basis of subjective MR image evaluation by the surgeon. Magnetic resonance images (MRIs) are of very low resolution and contrast. An appropriate algorithm has been found, which consists of pre-processing the image by a smoothing filter, focusing, and four-phase level set segmentation. This method segments the image on the basis of the intensity of respective regions and is thus suitable to be applied to the above MR images, in which no sharp edges occur.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"23 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133918698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
K. Mankodiya, S. Vogt, A. Kundu, M. Klostermann, J. Pohl, A. Ayoub, H. Gehring, U. Hofmann
{"title":"Portable electrophysiologic monitoring based on the OMAP-family processor from a beginners' prospective","authors":"K. Mankodiya, S. Vogt, A. Kundu, M. Klostermann, J. Pohl, A. Ayoub, H. Gehring, U. Hofmann","doi":"10.1109/ICDSP.2009.5201222","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201222","url":null,"abstract":"We present here the experiences made on taking some first steps into the world of dual-core embedded processor technology. The processors we consider here consist of one DSP core and one general-purpose ARM core as are commonly used today by large mobile phone manufacturers. We describe some beginner's problems we encountered with Texas Instruments' OMAP platform and suggest some ways to avoid these common pitfalls for others. As an example application, we also introduce a new portable biosignal monitoring device that was the motivation for using a dual-core embedded processor in the first place.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134368892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Diffusion of time-varying signals in cortical networks","authors":"F. Rodrigues, L. D. Costa","doi":"10.1109/ICDSP.2009.5201115","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201115","url":null,"abstract":"The relationship between the structure and function of cortical networks is analyzed in terms of signal transmission between different cortical regions in the brains of cat and macaque, as modeled by the fundamental dynamics of diffusion. We investigated the relationship between modular network organization and diffused signal reception and verified that cortical areas in the same topological communities tend to receive signals with similar alterations. In addition, we modeled the diffusion dynamics on the network by a FIR filter whose coefficients correspond to the number of walks of different lengths between the source and destination nodes. Such an approach underlies the possibility to recover, at the destination node, the original signal provided the distribution of paths is known. We verified that the system functions obtained for regions belonging to different cortical communities present distinct roots, reinforcing the strict relationship between the structure and function in cortical networks.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130865592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Missing data imputation using compressive sensing techniques for connected digit recognition","authors":"J. Gemmeke, B. Cranen","doi":"10.1109/ICDSP.2009.5201176","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201176","url":null,"abstract":"An effective way to increase the noise robustness of automatic speech recognition is to label noisy speech features as either reliable or unreliable (missing) prior to decoding, and to replace the missing ones by clean speech estimates. We present a novel method based on techniques from the field of Compressive Sensing to obtain these clean speech estimates. Unlike previous imputation frameworks which work on a frame-by-frame basis, our method focuses on exploiting information from a large time-context. Using a sliding window approach, denoised speech representations are constructed using a sparse representation of the reliable features in an overcomplete dictionary of clean, fixed-length speech exemplars. We demonstrate the potential of our approach with experiments on the AURORA-2 connected digit database.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130949210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiscale background modelling and segmentation","authors":"D. Culibrk, V. Crnojevic, Borislav Antic","doi":"10.1109/ICDSP.2009.5201193","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201193","url":null,"abstract":"A new multiscale approach to motion based segmentation of objects in video sequences is presented. While image features extracted at multiple scales are commonly used within the pattern recognition community, they have seldom been employed for background modelling and subtraction. The paper describes a methodology for maintaining an explicit background model at multiple scales. Biological inspiration is used to contrive simple, yet effective mechanisms for feature extraction, incorporation of information across multiple scales and segmentation. Results of experiments conducted using sequences from the domain of traffic surveillance are presented in the paper. They suggest that the proposed method is able to achieve good segmentation results. In addition, the evaluated variant of a multiscale segmentation algorithm is far less computationally intensive, able to achieve processing of higher frame rates in real time and requires an order of magnitude less memory resources than the commonly-used approach compared against.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134644691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Separating and tracking ERP subcomponents by constrained particle filtering","authors":"D. Jarchi, Bahador Makki Abadi, S. Sanei","doi":"10.1109/ICDSP.2009.5201049","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201049","url":null,"abstract":"In this paper a new method based on particle filtering for separating and tracking event related-potential (ERP) subcomponents in different trials is presented. The latency and amplitude of each ERP subcomponent is formulated in the state space model. Based on some knowledge about ERP subcomponents, a constraint on the state space variables is provided to prevent the generation of invalid particles and also make use of a small number of particles which are most effective especially in high dimensions. The method is applied on the simulated and real P300 data. The algorithm has the ability of tracking P300 subcomponents i.e. P3a and P3b, in single trials even in the low signal-to-noise ratio situations.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133820681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Speech Enhancement using Adaptive Empirical Mode Decomposition","authors":"N. Chatlani, J. Soraghan","doi":"10.1109/ICDSP.2009.5201120","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201120","url":null,"abstract":"Speech enhancement is performed in a wide and varied range of instruments and systems. In this paper, a novel approach to Speech Enhancement using Adaptive Empirical Mode Decomposition (SEAEMD) is presented. Spectral analysis of non-stationary signals can be performed by employing techniques such as the STFT and the Wavelet transform (WT), which use predefined basis functions. Empirical Mode Decomposition (EMD) performs very well in such environments. EMD decomposes a signal into a finite number of data-adaptive basis functions, called Intrinsic Mode Functions (IMFs). The new SEAEMD system incorporates this multi-resolution approach with adaptive noise cancellation (ANC) for effective speech enhancement on an IMF level, in stationary and non-stationary noise environments. A comparative performance study is included that compares the competitive method of conventional ANC to the robust SEAEMD system. The results demonstrate that the new system achieves significantly improved speech quality with a lower level of residual noise.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114596236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient voice activity detection in reverberant enclosures using far field microphones","authors":"Theodore Petsatodis, Christos Boukis","doi":"10.1109/ICDSP.2009.5201159","DOIUrl":"https://doi.org/10.1109/ICDSP.2009.5201159","url":null,"abstract":"An algorithm suitable for voice activity detection under reverberant conditions is proposed in this paper. Due to the use of far-filed microphones the proposed solution processes speech signals of highly-varying intensity and signal to noise ratio, that are contaminated with several echoes. The core of the system is a pair of Hidden Markov Models, that effectively model the speech presence and speech absence situations. To minimise mis-detections an adaptive threshold is used, while a hang-over scheme caters for the intra-frame correlation of speech signals. Experimental results conducted in a typical office room using a single far field microphone to support the analysis.","PeriodicalId":409669,"journal":{"name":"2009 16th International Conference on Digital Signal Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117337542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}