Vishwa Gupta, Gilles Boulianne, Frédéric Osterrath, P. Ouellet
{"title":"Crim's French speech transcription system for ETAPE 2011","authors":"Vishwa Gupta, Gilles Boulianne, Frédéric Osterrath, P. Ouellet","doi":"10.1109/WOSSPA.2013.6602390","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602390","url":null,"abstract":"This paper describes the French broadcast speech transcription system by CRIM for the ETAPE 2011 evaluation. The key elements in this recognizer include over 140,000-word dictionary, 478 hours of audio for training the acoustic models, feature-space MMI and boosted MMI discriminative training of the acoustic models, variable-frame-rate decoding with trigram language model, lattice rescoring with quadgram language model, soft penalty on silence models, confusion network decoding with minimum Bayes risk, and combining multiple recognizers with ROVER. Recognition enhancements after the ETAPE evaluation include discriminative training of the subspace Gaussian mixture models and lattice rescoring with neural net language models.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114744789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel feature extractor employing regularized MVDR spectrum estimator and subband spectrum enhancement technique","authors":"Md. Jahangir Alam, D. O'Shaughnessy, P. Kenny","doi":"10.1109/WOSSPA.2013.6602388","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602388","url":null,"abstract":"This paper presents a novel feature extractor for robust large vocabulary continuous speech recognition (LVCSR) task. For accurate and robust estimation of speech power spectrum we propose to compute the features from the regularized minimum variance distortionless response (regMVDR) spectral estimate instead of the windowed periodogram estimate. A sigmoid shape subband spectrum enhancement technique and a short-time feature normalization, known as short-time mean and scale normalization (STMSN), are also used for robust estimation of the cepstral features for speech recognition task. When evaluated on the AURORA-4 LVCSR corpus proposed feature extractor provides an average relative improvement of 38.5%,35.0%, and 34.3%,30.7%,5.6%, and 7.1% over the MFCC, PLP, MVDR-based MFCC, regMVDR-based MFCC, PNCC and the robust feature extractor of [4], respectively, in terms of the recognition accuracy.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123902105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sensor fault detection and diagnosis in drinking water distribution networks","authors":"Soumia Bouzid, M. Ramdani","doi":"10.1109/WOSSPA.2013.6602395","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602395","url":null,"abstract":"In this work, the local PCA approach is used as a statistical process control tool for drinking water distribution(DWD) systems to detect and isolate sensor faults. The multivariate statistical process monitoring task is carried out by learning a finite mixture model to describe the local statistical behavior in each cluster, followed by the determination of the local statistical confidence limits. The objective of a water distribution system is to convey treated water to consumers through a pressurized network pipe. The aim is diagnosing sensor faults in DWD. Experimental results using a model of an actual water distribution network illustrate the effectiveness of the proposed approach.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128752958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multimodal image fusion of anatomical structures for diagnosis, therapy planning and assistance","authors":"F. Cheriet","doi":"10.1109/WOSSPA.2013.6602332","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602332","url":null,"abstract":"This paper provides an overview of work done in recent years by our research group to fuse multimodal images of the trunk of patients with Adolescent Idiopathic Scoliosis (AIS) treated at Sainte-Justine University Hospital Center (CHU). We first describe our surface acquisition system and introduce a set of clinical measurements (indices) based on the trunk's external shape, to quantify its degree of asymmetry. We then describe our 3D reconstruction system of the spine and rib cage from biplanar radiographs and present our methodology for multimodal fusion of MRI, X-ray and external surface images of the trunk We finally present a physical model of the human trunk including bone and soft tissue for the simulation of the surgical outcome on the external trunk shape in AIS.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129059010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recognition of aggressive human behavior based on SURF and SVM","authors":"A. Ouanane, A. Serir, N. Djelal","doi":"10.1109/WOSSPA.2013.6602398","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602398","url":null,"abstract":"In this paper, we aim to develop a novel decision algorithm of human behavior using both Speeded Up Robust Features (SURF) and PCA techniques. The SURF offers the opportunity to obtain a high level of performance under the constraint of scale variation with low computing coast to form spatio-temporal features. Thus, the PCA algorithm is used to reduce the dimensionality of the provided features to form robust pattern. The latter is performed as an input for training the Support Vector Machine (SVM). This machine is going to be able to classify the aggressive and nonaggressive behaviors. Different tests are conducted on KTH actions datasets. The obtained results have shown that the proposed technique provides more significant accuracy rate in comparison with current techniques as well as it drives more robustness to a dynamic environment.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132076232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Common GPS/Galileo signals: MBOC VS BOC(1,1) performance comparison","authors":"S. Zitouni, D. Chikouche, K. Rouabah","doi":"10.1109/WOSSPA.2013.6602416","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602416","url":null,"abstract":"The Multiplexed Binary Offset Carrier (MBOC) signal has been recommended as optimized modulation on L1/El frequency band by both United States and European Union communities. MBOC signal is defined on the basis of its spectrum, while different time waveforms can be selected as possible modulation candidates. In this paper we present the different options of the new optimized MBOC signal in both time and frequency domains in order to analyze their performances in terms of tracking accuracy, thermal noise and multipath-induced pseudorange error. A comparison between the MBOC options and the baseline signal BOC(1,1) is also presented based on software simulation results to illustrate that the use of MBOC signal for GPS and Galileo satellite navigation systems is an opportunity to provide the best possible performances for evolved receivers.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"4 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134447411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Elhocine Boutellaa, Messaoud Bengherabi, F. Harizi
{"title":"Improving online signature verification by user-specific likelihood ratio score normalization","authors":"Elhocine Boutellaa, Messaoud Bengherabi, F. Harizi","doi":"10.1109/WOSSPA.2013.6602379","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602379","url":null,"abstract":"Online handwritten signature is a behavioral biometric trait with several practical applications. Examples of these applications include access control to personal devices and validation of online transactions. Several research work have been done to improve the performance of online signature verification systems. This paper presents an improvement of a recently proposed online signature verification system by introducing a new user-specific score normalization strategy. This new normalization strategy relies on user-specific log likelihood ratio resulting from the Maximum a Posteriori Adaptation (MAP) of both client and impostor scores modeled a priori by Gaussian mixture distributions. Experimental results on the SUSIG database demonstrate the effectiveness of the proposed strategy. The EER is reduced from 6.2 to 2.8%.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132544919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Low complexity turbo equalization over unknown frequency selective Rayleigh channels","authors":"Berdai Abdellah, J. Chouinard, Loukhaoukha Khaled","doi":"10.1109/WOSSPA.2013.6602384","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602384","url":null,"abstract":"For turbo equalization, the use of Minimum Mean Square Error or trellis-based algorithms give remarkable performances over static channels, particularly when efficient error correcting codes are used. For the selective time varying channels, when the fading rate is sufficiently slow, a separate trained Least Mean Square (LMS) or Recursive Least Square (RLS) channel estimator may be used to inform the equalizer module. However, when the fading rate is high, it is very difficult to estimate the channel with great precision which can significantly degrade the bit error rate (BER). Channel estimators based on the use of the Kalman filter, maximum likelihood and the Wiener filter are efficient for fast fading channels. However, they require knowledge of channel statistics such as Doppler shift and noise variance. In this paper, we focus on turbo equalizers in realistic scenarios (i.e. statistics are unknown). We propose and evaluate a low complexity iterative receiver, integrating the channel statistics estimation. Simulation results show that the proposed receiver can achieve performances near those obtained with known statistics. It is also proved that if the Doppler frequency is set to a value above or below the true value, the BER will significantly degrade.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"217 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133679146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An enhanced technique for roller bearing defect detection using an impulse response wavelet based sparse code shrinkage de-noising algorithm","authors":"M. Boufenar, S. Rechak","doi":"10.1109/WOSSPA.2013.6602400","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602400","url":null,"abstract":"Detection of defects at early stage is crucial to fault prognostics. Periodic impulses indicate the occurrence of faults in roller bearings. However, it is difficult to detect the impulses of initiating defects because they are rather weak and are often immersed in heavy noise. Existing wavelet threshold de-noising methods are not efficient because they use orthogonal wavelets, which do not match correctly the impulse and do not utilize prior information on the impulses. Hence, a Sparse Code Shrinkage (SCS) method based on maximum likelihood estimation (MLE) for thresholding using an adapted wavelet is developed. Based on SCS de-noising, the present method gives an in-depth analysis of the inspected signal even at very low signal to noise ratio (SNR).","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130518274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A review of wavelet denoising in medical imaging","authors":"A. Ouahabi","doi":"10.1109/WOSSPA.2013.6602330","DOIUrl":"https://doi.org/10.1109/WOSSPA.2013.6602330","url":null,"abstract":"In this tutorial, we review recent wavelet denoising techniques for medical ultrasound and for magnetic resonance images. We evaluate their implementation via MATLAB package and discuss their performances in terms of SNR (signal-to-noise ratio) or PSNR (peak signal-to-noise ratio) and visual aspects of image quality. Image denoising using wavelet-based multiresolution analysis requires a delicate compromise between noise reduction and preserving significant image details. Hence, some subtleties associated with these denoising techniques will be explained in detail.","PeriodicalId":417940,"journal":{"name":"2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA)","volume":"163 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115685615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}