{"title":"Cooperative modular neural predictive coding","authors":"M. Chetouani, B. Gas, J. Zarader","doi":"10.1109/NNSP.2003.1318063","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318063","url":null,"abstract":"Speech feature extraction is one of the most important stage in the speech recognition process. In this paper, we propose a new neural networks architecture called the cooperative modular neural predictive coding (CMNPC). It is based on the interaction of discriminant experts DFE-NPC (discriminant feature extraction) optimized for macro-classification by the help of a criterion: the modelisation error ratio (MER). We propose a theoretical validation of this model by linking The MER with a likelihood ratio. The performances of this architecture are estimated in a phoneme recognition task. The phonemes are extracted from the Darpa-Timit speech database. Comparisons with coding methods (LPC, MFCC, PLP) are presented. They put in obviousness an improvement of the recognition rates.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123111521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recognition of isolated handwritten Persian/Arabic characters and numerals using support vector machines","authors":"A. Mowlaei, K. Faez","doi":"10.1109/NNSP.2003.1318054","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318054","url":null,"abstract":"We propose a system for recognition of isolated handwritten Persian/Arabic characters and numerals. Wavelet transform has been used for feature extraction in this system using Haar wavelet. The support vector machine (SVM), which is a new learning machine with very good generalization ability, and has been used widely in pattern recognition and regression estimation, uses as classifier in this system. The training and test patterns were gathered from various people with different ages and different educational backgrounds. The 32 characters in Persian language were categorized into 8 different classes in which characters of each class are very similar to each other. There are ten digits in Persian/Arabic languages where two of them are not used in zip codes in Iran. So, we have 8 different extra classes for digits. This system was used for recognizing the isolated handwritten postal addresses, which contain the name of cities and their zip codes. Our database contains 579 postal addresses in Iran. The system yields the recognition rate of 98.96% for these postal addresses. The results show an increment in recognition rates in comparison with our previous work in which we used the MLP neural network as classifier.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"205 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123254909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ART-R: a novel reinforcement learning algorithm using an ART module for state representation","authors":"L. Brignone, M. Howarth","doi":"10.1109/NNSP.2003.1318082","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318082","url":null,"abstract":"The work introduces a neural network (NN) algorithm capable of merging the fast and stable learning behaviour offered by the adaptive resonance theory (ART) and the advantageous properties of a reinforcement learning agent. The result is ART-R a neural algorithm particularly suited to learning state-action mappings in control applications. A real time example addressing a typical problem found in autonomous robotic assembly is discussed to highlight the achievement of unsupervised and fast learning of an optimal behaviour.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115600377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural speech enhancement in the time-frequency domain","authors":"M. Volkmer","doi":"10.1109/NNSP.2003.1318061","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318061","url":null,"abstract":"A pulsed neural network approach for the enhancement of speech is presented. It employs a recently introduced pulse-induced masking principle on a joint time-frequency representation of signals. Based on the notion of a spectro-temporal receptive field, the principle provides signal-dependent dictionary elements, which are integrated into an enhancement process in the time-frequency domain. Experiments on signals with different types of noise are conducted with the resulting neural time-frequency domain method.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"108 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120991657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A self-organizing method for map reconstruction","authors":"I. K. Altinel, N. Aras, B. Oommen","doi":"10.1109/NNSP.2003.1318067","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318067","url":null,"abstract":"A variety of problems in geographical and satellite-based remote sensing signal processing, and in the area of \"zero-error\" pattern recognition dealing with processing the information contained in the distances between the points in the geographical or feature space. In this paper we consider one such problem, namely, that of reconstructing the points in the geographical or feature space, when we are only given the approximate distances between the points themselves. In particular, we are interested in the problem of reconstructing a map when the given data is the set of intercity road travel distances. Reported solution approaches primarily involve multi-dimensional scaling techniques. However, we propose a self-organizing method. The new method is tested and compared with the classical multi-dimensional scaling and ALSCAL on different data sets obtained from various countries.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116713189","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive room acoustic response simulation: a virtual 3D application","authors":"G. Costantini, D. Casali, A. Uncini","doi":"10.1109/NNSP.2003.1318066","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318066","url":null,"abstract":"In this paper we propose a method to simulate a 3D acoustical environment in which sound sources are positioned in well defined sides. Our method is real-time applications oriented, due to the low computational cost of the implemented operations. The spatial position that the human brain assigns to a sound is influenced mainly by the differences between the sound signals that reach the listener's ears, related to the sound source angulation with respect to the listener's head. The reverberation effect, on the other side, depends on the type of environment. All this elements have to be simulated in order to give the illusion that a sound comes from a particular position in a particular environment. To obtain this result, we perform a suitable sound processing, that can be separated in two main tasks: reverberation and spatialization. The first one is mainly related to the environment itself: it depends on the shape of the environment and on the absorption coefficients of the walls. This is the most computational intensive component, if we want to reproduce it accurately, so we approximate it by an adaptive IIR filter. By the spatialization, the listener hears the sound as coming from a particular direction. This task, carried out by using the head related transfer functions (HRTFs), has to be applied to every sound source differently.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127930134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Barros, J. Príncipe, Y. Takeuchi, C. H. Sales, N. Ohnishi
{"title":"An algorithm based on the even moments of the error","authors":"A. Barros, J. Príncipe, Y. Takeuchi, C. H. Sales, N. Ohnishi","doi":"10.1109/NNSP.2003.1318087","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318087","url":null,"abstract":"We propose an algorithm based on a linear combination of the even moments of the error for adaptive filtering, called weighted even moment (WEM) algorithm. It is similar to the well-known least mean square (LMS) and to the family of algorithms proposed by Walach and Widrow (1994). This later ones were shown to behave poorer than the LMS, however, when the noise was Gaussian. We study the WEM algorithm convergence behavior and deduce equations for the misadjustment and the learning time. The results showed that the WEM had better performance than the LMS when the noise had a Gaussian distribution.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122124845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Local Hammerstein modeling based on self-organizing map","authors":"Jeongho Cho, J. Príncipe, M. Motter","doi":"10.1109/NNSP.2003.1318080","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318080","url":null,"abstract":"This work presents a method to determine a local polynomial model from a finite number of measurements of the inputs and outputs for Hammerstein systems which are a zero-memory nonlinearity followed by a linear filter. Self-organizing map (SOM) is utilized to cluster the dynamics in the input-output joint space, where processing-elements (PEs) are extended with local models to enable the original algorithm to learn input-output relationships with reasonable accuracy. Moreover, in order to increase the approximation accuracy, local models are built by polynomial models instead of just linear models. The identification method is applied to two simulation examples of a discrete-time system and compared with other neural networks-based alternatives to demonstrate the performance and efficiency of the proposed technique.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129042540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian and RBF structures for wireless communications detection","authors":"L. M. San-José-Revuelta, Jesús Cid-Sueiro","doi":"10.1109/NNSP.2003.1318074","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318074","url":null,"abstract":"This work presents two different algorithms for multiuser detection in wireless DS/CDMA environments. First, a Bayesian detector which implements merging techniques, based on natural computation selection strategies, for complexity limitation, is analyzed, and, second, a low complexity radial basis function-based detector is presented. Both approaches share in common a low computational load and the capability to be implemented even with a high number of active users, since their complexity does not increase exponentially with it. Their performance and characteristics are compared with those of traditional multiuser detectors, such as the matched filter, the decorrelator and the MMSE detector, as well as with other low complexity detectors based on evolutionary computation methods.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129306649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tracking of feature points in a scene of moving rigid objects by Bayesian switching structure model with particle filter","authors":"N. Ikoma, Yasutake Miyahara, H. Maeda","doi":"10.1109/NNSP.2003.1318071","DOIUrl":"https://doi.org/10.1109/NNSP.2003.1318071","url":null,"abstract":"Causal estimation of multiple feature points trajectories by using a switching state space model is proposed. The state vector of the model consists of the position of each feature point, the velocity of each rigid object, and some indicator variables for each feature point. Ther are two types of indicator variables: an object indicator representing the association between the feature point and rigid object, and an aperture indicator representing the attribute of the point, e.g. aperture or not. By estimating the state vector using a Rao-Blackwellized particle filter, smooth trajectories of feature points, velocity of objects, object indicators, and aperture indicators are obtained simultaneously. Performance on a real image sequence is presented by comparing to a Kalman filter being given true indicators.","PeriodicalId":315958,"journal":{"name":"2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126199118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}