{"title":"Methodology for voice commands recognition using stochastic classifiers","authors":"W. A. Bedoya, L. D. Muñoz","doi":"10.1109/STSIVA.2012.6340559","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340559","url":null,"abstract":"The incidence of people with motor disabilities in Colombia is around 6.4%, which is a major social problem, because people with such disabilities lose their autonomy to perform basic actions such as displacement. Therefore, we propose a solution to the problem of mobility in people with motor disabilities, allowing to take control of the engines, with a voice comand recognition system. This paper presents a methodology for recognition of isolated spanish words (silla, atrás, adelante, derecha, izquierda, pare). To this end, we use a methodology based on the wavelet transform preprocessing. The characterization of the filtered signal is performed by Mel Cepstral Coefficients and classification stage using hidden Markov models. The methodology has proven to be robust, because the databases used for training the system have been acquired in noisy environments as well as controlled, presenting performances in classification acuracy of 98%.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124963360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
L. Velásquez-Martínez, S. Murillo-Rendón, G. Castellanos-Domínguez
{"title":"Relevance and redundancy analysis of PCG signals in the detection of heart murmurs by ANOVA","authors":"L. Velásquez-Martínez, S. Murillo-Rendón, G. Castellanos-Domínguez","doi":"10.1109/STSIVA.2012.6340580","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340580","url":null,"abstract":"One of the most important operating problems of the human heart valves is the murmur, which can be detected by auscultation employing an stethoscope. In order to design an aided diagnosis system for detecting the mentioned pathology, it is required to establish the discriminat capacity of the calculated features, which are used during the decision making stage. In this work, the analysis of variance (ANOVA) is employed to select the relevant features from an original feature set. The analysis allows to reduce the feature space dimension without decrementing the classification performance of the automatic system, compared to the classification performance obtained when the full feature set is employed.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127277338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Measurement system for acoustic characterization of materials at normal sound incidence","authors":"J. Gil, D. Giraldo, E. R. Cordoba, A. M. Cárdenas","doi":"10.1109/STSIVA.2012.6340578","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340578","url":null,"abstract":"The research of sound behavior in materials made possible to design the correct acoustic systems, but experimental studies offers better options than theoretical studies. Thus, a measurements system is necessary to obtain these characteristics that we are looking for. In this paper, we show the construction of an acoustic measurement system of materials and how for obtaining the complex reflection factor at normal sound incidence, based on the impedance tube prototype and the development of an algorithm that relate the signals of two microphones of the system. The method of calculation, construction and measurement are based on the procedure recommended by the ISO10534-2, which reaches a transfer function between two signals of sound pressure measures in an impedance tube. Simulations were performed with fictional materials to prove the functionality of the algorithm and his coherence in the results, and then implement it with real experimental signals in the range between 280Hz and 2465Hz. In addition to showing the results of measurement, we analyze the physical phenomena inside the tube that influences the measurement and determinate the corresponding corrections to make in the algorithm.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130485674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Rúa, S. Zuluaga, A. Redondo, A. Orozco-Duque, J. Restrepo, J. Bustamante
{"title":"Machine learning algorithms for real time arrhythmias detection in portable cardiac devices: microcontroller implementation and comparative analysis","authors":"S. Rúa, S. Zuluaga, A. Redondo, A. Orozco-Duque, J. Restrepo, J. Bustamante","doi":"10.1109/STSIVA.2012.6340556","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340556","url":null,"abstract":"This paper presents the development of two machine learning algorithms on a 32-bit ARM® Cortex® M4 microcontroller core from Freescale Semiconductors. A neural network (ANN) and a support vector machine (SVM) were implemented for real time detection of ventricular tachycardia (VT) and ventricular fibrillation (VF), and they were compared in terms of accuracy. In the feature extraction step a Fast Wavelet Transform (FWT) was used; which was analyzed using the time-frequency characteristics of energy in each sub-band frequency. For the training and validation algorithms, signals from MIT-BIH database with normal sinus rhythm, VF and VT in a time window of 2 seconds were used. Validation results achieve test accuracy of 99.46% by ANN and SVM in VT/VF detection.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127623283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. J. Ramírez-Rozo, J. García-Álvarez, C. Castellanos-Dominguez
{"title":"Infrared thermal image segmentation using expectation-maximization-based clustering","authors":"T. J. Ramírez-Rozo, J. García-Álvarez, C. Castellanos-Dominguez","doi":"10.1109/STSIVA.2012.6340586","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340586","url":null,"abstract":"In infrared (IR) based non-destructive and evaluation tests (NDT&E) for automated fault detection and identification processes, the segmentation task is a crucial stage. In fact, thermal imaging gives vital condition information of equipment and structures. So, pattern recognition algorithms can perform an accurate diagnosis, through an adequate segmentation. In this paper the Expectation Maximization Clustering (EM-Clustering) segmentation is evaluated for IR images, using as reference watershed transform-based segmentation. IR images were acquired from a test rig of an operating motor at Vibrations Laboratory. Proposed Clustering based segmentation performance is assessed by Dice's coefficient metric, obtaining an average 0.87 Dice's coefficient value. Demonstrating that EM-Clustering Segmentation is a valid choice for IR image processing.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128840030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. C. Briñez, Francisco-Eugenio Lopez Giraldo, A. Martinez
{"title":"RGB profiles in digital image analysis for the description of interference colors produced on photoelasticity studies of the plastic film deformation","authors":"J. C. Briñez, Francisco-Eugenio Lopez Giraldo, A. Martinez","doi":"10.1109/STSIVA.2012.6340597","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340597","url":null,"abstract":"Over the last years image analysis has taken great positioning in the automation of industrial processes, digital photoelasticity employs techniques supported in the digital image analysis applied to studies based on measuring of the intensity of light and the reconstruction of the phase maps generated in the transmission of light through the studied material. This paper proposes the implementation of profiles RGB image analysis to describe the behavior of the interference colors produced in photoelasticity studies during deformation of plastic films, these techniques aim to study the photoelastic characteristics of plastic films starting from the analysis of the content of an image generated through a pattern of polarization. Plastic films samples are subjected to tensile mechanical, optical polarization assemblies are used for observing the phenomenon of photoelasticity, a digital camera is used to capture changes in interference colors, and RGB profiles are used for image analysis. The use of the implemented technique allows to identify the transition points for the change of order in the colors of interference, and to describe the behavior of the color stripes in the same order.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126940505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla
{"title":"Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales","authors":"T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla","doi":"10.1109/STSIVA.2012.6340567","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340567","url":null,"abstract":"Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125861173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Arrighi, J. E. Rojas, J. Soto, C. Madrigal, J. A. Londoño
{"title":"Recognition and classification of numerical labels using digital image processing techniques","authors":"T. Arrighi, J. E. Rojas, J. Soto, C. Madrigal, J. A. Londoño","doi":"10.1109/STSIVA.2012.6340592","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340592","url":null,"abstract":"This article describes the methodology used for the automatic classification of finished products at Familia Sancela Company, Medellin plant, by visual recognition of numeric codes labels, printed on their packaging, before the stowage and storage procedures. Based on the morphology and package design and techniques using digital image processing and artificial vision, it seeks to graphically detect a numeric label that encodes the product, whose characters are framed in a box. For this, an image preprocessing by thresholding, are the outlines of the image and using the polynomial approximation method detected the rectangle that frames the numerical code, this region is applied an orientation correction algorithm, it is a segmentation of each digit in individual images and finally apply the algorithm of Optical Character Recognition (OCR), which determines the value of the character by comparing the Euclidean distances between the projection of the character and the established databases. The implementation of this automation results in an optimization in the packaging procedure as well as decrease of time, costs and errors. All processing is done using the computer vision library, OpenCV and cvBlobsLib, in the development platform Microsoft Visual Studio C + + 2010.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"226 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121889294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Preliminary studies on the taxonomy of object's tracking algorithms in video sequences","authors":"A. M. Ocaña, F. Calderon","doi":"10.1109/STSIVA.2012.6340574","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340574","url":null,"abstract":"Different techniques for tracking objects in controlled environments using video cameras have been proposed. These state of the art algorithms are focused especially on how to find a better segmentation of the tracking object and also on how to make this segmentation stable through time, regardless of temporal changes on the morphology of the object. Unlike any of that, this article reviews the state of the art, focusing on algorithms for segmentation of the scene and of tracking objects, then addresses the previous steps in the creation of a binary image that segments the objects and convert them into useful data, found frame by frame to be used afterwards for tracking. The intention is to classify the methods of temporal matching between the binary images which are the outcome of the segmentation of foreground and background into general groups, in order to give an organized starting point to the advances made regarding the tracking of moving objects with fixed cameras and to be able to adapt faster to the implementation of tracking on the new advances in specific techniques in the field of the proposed taxonomy.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129926992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Belalcázar-Bolaños, T. Villa-Cañas, S. Bedoya-Jaramillo, J. F. Garces-Rodriguez, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla
{"title":"Feature selection for hypernasality detection using PCA, LDA, kernel PCA and greedy kernel PCA","authors":"E. Belalcázar-Bolaños, T. Villa-Cañas, S. Bedoya-Jaramillo, J. F. Garces-Rodriguez, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla","doi":"10.1109/STSIVA.2012.6340591","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340591","url":null,"abstract":"Cleft lip and palate, due to morphological problems, allow the passage of air through the nasal cavity, introducing inappropriate nasal resonance during speech production and resulting in hypernasality speech. This paper proposes a methodology based on spectral and cepstral features, such as Modified Group Delay Functions with Mel Frequency Cepstral Coefficients, and uses relevance analysis and redundancy elimination, allowing the automatic hypernsality detection. The methodology seeks to evaluate four kinds of selection techniques: LDA (Linear Discriminator Analysis), PCA (Principal Component Analysis), Kernel PCA and Greedy Kernel PCA which provide a lot of information in the detection process and in turn contain the lowest value of redundancy. The experiments were performed considering a database which includes the five Spanish vowels uttered by 130 children whose voices were diagnosed as hypernasal by a phoniatrics expert plus 108 healthy were analyzed.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129584396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}