2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)最新文献

筛选
英文 中文
Methodology for voice commands recognition using stochastic classifiers 基于随机分类器的语音命令识别方法
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340559
W. A. Bedoya, L. D. Muñoz
{"title":"Methodology for voice commands recognition using stochastic classifiers","authors":"W. A. Bedoya, L. D. Muñoz","doi":"10.1109/STSIVA.2012.6340559","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340559","url":null,"abstract":"The incidence of people with motor disabilities in Colombia is around 6.4%, which is a major social problem, because people with such disabilities lose their autonomy to perform basic actions such as displacement. Therefore, we propose a solution to the problem of mobility in people with motor disabilities, allowing to take control of the engines, with a voice comand recognition system. This paper presents a methodology for recognition of isolated spanish words (silla, atrás, adelante, derecha, izquierda, pare). To this end, we use a methodology based on the wavelet transform preprocessing. The characterization of the filtered signal is performed by Mel Cepstral Coefficients and classification stage using hidden Markov models. The methodology has proven to be robust, because the databases used for training the system have been acquired in noisy environments as well as controlled, presenting performances in classification acuracy of 98%.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124963360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Relevance and redundancy analysis of PCG signals in the detection of heart murmurs by ANOVA 用方差分析分析PCG信号在心音检测中的相关性和冗余性
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340580
L. Velásquez-Martínez, S. Murillo-Rendón, G. Castellanos-Domínguez
{"title":"Relevance and redundancy analysis of PCG signals in the detection of heart murmurs by ANOVA","authors":"L. Velásquez-Martínez, S. Murillo-Rendón, G. Castellanos-Domínguez","doi":"10.1109/STSIVA.2012.6340580","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340580","url":null,"abstract":"One of the most important operating problems of the human heart valves is the murmur, which can be detected by auscultation employing an stethoscope. In order to design an aided diagnosis system for detecting the mentioned pathology, it is required to establish the discriminat capacity of the calculated features, which are used during the decision making stage. In this work, the analysis of variance (ANOVA) is employed to select the relevant features from an original feature set. The analysis allows to reduce the feature space dimension without decrementing the classification performance of the automatic system, compared to the classification performance obtained when the full feature set is employed.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127277338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Measurement system for acoustic characterization of materials at normal sound incidence 正常声入射下材料声学特性测量系统
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340578
J. Gil, D. Giraldo, E. R. Cordoba, A. M. Cárdenas
{"title":"Measurement system for acoustic characterization of materials at normal sound incidence","authors":"J. Gil, D. Giraldo, E. R. Cordoba, A. M. Cárdenas","doi":"10.1109/STSIVA.2012.6340578","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340578","url":null,"abstract":"The research of sound behavior in materials made possible to design the correct acoustic systems, but experimental studies offers better options than theoretical studies. Thus, a measurements system is necessary to obtain these characteristics that we are looking for. In this paper, we show the construction of an acoustic measurement system of materials and how for obtaining the complex reflection factor at normal sound incidence, based on the impedance tube prototype and the development of an algorithm that relate the signals of two microphones of the system. The method of calculation, construction and measurement are based on the procedure recommended by the ISO10534-2, which reaches a transfer function between two signals of sound pressure measures in an impedance tube. Simulations were performed with fictional materials to prove the functionality of the algorithm and his coherence in the results, and then implement it with real experimental signals in the range between 280Hz and 2465Hz. In addition to showing the results of measurement, we analyze the physical phenomena inside the tube that influences the measurement and determinate the corresponding corrections to make in the algorithm.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130485674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning algorithms for real time arrhythmias detection in portable cardiac devices: microcontroller implementation and comparative analysis 便携式心脏设备中实时心律失常检测的机器学习算法:微控制器实现和比较分析
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340556
S. Rúa, S. Zuluaga, A. Redondo, A. Orozco-Duque, J. Restrepo, J. Bustamante
{"title":"Machine learning algorithms for real time arrhythmias detection in portable cardiac devices: microcontroller implementation and comparative analysis","authors":"S. Rúa, S. Zuluaga, A. Redondo, A. Orozco-Duque, J. Restrepo, J. Bustamante","doi":"10.1109/STSIVA.2012.6340556","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340556","url":null,"abstract":"This paper presents the development of two machine learning algorithms on a 32-bit ARM® Cortex® M4 microcontroller core from Freescale Semiconductors. A neural network (ANN) and a support vector machine (SVM) were implemented for real time detection of ventricular tachycardia (VT) and ventricular fibrillation (VF), and they were compared in terms of accuracy. In the feature extraction step a Fast Wavelet Transform (FWT) was used; which was analyzed using the time-frequency characteristics of energy in each sub-band frequency. For the training and validation algorithms, signals from MIT-BIH database with normal sinus rhythm, VF and VT in a time window of 2 seconds were used. Validation results achieve test accuracy of 99.46% by ANN and SVM in VT/VF detection.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127623283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Infrared thermal image segmentation using expectation-maximization-based clustering 基于期望最大化聚类的红外热图像分割
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340586
T. J. Ramírez-Rozo, J. García-Álvarez, C. Castellanos-Dominguez
{"title":"Infrared thermal image segmentation using expectation-maximization-based clustering","authors":"T. J. Ramírez-Rozo, J. García-Álvarez, C. Castellanos-Dominguez","doi":"10.1109/STSIVA.2012.6340586","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340586","url":null,"abstract":"In infrared (IR) based non-destructive and evaluation tests (NDT&E) for automated fault detection and identification processes, the segmentation task is a crucial stage. In fact, thermal imaging gives vital condition information of equipment and structures. So, pattern recognition algorithms can perform an accurate diagnosis, through an adequate segmentation. In this paper the Expectation Maximization Clustering (EM-Clustering) segmentation is evaluated for IR images, using as reference watershed transform-based segmentation. IR images were acquired from a test rig of an operating motor at Vibrations Laboratory. Proposed Clustering based segmentation performance is assessed by Dice's coefficient metric, obtaining an average 0.87 Dice's coefficient value. Demonstrating that EM-Clustering Segmentation is a valid choice for IR image processing.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128840030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
RGB profiles in digital image analysis for the description of interference colors produced on photoelasticity studies of the plastic film deformation RGB轮廓在数字图像分析中用于描述干涉色,对塑料薄膜的光弹性变形进行了研究
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340597
J. C. Briñez, Francisco-Eugenio Lopez Giraldo, A. Martinez
{"title":"RGB profiles in digital image analysis for the description of interference colors produced on photoelasticity studies of the plastic film deformation","authors":"J. C. Briñez, Francisco-Eugenio Lopez Giraldo, A. Martinez","doi":"10.1109/STSIVA.2012.6340597","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340597","url":null,"abstract":"Over the last years image analysis has taken great positioning in the automation of industrial processes, digital photoelasticity employs techniques supported in the digital image analysis applied to studies based on measuring of the intensity of light and the reconstruction of the phase maps generated in the transmission of light through the studied material. This paper proposes the implementation of profiles RGB image analysis to describe the behavior of the interference colors produced in photoelasticity studies during deformation of plastic films, these techniques aim to study the photoelastic characteristics of plastic films starting from the analysis of the content of an image generated through a pattern of polarization. Plastic films samples are subjected to tensile mechanical, optical polarization assemblies are used for observing the phenomenon of photoelasticity, a digital camera is used to capture changes in interference colors, and RGB profiles are used for image analysis. The use of the implemented technique allows to identify the transition points for the change of order in the colors of interference, and to describe the behavior of the color stripes in the same order.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126940505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales 用Mel和Bark鳞片的倒谱分析自动检测喉部病变
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340567
T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla
{"title":"Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales","authors":"T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla","doi":"10.1109/STSIVA.2012.6340567","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340567","url":null,"abstract":"Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125861173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Recognition and classification of numerical labels using digital image processing techniques 使用数字图像处理技术的数字标签识别和分类
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340592
T. Arrighi, J. E. Rojas, J. Soto, C. Madrigal, J. A. Londoño
{"title":"Recognition and classification of numerical labels using digital image processing techniques","authors":"T. Arrighi, J. E. Rojas, J. Soto, C. Madrigal, J. A. Londoño","doi":"10.1109/STSIVA.2012.6340592","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340592","url":null,"abstract":"This article describes the methodology used for the automatic classification of finished products at Familia Sancela Company, Medellin plant, by visual recognition of numeric codes labels, printed on their packaging, before the stowage and storage procedures. Based on the morphology and package design and techniques using digital image processing and artificial vision, it seeks to graphically detect a numeric label that encodes the product, whose characters are framed in a box. For this, an image preprocessing by thresholding, are the outlines of the image and using the polynomial approximation method detected the rectangle that frames the numerical code, this region is applied an orientation correction algorithm, it is a segmentation of each digit in individual images and finally apply the algorithm of Optical Character Recognition (OCR), which determines the value of the character by comparing the Euclidean distances between the projection of the character and the established databases. The implementation of this automation results in an optimization in the packaging procedure as well as decrease of time, costs and errors. All processing is done using the computer vision library, OpenCV and cvBlobsLib, in the development platform Microsoft Visual Studio C + + 2010.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"226 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121889294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Preliminary studies on the taxonomy of object's tracking algorithms in video sequences 视频序列中目标跟踪算法分类的初步研究
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340574
A. M. Ocaña, F. Calderon
{"title":"Preliminary studies on the taxonomy of object's tracking algorithms in video sequences","authors":"A. M. Ocaña, F. Calderon","doi":"10.1109/STSIVA.2012.6340574","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340574","url":null,"abstract":"Different techniques for tracking objects in controlled environments using video cameras have been proposed. These state of the art algorithms are focused especially on how to find a better segmentation of the tracking object and also on how to make this segmentation stable through time, regardless of temporal changes on the morphology of the object. Unlike any of that, this article reviews the state of the art, focusing on algorithms for segmentation of the scene and of tracking objects, then addresses the previous steps in the creation of a binary image that segments the objects and convert them into useful data, found frame by frame to be used afterwards for tracking. The intention is to classify the methods of temporal matching between the binary images which are the outcome of the segmentation of foreground and background into general groups, in order to give an organized starting point to the advances made regarding the tracking of moving objects with fixed cameras and to be able to adapt faster to the implementation of tracking on the new advances in specific techniques in the field of the proposed taxonomy.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129926992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Feature selection for hypernasality detection using PCA, LDA, kernel PCA and greedy kernel PCA 基于PCA、LDA、核PCA和贪婪核PCA的鼻音检测特征选择
2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA) Pub Date : 2012-11-12 DOI: 10.1109/STSIVA.2012.6340591
E. Belalcázar-Bolaños, T. Villa-Cañas, S. Bedoya-Jaramillo, J. F. Garces-Rodriguez, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla
{"title":"Feature selection for hypernasality detection using PCA, LDA, kernel PCA and greedy kernel PCA","authors":"E. Belalcázar-Bolaños, T. Villa-Cañas, S. Bedoya-Jaramillo, J. F. Garces-Rodriguez, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla","doi":"10.1109/STSIVA.2012.6340591","DOIUrl":"https://doi.org/10.1109/STSIVA.2012.6340591","url":null,"abstract":"Cleft lip and palate, due to morphological problems, allow the passage of air through the nasal cavity, introducing inappropriate nasal resonance during speech production and resulting in hypernasality speech. This paper proposes a methodology based on spectral and cepstral features, such as Modified Group Delay Functions with Mel Frequency Cepstral Coefficients, and uses relevance analysis and redundancy elimination, allowing the automatic hypernsality detection. The methodology seeks to evaluate four kinds of selection techniques: LDA (Linear Discriminator Analysis), PCA (Principal Component Analysis), Kernel PCA and Greedy Kernel PCA which provide a lot of information in the detection process and in turn contain the lowest value of redundancy. The experiments were performed considering a database which includes the five Spanish vowels uttered by 130 children whose voices were diagnosed as hypernasal by a phoniatrics expert plus 108 healthy were analyzed.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129584396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信