T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla
{"title":"用Mel和Bark鳞片的倒谱分析自动检测喉部病变","authors":"T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla","doi":"10.1109/STSIVA.2012.6340567","DOIUrl":null,"url":null,"abstract":"Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.","PeriodicalId":383297,"journal":{"name":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales\",\"authors\":\"T. Villa-Cañas, E. Belalcazar-Bolamos, S. Bedoya-Jaramillo, J. F. Garcés, J. Orozco-Arroyave, J. D. Arias-Londoño, J. Vargas-Bonilla\",\"doi\":\"10.1109/STSIVA.2012.6340567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.\",\"PeriodicalId\":383297,\"journal\":{\"name\":\"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)\",\"volume\":\"89 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/STSIVA.2012.6340567\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 XVII Symposium of Image, Signal Processing, and Artificial Vision (STSIVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STSIVA.2012.6340567","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic detection of laryngeal pathologies using cepstral analysis in Mel and Bark scales
Problems in voice production can appear due to functional disorders and laryngeal pathologies. The presence of laryngeal pathologies can causes significant changes in the vibrational patterns of the vocal folds and it is demonstrated that the impact of such pathologies can be reduced through continuous speech therapy. We propose a methodology based on non-parametric cepstral coefficients in Mel and Bark scales. The most relevant features are automatically selected using two algorithms, one is based on Principal Components Analysis (PCA) and other is based on Sequential Floating Features Selection (SFFS). In order to decide whether a voice recording is healthy or pathological, four different classifiers are implemented: linear and quadratic Bayesian, K nearest neighbors and Parzen. The best result was 89.18%, it was obtained from the union between MFCC and BFCC.