{"title":"Low band continuous speech system for voice pathologies identification","authors":"Hugo Cordeiro, C. Meneses","doi":"10.23919/SPA.2018.8563393","DOIUrl":null,"url":null,"abstract":"This paper describes the impact of the signal bandwidth reduction in the identification of voice pathologies. The implemented systems evaluate the identification of 3 classes divided by healthy subjects, subjects diagnosed with physiological larynx pathologies and subjects diagnosed with neuromuscular larynx pathologies. Continuous speech signals are down-sampled to 4 kHz and the extracted spectral parameters are applied to a GMM classifier. No significant change in accuracy occurs, being possible to conclude that the low frequencies contain sufficient information to allow the classification of pathologies. A second objective is to test the effects of suppressing the voice activity detection and the increasing the analysis window length. In both cases the accuracy increases. In conclusion, a pathological voice identification system based on signals sampled at 4 kHz, without voice activity detection and with an analysis window length of 40 ms is proposed, getting 81.8% accuracy. The proposed system has also the advantage of reduces the storage memory and the processing time.","PeriodicalId":265587,"journal":{"name":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","volume":"1 4, Part 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/SPA.2018.8563393","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
This paper describes the impact of the signal bandwidth reduction in the identification of voice pathologies. The implemented systems evaluate the identification of 3 classes divided by healthy subjects, subjects diagnosed with physiological larynx pathologies and subjects diagnosed with neuromuscular larynx pathologies. Continuous speech signals are down-sampled to 4 kHz and the extracted spectral parameters are applied to a GMM classifier. No significant change in accuracy occurs, being possible to conclude that the low frequencies contain sufficient information to allow the classification of pathologies. A second objective is to test the effects of suppressing the voice activity detection and the increasing the analysis window length. In both cases the accuracy increases. In conclusion, a pathological voice identification system based on signals sampled at 4 kHz, without voice activity detection and with an analysis window length of 40 ms is proposed, getting 81.8% accuracy. The proposed system has also the advantage of reduces the storage memory and the processing time.