Horderlin Vrangel Robles Vega, V. Molina, Luis Martinez
{"title":"VAD algorithms energy-based and spectral-domain applied in River Plate Castilian","authors":"Horderlin Vrangel Robles Vega, V. Molina, Luis Martinez","doi":"10.1109/STSIVA.2016.7743346","DOIUrl":null,"url":null,"abstract":"Because the English and Castilian have marked acoustic and phonetic differences, this paper shows the study of the effectiveness of different algorithms VAD (Voice Activity Detection) literature, applied to the Castilian, especially riplatense. This article is intended to publicize the results achieved to date. In the first part of the document briefly explained the three implemented methods, namely the autocorrelation function short time (STACF), the average magnitude of the differential junction (FDMA) and the linear prediction coefficients (LPC). Immediately, tests and experiments with BEPPA battery to evaluate the effectiveness of these algorithms VAD will be described. In this step 10 sentences were applied in selected Rioplatense Spanish BEPPA battery of each VAD to detect sound segments, they were used without voice and silence. Immediately, the results obtained in the experimental phase is disclosed, evaluate classifications using the confusion matrix of the 10 phrases in 65 words were about 40 segments of silence. Finally, conclusions and future work are described. Clearly that shows that the algorithms have not been implemented show overall efficiency in detecting voice activity in Spanish of the Rio de la Plata. We also found that the algorithms implemented using linear prediction coefficients show better performance.","PeriodicalId":373420,"journal":{"name":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 XXI Symposium on Signal Processing, Images and Artificial Vision (STSIVA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/STSIVA.2016.7743346","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Because the English and Castilian have marked acoustic and phonetic differences, this paper shows the study of the effectiveness of different algorithms VAD (Voice Activity Detection) literature, applied to the Castilian, especially riplatense. This article is intended to publicize the results achieved to date. In the first part of the document briefly explained the three implemented methods, namely the autocorrelation function short time (STACF), the average magnitude of the differential junction (FDMA) and the linear prediction coefficients (LPC). Immediately, tests and experiments with BEPPA battery to evaluate the effectiveness of these algorithms VAD will be described. In this step 10 sentences were applied in selected Rioplatense Spanish BEPPA battery of each VAD to detect sound segments, they were used without voice and silence. Immediately, the results obtained in the experimental phase is disclosed, evaluate classifications using the confusion matrix of the 10 phrases in 65 words were about 40 segments of silence. Finally, conclusions and future work are described. Clearly that shows that the algorithms have not been implemented show overall efficiency in detecting voice activity in Spanish of the Rio de la Plata. We also found that the algorithms implemented using linear prediction coefficients show better performance.