{"title":"EMD-Based Noise-Robust Method for Speech/Pause Segmentation","authors":"A. Alimuradov, A. Tychkov","doi":"10.1109/REEPE51337.2021.9388066","DOIUrl":null,"url":null,"abstract":"The article presents a noise-robust method for speech/pause segmentation based on empirical mode decomposition. The method has been developed on the basis of a combined analysis of zero-crossing rate and short-term energy using empirical mode decomposition at the stage of preprocessing. Based on the results of preliminary processing, a set of new investigated signals, containing the most reliable information about the boundaries of the beginning and the end of informative sections of noisy speech, has been formed. The effect of the decomposition method and the influence of fragment duration of the investigated signals on the segmentation efficiency of noisy speech at different signal-to-noise ratio levels, from 20 to -5 dB with a step size of 5 dB, were assessed. The research results have shown a decrease in the values of the first and second kind errors during the segmentation of noisy speech signals using the proposed noise-robust method.","PeriodicalId":272476,"journal":{"name":"2021 3rd International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REEPE51337.2021.9388066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The article presents a noise-robust method for speech/pause segmentation based on empirical mode decomposition. The method has been developed on the basis of a combined analysis of zero-crossing rate and short-term energy using empirical mode decomposition at the stage of preprocessing. Based on the results of preliminary processing, a set of new investigated signals, containing the most reliable information about the boundaries of the beginning and the end of informative sections of noisy speech, has been formed. The effect of the decomposition method and the influence of fragment duration of the investigated signals on the segmentation efficiency of noisy speech at different signal-to-noise ratio levels, from 20 to -5 dB with a step size of 5 dB, were assessed. The research results have shown a decrease in the values of the first and second kind errors during the segmentation of noisy speech signals using the proposed noise-robust method.