{"title":"Enhancement of Speech Signal Segmentation Using Teager Energy Operator","authors":"A. Alimuradov","doi":"10.1109/DSPA51283.2021.9535805","DOIUrl":null,"url":null,"abstract":"The article presents the improved method for speech signal segmentation, which provides an increase in the efficiency of detecting voiced and unvoiced sections, and pauses using the Teager energy operator. The method is based on energy analysis of speech signal fragments using the Teager energy operator with the subsequent analysis of zero-crossing rate and short-term energy of the energy characteristic function. The research was carried out to assess the performance and noise robustness of the proposed method in comparison with methods based on the analysis of zero-crossing rate, short-term energy, and one-dimensional Mahalanobis distance. In accordance with the obtained research results, it was concluded that due to the good susceptibility of the Teager energy operator to changes in signal amplitude and frequency, the improved method provides an enhancement of segmentation of speech signals, including noisy ones. Depending on the requirements for segmentation accuracy, the proposed method provides variability in the values of the first and second kind errors by changing the threshold coefficient.","PeriodicalId":393602,"journal":{"name":"2021 23rd International Conference on Digital Signal Processing and its Applications (DSPA)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 23rd International Conference on Digital Signal Processing and its Applications (DSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSPA51283.2021.9535805","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The article presents the improved method for speech signal segmentation, which provides an increase in the efficiency of detecting voiced and unvoiced sections, and pauses using the Teager energy operator. The method is based on energy analysis of speech signal fragments using the Teager energy operator with the subsequent analysis of zero-crossing rate and short-term energy of the energy characteristic function. The research was carried out to assess the performance and noise robustness of the proposed method in comparison with methods based on the analysis of zero-crossing rate, short-term energy, and one-dimensional Mahalanobis distance. In accordance with the obtained research results, it was concluded that due to the good susceptibility of the Teager energy operator to changes in signal amplitude and frequency, the improved method provides an enhancement of segmentation of speech signals, including noisy ones. Depending on the requirements for segmentation accuracy, the proposed method provides variability in the values of the first and second kind errors by changing the threshold coefficient.