{"title":"基于Hilbert包络单通道处理的激励Epoch和语音检测","authors":"H. Dasgupta, P. C. Pandey","doi":"10.1109/NCC55593.2022.9806818","DOIUrl":null,"url":null,"abstract":"A technique is presented for excitation epoch and voicing detection in speech signal using its Hilbert envelope and employing single-pass processing. The excitation epoch detection comprises dynamic range compression for reducing amplitude variability, Hilbert envelope calculation and dynamic peak detection for excitation saliency enhancement, and epoch marking by locating the maximum-sum subarray peaks. The voicing detection is based on thresholding the inter-epoch similarity calculated as the normalized covariance of the adjacent inter-epoch intervals of the Hilbert envelope. The total algorithmic delay is less than 60 ms. The epoch detection and the voicing detection for clean and telephone-quality speech showed a good match with those obtained from the EGG, and the detection performances compared favorably with the earlier techniques.","PeriodicalId":403870,"journal":{"name":"2022 National Conference on Communications (NCC)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Excitation Epoch and Voicing Detection Using Hilbert Envelope with Single-Pass Processing\",\"authors\":\"H. Dasgupta, P. C. Pandey\",\"doi\":\"10.1109/NCC55593.2022.9806818\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A technique is presented for excitation epoch and voicing detection in speech signal using its Hilbert envelope and employing single-pass processing. The excitation epoch detection comprises dynamic range compression for reducing amplitude variability, Hilbert envelope calculation and dynamic peak detection for excitation saliency enhancement, and epoch marking by locating the maximum-sum subarray peaks. The voicing detection is based on thresholding the inter-epoch similarity calculated as the normalized covariance of the adjacent inter-epoch intervals of the Hilbert envelope. The total algorithmic delay is less than 60 ms. The epoch detection and the voicing detection for clean and telephone-quality speech showed a good match with those obtained from the EGG, and the detection performances compared favorably with the earlier techniques.\",\"PeriodicalId\":403870,\"journal\":{\"name\":\"2022 National Conference on Communications (NCC)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC55593.2022.9806818\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC55593.2022.9806818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Excitation Epoch and Voicing Detection Using Hilbert Envelope with Single-Pass Processing
A technique is presented for excitation epoch and voicing detection in speech signal using its Hilbert envelope and employing single-pass processing. The excitation epoch detection comprises dynamic range compression for reducing amplitude variability, Hilbert envelope calculation and dynamic peak detection for excitation saliency enhancement, and epoch marking by locating the maximum-sum subarray peaks. The voicing detection is based on thresholding the inter-epoch similarity calculated as the normalized covariance of the adjacent inter-epoch intervals of the Hilbert envelope. The total algorithmic delay is less than 60 ms. The epoch detection and the voicing detection for clean and telephone-quality speech showed a good match with those obtained from the EGG, and the detection performances compared favorably with the earlier techniques.