Voice Activity Detection Based on Wavelet Packet Transform in Communication Nonlinear Channel
R. Chiodi, D. Massicotte
2009 First International Conference on Advances in Satellite and Space Communications, 2009-07-20
DOI: 10.1109/.27
This paper presents a voice activity detection (VAD) algorithm based on the Wavelet Packet Transform and Teager Energy Operator (TEO) processing. The signal is decomposed into subband signals. We use the multi-resolution analysis property of the Wavelet Transform to extract and analyse the time-frequency components corresponding to speech. To obtain a parameter called the Voice Activity Shape (VAS), we apply TEO processing to each subband signal, which better distinguishes the subbands corresponding to speech. The variances of the TEO-processed subband signals are summed to obtain the VAS, which is higher in speech regions than in non-speech regions. Experimental results show that our VAD performs better than the G.729B VAD, particularly in difficult noise conditions and when the speech signal is passed through a nonlinear communication channel. Results are reported for real speech communications from a spaceship to a terrestrial 3G cellular network, assuming nonlinear interference.
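The VAS computation described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the wavelet, the decomposition depth, the frame length, or the decision threshold, so the sketch assumes a Haar wavelet packet, 3 levels (8 subbands), 256-sample frames, and a synthetic test signal. All function names are hypothetical. It uses the standard discrete Teager energy operator, ψ[x(n)] = x(n)² − x(n−1)·x(n+1).

```python
import math
import random


def haar_wpt(frame, levels):
    """Full Haar wavelet packet decomposition (assumed wavelet).

    Returns 2**levels subband coefficient lists.
    """
    bands = [list(frame)]
    s = 1.0 / math.sqrt(2.0)
    for _ in range(levels):
        nxt = []
        for b in bands:
            idx = range(0, len(b) - 1, 2)
            nxt.append([(b[i] + b[i + 1]) * s for i in idx])  # low-pass half
            nxt.append([(b[i] - b[i + 1]) * s for i in idx])  # high-pass half
        bands = nxt
    return bands


def teager(x):
    """Discrete Teager energy operator: x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def variance(x):
    """Population variance; 0.0 for an empty list."""
    if not x:
        return 0.0
    m = sum(x) / len(x)
    return sum((v - m) ** 2 for v in x) / len(x)


def vas(frame, levels=3):
    """Voice Activity Shape of one frame: summed variance of the TEO subbands."""
    return sum(variance(teager(b)) for b in haar_wpt(frame, levels))


def frame_vas(signal, frame_len=256, levels=3):
    """VAS for consecutive non-overlapping frames (frame length is assumed)."""
    last_start = len(signal) - frame_len
    return [vas(signal[i:i + frame_len], levels)
            for i in range(0, last_start + 1, frame_len)]


def synthetic_signal(fs=8000, n=8000, seed=0):
    """Hypothetical test signal: noise everywhere, a modulated harmonic
    burst (a crude stand-in for voiced speech) in samples 2560..5119."""
    rng = random.Random(seed)
    out = []
    for i in range(n):
        t = i / fs
        x = rng.gauss(0.0, 0.05)  # background noise
        if 2560 <= i < 5120:
            env = 0.5 * (1.0 + math.sin(2 * math.pi * 30 * t))
            x += env * (math.sin(2 * math.pi * 200 * t)
                        + 0.5 * math.sin(2 * math.pi * 400 * t)
                        + 0.25 * math.sin(2 * math.pi * 600 * t))
        out.append(x)
    return out


values = frame_vas(synthetic_signal())
for k, v in enumerate(values):
    print(k, v)  # frames 10..19 cover the burst
```

A detector would then compare each frame's VAS against a threshold; how that threshold is chosen or adapted is not stated in the abstract, so it is left out of the sketch.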