Authors: Qiuyun Hao, Ye Li, Peng Zhang, Yanhong Fan, Xiaofeng Ma, Jingsai Jiang
Published in: 2016 International Conference on Audio, Language and Image Processing (ICALIP), 2016-07-01
DOI: 10.1109/ICALIP.2016.7846549 (https://doi.org/10.1109/ICALIP.2016.7846549)
A 600BPS MELP vocoder with voice activity detection
Underwater, satellite, secure, and similar communication channels typically have narrow bandwidth and relatively poor channel conditions, so they need speech coding that delivers higher quality at a lower bit rate. To improve synthetic speech quality and save channel bandwidth, this paper introduces voice activity detection (VAD) into a 600 bps Mixed Excitation Linear Prediction (MELP) vocoder. VAD saves channel bandwidth and reduces noise, coding rate, and power consumption. To improve the accuracy of speech endpoint detection at low signal-to-noise ratio (SNR), noise reduction is adopted to raise the SNR, and a VAD algorithm based on a statistical model (STAT-VAD) is used. The MELP vocoder with VAD and noise reduction has good noise immunity and improved robustness over random channels, and it also reduces the average coding rate and saves channel bandwidth. The quality of the synthetic speech achieves the desired results at low SNR. In addition, the 600 bps vocoder with VAD runs in real time on the TMS320VC5510 DSP platform.
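The claim that VAD lowers the average coding rate can be illustrated with simple arithmetic. The sketch below is not taken from the paper: the function name `average_bit_rate`, the speech activity fractions, and the rate spent on non-speech frames (`silence_bps`) are all hypothetical values chosen for illustration. Only the 600 bps speech rate comes from the abstract.

```python
def average_bit_rate(activity: float,
                     speech_bps: float = 600.0,
                     silence_bps: float = 0.0) -> float:
    """Average rate when a fraction `activity` of frames is coded as speech.

    speech_bps  -- rate used for frames the VAD marks as speech (600 bps in the abstract)
    silence_bps -- rate used for non-speech frames (assumed here, not from the paper)
    """
    if not 0.0 <= activity <= 1.0:
        raise ValueError("activity must be in [0, 1]")
    return activity * speech_bps + (1.0 - activity) * silence_bps


if __name__ == "__main__":
    # Hypothetical speech activity fractions; conversational speech contains pauses,
    # so the fraction of active frames is usually well below 100%.
    for activity in (1.0, 0.6, 0.4):
        rate = average_bit_rate(activity)
        print(f"activity={activity:.0%}  average rate={rate:.0f} bps")
```

Under these assumptions the average rate scales linearly with the fraction of frames the VAD classifies as speech, which is why accurate endpoint detection at low SNR matters: frames wrongly marked as speech waste bandwidth, and frames wrongly marked as silence clip the speech.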