Authors: Sun Xun, Limin Du, Wei-Yean Howng
Venue: ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)
Published: 1998-10-12
DOI: https://doi.org/10.1109/ICOSP.1998.770282
Wavelet linear prediction vocoder based on auditory model
In speech coding, achieving higher quality at a lower bit rate is both difficult and important. Traditional vocoders, such as LPC10e and CELPC, give acceptable results but suffer from a hoarse output. We attribute this problem to a mismatch: linear prediction has equal resolution across the whole frequency band, whereas the human ear's resolution differs from band to band. In this paper, we present a new wavelet linear prediction subband coding algorithm (WLPSC) that employs a wavelet filter bank based on the auditory model. We divide the input speech signal into four subbands and then code each subband separately. Experimental results show that this algorithm can greatly reduce the hoarse output of vocoders.
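The abstract does not give the filter bank or the predictor details, so the following is only a minimal sketch of the general idea (split a frame into four subbands, then run linear prediction on each one), not the authors' WLPSC. It assumes a three-level Haar octave split as a stand-in for the auditory wavelet filter bank, the autocorrelation method with the Levinson-Durbin recursion for linear prediction, and per-band predictor orders chosen arbitrarily. All function names and the `orders` parameter are hypothetical.

```python
import math


def haar_split(x):
    """One level of Haar analysis: return (lowpass, highpass), each half length."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return low, high


def four_subbands(x):
    """Three-level octave split into four subbands, lowest band first.

    Assumption: Haar filters stand in for the paper's auditory filter bank.
    """
    bands = []
    low = list(x)
    for _ in range(3):
        low, high = haar_split(low)
        bands.append(high)
    bands.append(low)
    return bands[::-1]


def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of a finite frame."""
    return [sum(x[n] * x[n - k] for n in range(k, len(x)))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Predictor coefficients a[1..order] with x[n] ~ sum_k a[k] * x[n-k].

    Returns (coefficients, residual energy).
    """
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # silent or perfectly predicted band: stop early
            break
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:], err


def subband_lp_analysis(frame, orders=(4, 4, 2, 2)):
    """Run linear prediction on each of the four subbands.

    The orders are illustrative, not values from the paper.
    """
    result = []
    for band, order in zip(four_subbands(frame), orders):
        r = autocorr(band, order)
        result.append(levinson_durbin(r, order))
    return result


if __name__ == "__main__":
    # Synthetic 64-sample frame: two sinusoids, purely for illustration.
    frame = [math.sin(0.2 * n) + 0.3 * math.sin(2.5 * n) for n in range(64)]
    for idx, (coeffs, err) in enumerate(subband_lp_analysis(frame)):
        print(idx, [round(c, 3) for c in coeffs], round(err, 6))
```

Because each octave split halves the bandwidth, the lower bands get finer frequency resolution than the upper ones, which is the kind of unequal resolution the abstract contrasts with full-band linear prediction. A real coder would also need to quantize the coefficients and model the excitation, which this sketch leaves out.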