基于听觉模型的小波线性预测声码器

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344) Pub Date : 1998-10-12 DOI:10.1109/ICOSP.1998.770282

Sun Xun, Limin Du, Wei-Yean Howng

{"title":"基于听觉模型的小波线性预测声码器","authors":"Sun Xun, Limin Du, Wei-Yean Howng","doi":"10.1109/ICOSP.1998.770282","DOIUrl":null,"url":null,"abstract":"It is very difficult but very important to get higher quantity at lower bit rate in the field of speech coding. Traditional vocoders, such as LPC10e and CELPC, can give acceptable results but suffer from a hoarse output. We attribute this problem to the inconsistency between the equal resolution characteristic of linear prediction throughout the whole frequency band and the characteristic of the human ear's unequal resolution at different frequency bands. In this paper, we present a new wavelet linear prediction subband coding algorithm (WLPSC), by employing a wavelet filter bank based on the auditory model. We divide the input speech signal into four subbands, and then code each subband respectively. Experimental results show that this algorithm can greatly reduce the hoarse output of vocoders.","PeriodicalId":145700,"journal":{"name":"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1998-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Wavelet linear prediction vocoder based on auditory model\",\"authors\":\"Sun Xun, Limin Du, Wei-Yean Howng\",\"doi\":\"10.1109/ICOSP.1998.770282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It is very difficult but very important to get higher quantity at lower bit rate in the field of speech coding. Traditional vocoders, such as LPC10e and CELPC, can give acceptable results but suffer from a hoarse output. We attribute this problem to the inconsistency between the equal resolution characteristic of linear prediction throughout the whole frequency band and the characteristic of the human ear's unequal resolution at different frequency bands. In this paper, we present a new wavelet linear prediction subband coding algorithm (WLPSC), by employing a wavelet filter bank based on the auditory model. We divide the input speech signal into four subbands, and then code each subband respectively. Experimental results show that this algorithm can greatly reduce the hoarse output of vocoders.\",\"PeriodicalId\":145700,\"journal\":{\"name\":\"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1998-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSP.1998.770282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.1998.770282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在语音编码领域，如何以较低的码率获得较高的信息量是一个非常困难而又非常重要的问题。传统的声编码器，如LPC10e和CELPC，可以提供可接受的结果，但遭受沙哑输出。我们将这一问题归因于线性预测在整个频带的等分辨率特性与人耳在不同频带的不等分辨率特性之间的不一致。本文提出了一种基于听觉模型的小波滤波器组的小波线性预测子带编码算法。我们将输入语音信号分成四个子带，然后分别对每个子带进行编码。实验结果表明，该算法可以大大降低声码器的沙哑输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Wavelet linear prediction vocoder based on auditory model

It is very difficult but very important to get higher quantity at lower bit rate in the field of speech coding. Traditional vocoders, such as LPC10e and CELPC, can give acceptable results but suffer from a hoarse output. We attribute this problem to the inconsistency between the equal resolution characteristic of linear prediction throughout the whole frequency band and the characteristic of the human ear's unequal resolution at different frequency bands. In this paper, we present a new wavelet linear prediction subband coding algorithm (WLPSC), by employing a wavelet filter bank based on the auditory model. We divide the input speech signal into four subbands, and then code each subband respectively. Experimental results show that this algorithm can greatly reduce the hoarse output of vocoders.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344)

自引率

0.00%

发文量