High-quality harmonic coding at very low bit rates

Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing. Pub Date: 1994-04-19. DOI: 10.1109/ICASSP.1994.389325

Gao Yang, H. Leich


Citations: 5

Abstract


High-quality harmonic coding at very low bit rates

The paper presents a harmonic vocoder to produce high-quality speech at very low bit rates (below 2 kb/s). Voiced speech is decomposed into forward and backward signals which consist of interpolated harmonics. Unvoiced speech is reconstructed in the time domain with an approach similar to CELP. To remove the "buzzy" quality and avoid the "hoarse" quality, three methods are presented: the randomness of the harmonic phases is controlled according to pitch value and the continuity of synthetic speech is maintained; the spectral envelope determined by the LP model is modified; some noise components can be introduced for voiced synthetic speech. The harmonic vocoder produces quite natural, clear speech. Its perceptual quality is much better than that of the LPC-10 vocoder.
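The abstract does not give the synthesis equations, so the following is only a minimal sketch of generic sum-of-harmonics synthesis for one voiced frame, not the authors' method. It illustrates two of the ideas named above: per-harmonic phase randomness scaled by a pitch-dependent factor, and an optional noise component added to voiced output. The function names, the 8 kHz sample rate, the 160-sample frame, and in particular the linear pitch-to-jitter mapping (and its direction) are all assumptions made for illustration.

```python
import math
import random


def jitter_for_pitch(f0_hz, low_hz=80.0, high_hz=300.0):
    """Hypothetical mapping from pitch to phase randomness in [0, 1].

    Assumption: lower pitch gets more randomness. The abstract only says
    the randomness depends on pitch, not how.
    """
    x = (high_hz - f0_hz) / (high_hz - low_hz)
    return max(0.0, min(1.0, x))


def synthesize_voiced_frame(f0_hz, amplitudes, sample_rate=8000,
                            frame_len=160, phase_jitter=0.0,
                            noise_gain=0.0, seed=0):
    """Sum-of-harmonics synthesis for one voiced frame (illustrative).

    amplitudes[k] is the amplitude of harmonic k + 1, for example sampled
    from an LP spectral envelope at multiples of the pitch frequency.
    """
    rng = random.Random(seed)
    nyquist = sample_rate / 2.0
    # Keep only harmonics strictly below the Nyquist frequency.
    n_harm = sum(1 for k in range(1, len(amplitudes) + 1)
                 if k * f0_hz < nyquist)
    # One random phase offset per harmonic, scaled by phase_jitter.
    phases = [phase_jitter * rng.uniform(-math.pi, math.pi)
              for _ in range(n_harm)]
    frame = []
    for n in range(frame_len):
        t = n / sample_rate
        s = sum(amplitudes[k]
                * math.cos(2 * math.pi * (k + 1) * f0_hz * t + phases[k])
                for k in range(n_harm))
        # Optional white-noise component mixed into the voiced signal.
        s += noise_gain * rng.gauss(0.0, 1.0)
        frame.append(s)
    return frame
```

With `phase_jitter=0.0` and `noise_gain=0.0` every harmonic starts in phase, which is the fully periodic case usually associated with a "buzzy" sound; raising either parameter moves the output toward a noisier signal. A call such as `synthesize_voiced_frame(120.0, amps, phase_jitter=jitter_for_pitch(120.0), noise_gain=0.01)` combines both controls, where `amps` is a caller-supplied list of harmonic amplitudes.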
