Diphone-like units without phonemes - option for very low bit rate speech coding

EUROCON'2001. International Conference on Trends in Communications. Technical Program, Proceedings (Cat. No.01EX439) Pub Date : 2001-07-04 DOI:10.1109/EURCON.2001.938162

P. Motlícek, G. Baudoin, J. Černocký

Citations: 5

Abstract

Our aim is to improve the quality of speech produced by a very low bit rate (VLBR) segmental coder. The basic units are found automatically in a training database using temporal decomposition and vector quantization, and are modeled by hidden Markov models (HMMs). Two re-segmentation methods are then used to find new, longer units. In the first, the borders are placed at the centers of the previous units. In the second, they are placed at the centers of the middle HMM states of the previous units. The number of frames in each new unit is required to exceed a fixed constant, so a new unit can span several previous segments. These techniques reduced the transition noise in the resulting speech.
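The first re-segmentation approach can be illustrated with a minimal sketch. The abstract does not say how the minimum-length condition is enforced, so the greedy left-to-right rule below is an assumption, and the function names `unit_centers` and `select_borders` are hypothetical. The half-units before the first border and after the last border are simply ignored here.

```python
from typing import List


def unit_centers(boundaries: List[int]) -> List[int]:
    """Return the center frame of each unit.

    `boundaries` holds ascending frame indices b_0 < b_1 < ... < b_n,
    so unit i spans frames b_i .. b_(i+1).
    """
    return [(lo + hi) // 2 for lo, hi in zip(boundaries, boundaries[1:])]


def select_borders(candidates: List[int], min_frames: int) -> List[int]:
    """Keep candidate borders so that every new unit exceeds `min_frames`.

    Assumed rule (not taken from the paper): scan left to right and skip
    any candidate that would close a unit of `min_frames` frames or fewer.
    A skipped candidate means the new unit absorbs another previous segment.
    """
    if not candidates:
        return []
    borders = [candidates[0]]
    for c in candidates[1:]:
        if c - borders[-1] > min_frames:
            borders.append(c)
    return borders


# Four original units delimited by these frame indices.
old_boundaries = [0, 10, 14, 18, 30]
centers = unit_centers(old_boundaries)           # [5, 12, 16, 24]
new_borders = select_borders(centers, min_frames=5)
print(new_borders)                               # [5, 12, 24]
```

In this example the center at frame 16 is skipped because it would close a 4-frame unit, so the new unit from frame 12 to frame 24 covers parts of three previous segments. For the second approach, `select_borders` could be fed the centers of the middle HMM states instead of the unit centers. Those would have to come from a state-level alignment, which is not shown here.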

