A novel method of formant analysis and glottal inverse filtering

5th International Conference on Spoken Language Processing (ICSLP 1998) Pub Date : 1998-11-30 DOI:10.21437/ICSLP.1998-543

Steve Pearson

引用次数: 1

Abstract

This paper presents a class of methods for automatically extracting formant parameters from speech. The methods rely on an iterative optimization algorithm. It was found that formant parameter data derived with these methods was less prone to discontinuity errors than conventional methods. Also, experiments were conducted that demonstrated that these methods are capable of better accuracy in formant estimation than LPC, especially for the first formant. In some cases, the analytic (non-iterative) solution has been derived, making real time applications feasible. The main target that we have been pursuing is text-to-speech (TTS) conversion. These methods are being used to automatically analyze a concatenation database, without the need for a tuning phase to fix errors. In addition, they are instrumental in realizing high quality pitch tracking, and pitch epoch marking.

查看原文本刊更多论文

一种新的形成峰分析和声门反滤波方法

提出了一种从语音中自动提取形成峰参数的方法。该方法依赖于迭代优化算法。研究发现，与传统方法相比，用这些方法得到的构造峰参数数据不容易出现不连续误差。此外，实验表明，这些方法能够比LPC更准确地估计形成峰，特别是对于第一个形成峰。在某些情况下，推导出了解析(非迭代)解，使实时应用变得可行。我们一直追求的主要目标是文本到语音(TTS)的转换。这些方法用于自动分析串联数据库，而不需要调优阶段来修复错误。此外，它们有助于实现高质量的音高跟踪和音高epoch标记。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

5th International Conference on Spoken Language Processing (ICSLP 1998)

自引率

0.00%

发文量