Deciphering speech waveforms

ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1986-04-07 DOI:10.1109/ICASSP.1986.1168540

M. O'Kane, Judy Gillis, Philip Rose, Michael Wagner

引用次数: 3

Abstract

Many phoneticians are remarkably expert at 'reading' speech waveforms. This paper describes an attempt to capture this knowledge for use as a segmentation and early labelling knowledge source for a continuous speech recognition system. As well as deriving information from the waveform directly, the decisions made by the waveform deciphering knowledge source are based on a related series of functions derived from the waveform. These functions, which relate to both valley-to-peak and zero crossing measures, are computationally very efficient and it would seem that the frequency analogues of these functions could provide an alternative means of deriving a certain amount of the spectral information more usually obtained through spectrograms.

查看原文本刊更多论文

解码语音波形

许多语音学家都非常擅长“读懂”语音波形。本文描述了一种获取这些知识的尝试，用于连续语音识别系统的分割和早期标记知识库。除了直接从波形中获取信息外，波形解密知识源所做的决策是基于从波形中导出的一系列相关函数。这些函数与谷峰和零交叉测量有关，在计算上非常有效，而且这些函数的频率类似物似乎可以提供一种替代方法，以获得通常通过谱图获得的一定数量的频谱信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量