Automatic feature extraction from spectrograms for acoustic-phonetic analysis

Q4 Computer Science

模式识别与人工智能 Pub Date : 1992-08-30 DOI:10.1109/ICPR.1992.201873

E. Edmonds, L. Pan, Stella M. O'Brien

引用次数: 1

Abstract

Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<>

查看原文本刊更多论文

从声谱图中自动提取特征用于声学-语音分析

提出了一种新的自动特征提取方法，该方法是自动连续语音识别中声音分析的重要组成部分。该方法包括四个层次:分割、模式分类、特征识别和标记以及后置处理器。有三种类型的模式:模糊，形成和沉默。提取的特征包括声条、条纹、截断和前四个共振峰的过渡。提出了一些技术，如用于分割的两种特殊的失真函数，以及用于检测条纹特征的峰值迭代函数。该软件已作为语音知识界面的一部分实现，该界面是一个独立于说话人的连续语音识别的语音分析专家系统。它已经用从谱图数据库中选择的一组数据进行了测试;大多数特征的正确检测率超过89%，在某些情况下高达98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊