从声谱图中自动提取特征用于声学-语音分析

Q4 Computer Science

模式识别与人工智能 Pub Date : 1992-08-30 DOI:10.1109/ICPR.1992.201873

E. Edmonds, L. Pan, Stella M. O'Brien

{"title":"从声谱图中自动提取特征用于声学-语音分析","authors":"E. Edmonds, L. Pan, Stella M. O'Brien","doi":"10.1109/ICPR.1992.201873","DOIUrl":null,"url":null,"abstract":"Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<<ETX>>","PeriodicalId":34917,"journal":{"name":"模式识别与人工智能","volume":"2 1","pages":"701-704"},"PeriodicalIF":0.0000,"publicationDate":"1992-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automatic feature extraction from spectrograms for acoustic-phonetic analysis\",\"authors\":\"E. Edmonds, L. Pan, Stella M. O'Brien\",\"doi\":\"10.1109/ICPR.1992.201873\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<<ETX>>\",\"PeriodicalId\":34917,\"journal\":{\"name\":\"模式识别与人工智能\",\"volume\":\"2 1\",\"pages\":\"701-704\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1992-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"模式识别与人工智能\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.1109/ICPR.1992.201873\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"模式识别与人工智能","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/ICPR.1992.201873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 1

摘要

提出了一种新的自动特征提取方法，该方法是自动连续语音识别中声音分析的重要组成部分。该方法包括四个层次:分割、模式分类、特征识别和标记以及后置处理器。有三种类型的模式:模糊，形成和沉默。提取的特征包括声条、条纹、截断和前四个共振峰的过渡。提出了一些技术，如用于分割的两种特殊的失真函数，以及用于检测条纹特征的峰值迭代函数。该软件已作为语音知识界面的一部分实现，该界面是一个独立于说话人的连续语音识别的语音分析专家系统。它已经用从谱图数据库中选择的一组数据进行了测试;大多数特征的正确检测率超过89%，在某些情况下高达98%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Automatic feature extraction from spectrograms for acoustic-phonetic analysis

Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<>

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

模式识别与人工智能 Computer Science-Artificial Intelligence

CiteScore

1.60

自引率

0.00%

发文量

3316

期刊介绍：