Automatic feature extraction from spectrograms for acoustic-phonetic analysis

Q4 Computer Science
E. Edmonds, L. Pan, Stella M. O'Brien
{"title":"Automatic feature extraction from spectrograms for acoustic-phonetic analysis","authors":"E. Edmonds, L. Pan, Stella M. O'Brien","doi":"10.1109/ICPR.1992.201873","DOIUrl":null,"url":null,"abstract":"Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<<ETX>>","PeriodicalId":34917,"journal":{"name":"模式识别与人工智能","volume":"2 1","pages":"701-704"},"PeriodicalIF":0.0000,"publicationDate":"1992-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"模式识别与人工智能","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.1109/ICPR.1992.201873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 1

Abstract

Proposes a new approach for automatic feature extraction from spectrograms, which is an essential component of acoustic-phonetic analysis in automatic continuous speech recognition. The method comprised four levels: segmentation, pattern classification, feature recognition and labelling, and a post-processor. There were three types of patterns: fuzzy, formant and silence. The extracted features included voice bar, stripes, cut-off and transitions of the first four formants. Some techniques are presented, such as two special distortion functions used in segmentation, and a peak-iterate function to detect the stripes feature. This software has been implemented as part of a speech knowledge interface, which was an expert system for speech analysis for speaker-independent, continuous speech recognition. It has been tested with a set of data chosen from a spectrogram database; the correct detection rate for most features was over 89%, and in some cases was as high as 98%.<>
从声谱图中自动提取特征用于声学-语音分析
提出了一种新的自动特征提取方法,该方法是自动连续语音识别中声音分析的重要组成部分。该方法包括四个层次:分割、模式分类、特征识别和标记以及后置处理器。有三种类型的模式:模糊,形成和沉默。提取的特征包括声条、条纹、截断和前四个共振峰的过渡。提出了一些技术,如用于分割的两种特殊的失真函数,以及用于检测条纹特征的峰值迭代函数。该软件已作为语音知识界面的一部分实现,该界面是一个独立于说话人的连续语音识别的语音分析专家系统。它已经用从谱图数据库中选择的一组数据进行了测试;大多数特征的正确检测率超过89%,在某些情况下高达98%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
模式识别与人工智能
模式识别与人工智能 Computer Science-Artificial Intelligence
CiteScore
1.60
自引率
0.00%
发文量
3316
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信