Evaluating vowel pronunciation quality: Formant space matching versus ASR confidence scoring

2010 National Conference On Communications (NCC) Pub Date : 2010-03-15 DOI:10.1109/NCC.2010.5430187

Ashish Patil, Chitralekha Gupta, P. Rao

Citations: 9

Abstract

Quantitative evaluation of how well a speaker pronounces the vowels of a language can contribute to the important task of speaker accent detection. Our aim is to distinguish, both qualitatively and quantitatively, between native and non-native speakers of a language through a comparative study of two analysis methods. The first examines the relative positions of a speaker's vowels in formant (F1-F2) space, which conveys important articulatory information. The second exploits the sensitivity of trained phone models to accent variations, as captured by log-likelihood scores, to separate native from non-native speakers.
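As a rough illustration of the first method, the sketch below averages each vowel's (F1, F2) measurements for one speaker and reports the mean Euclidean distance to a set of native reference positions. The abstract does not specify the matching procedure, so the function names, the distance measure, and all formant values here are assumptions made for illustration, not the authors' method or data.

```python
import math


def vowel_centroids(tokens):
    """Average (F1, F2) in Hz per vowel from (label, f1, f2) measurements."""
    sums = {}
    for label, f1, f2 in tokens:
        acc = sums.setdefault(label, [0.0, 0.0, 0])
        acc[0] += f1
        acc[1] += f2
        acc[2] += 1
    return {v: (a[0] / a[2], a[1] / a[2]) for v, a in sums.items()}


def formant_space_distance(speaker, reference):
    """Mean Euclidean distance in the F1-F2 plane over vowels found in both maps."""
    shared = sorted(set(speaker) & set(reference))
    if not shared:
        raise ValueError("no vowels in common")
    total = sum(math.dist(speaker[v], reference[v]) for v in shared)
    return total / len(shared)


# Hypothetical native reference positions (Hz); placeholder values only.
reference = {"i": (300.0, 2300.0), "a": (700.0, 1200.0), "u": (300.0, 900.0)}

# Hypothetical measurements for one test speaker; placeholder values only.
tokens = [
    ("i", 320.0, 2200.0),
    ("i", 340.0, 2100.0),
    ("a", 650.0, 1300.0),
    ("u", 350.0, 1000.0),
]

speaker = vowel_centroids(tokens)
score = formant_space_distance(speaker, reference)
print(f"mean F1-F2 distance to reference: {score:.1f} Hz")
```

A larger distance suggests vowel positions further from the reference set. In practice, raw Hz distances also reflect vocal-tract differences between speakers, so some form of speaker normalization would likely be needed before comparing scores across speakers.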

