Building speakers' vowel models and its application in text independent speaker verification

2013 47th International Carnahan Conference on Security Technology (ICCST) Pub Date : 2013-10-01 DOI:10.1109/CCST.2013.6922034

J. Leu, Liang-tsair Geeng, C. Pu, Jyh-Bin Shiau

引用次数: 0

Abstract

In text-independent speaker verification, we compare two sets of sentences with different text content for their tonal similarity to determine if they were due to the same speaker. Since the sentences are different, we may not have matching words to compare. However, the sentences are constructed from the same set of phonemes of the language used, including vowels and consonants. Generally speaking, vowels are fewer in number, but are the more significant parts of a sentence in terms of duration and loudness, very suitable to be used for tonal comparison. In this paper, we first built spectral models for the simple vowels in Mandarin Chinese. Then we applied the models to analyze two given sets of speech sentences, detecting the various simple vowels in the sentences, and used the detected vowels to build a tonal model for each speaker. After that, we proceed to compare the two tonal models to determine the probability that the two speakers are indeed the same person.

查看原文本刊更多论文

说话人元音模型的建立及其在语篇独立说话人验证中的应用

在文本无关的说话人验证中，我们比较两组具有不同文本内容的句子的音调相似性，以确定它们是否来自同一说话人。由于句子不同，我们可能没有匹配的单词来比较。然而，这些句子是由所用语言的同一组音素构成的，包括元音和辅音。一般来说，元音在数量上较少，但在持续时间和响度上是句子中比较重要的部分，非常适合用于音调比较。本文首先建立了普通话简单元音的谱模型。然后，我们应用该模型对两组给定的语音句子进行分析，检测句子中的各种简单元音，并使用检测到的元音为每个说话者建立音调模型。之后，我们继续比较两个音调模型，以确定两个说话者确实是同一个人的概率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 47th International Carnahan Conference on Security Technology (ICCST)

自引率

0.00%

发文量