{"title":"Building speakers' vowel models and its application in text independent speaker verification","authors":"J. Leu, Liang-tsair Geeng, C. Pu, Jyh-Bin Shiau","doi":"10.1109/CCST.2013.6922034","DOIUrl":null,"url":null,"abstract":"In text-independent speaker verification, we compare two sets of sentences with different text content for their tonal similarity to determine if they were due to the same speaker. Since the sentences are different, we may not have matching words to compare. However, the sentences are constructed from the same set of phonemes of the language used, including vowels and consonants. Generally speaking, vowels are fewer in number, but are the more significant parts of a sentence in terms of duration and loudness, very suitable to be used for tonal comparison. In this paper, we first built spectral models for the simple vowels in Mandarin Chinese. Then we applied the models to analyze two given sets of speech sentences, detecting the various simple vowels in the sentences, and used the detected vowels to build a tonal model for each speaker. After that, we proceed to compare the two tonal models to determine the probability that the two speakers are indeed the same person.","PeriodicalId":243791,"journal":{"name":"2013 47th International Carnahan Conference on Security Technology (ICCST)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 47th International Carnahan Conference on Security Technology (ICCST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCST.2013.6922034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In text-independent speaker verification, we compare two sets of sentences with different text content for their tonal similarity to determine if they were due to the same speaker. Since the sentences are different, we may not have matching words to compare. However, the sentences are constructed from the same set of phonemes of the language used, including vowels and consonants. Generally speaking, vowels are fewer in number, but are the more significant parts of a sentence in terms of duration and loudness, very suitable to be used for tonal comparison. In this paper, we first built spectral models for the simple vowels in Mandarin Chinese. Then we applied the models to analyze two given sets of speech sentences, detecting the various simple vowels in the sentences, and used the detected vowels to build a tonal model for each speaker. After that, we proceed to compare the two tonal models to determine the probability that the two speakers are indeed the same person.