{"title":"基于自适应gmm的对数似然核方法用于说话人验证","authors":"Liang He, Yi Yang, Jia Liu","doi":"10.1109/GCIS.2012.100","DOIUrl":null,"url":null,"abstract":"Kernels in a SVM-based text-independent speaker verification system determine the performance. One of the main difficulties in designing a kernel arises from the unequal length of cepstral vector sequences. To simplify the above problem, time information is discarded and each speaker is presumed to have a unique probability density distribution. Gaussian mixture models (GMMs) are often used to estimate the probability density distribution from the train cepstral vector sequence. The methods of constructing SVM kernels by adapted GMMs become an open and key question in a GMM-SVM system. In this paper, we introduce a novel way of measuring the similarity between adapted GMMs and propose a log-likelihood kernel. We demonstrate that the presented kernel has an excellent performance on the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) 2008 tel-tel English corpus.","PeriodicalId":337629,"journal":{"name":"2012 Third Global Congress on Intelligent Systems","volume":"639 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Log-Likelihood Kernels Based on Adapted GMMs for Speaker Verification\",\"authors\":\"Liang He, Yi Yang, Jia Liu\",\"doi\":\"10.1109/GCIS.2012.100\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Kernels in a SVM-based text-independent speaker verification system determine the performance. One of the main difficulties in designing a kernel arises from the unequal length of cepstral vector sequences. To simplify the above problem, time information is discarded and each speaker is presumed to have a unique probability density distribution. Gaussian mixture models (GMMs) are often used to estimate the probability density distribution from the train cepstral vector sequence. The methods of constructing SVM kernels by adapted GMMs become an open and key question in a GMM-SVM system. In this paper, we introduce a novel way of measuring the similarity between adapted GMMs and propose a log-likelihood kernel. We demonstrate that the presented kernel has an excellent performance on the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) 2008 tel-tel English corpus.\",\"PeriodicalId\":337629,\"journal\":{\"name\":\"2012 Third Global Congress on Intelligent Systems\",\"volume\":\"639 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Third Global Congress on Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/GCIS.2012.100\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Third Global Congress on Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GCIS.2012.100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Log-Likelihood Kernels Based on Adapted GMMs for Speaker Verification
Kernels in a SVM-based text-independent speaker verification system determine the performance. One of the main difficulties in designing a kernel arises from the unequal length of cepstral vector sequences. To simplify the above problem, time information is discarded and each speaker is presumed to have a unique probability density distribution. Gaussian mixture models (GMMs) are often used to estimate the probability density distribution from the train cepstral vector sequence. The methods of constructing SVM kernels by adapted GMMs become an open and key question in a GMM-SVM system. In this paper, we introduce a novel way of measuring the similarity between adapted GMMs and propose a log-likelihood kernel. We demonstrate that the presented kernel has an excellent performance on the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) 2008 tel-tel English corpus.