Milan M. Dobrovic, V. Delić, N. Jakovljević, I. Jokic
{"title":"自动说话人识别性能与标准功能的比较","authors":"Milan M. Dobrovic, V. Delić, N. Jakovljević, I. Jokic","doi":"10.1109/SISY.2012.6339541","DOIUrl":null,"url":null,"abstract":"This paper presents a study of speaker recognition accuracy depending on the choice of features, window width and model complexity. The standard features were considered, such as linear and perceptual prediction coefficients (LPC and PLP) and mel-frequency cepstral coefficients (MFCC). Gaussian mixture model (GMM), with the use of HTK tools, was chosen for speaker modelling. Speech database S70W100s120, recorded at the Electrical Engineering Department of Belgrade University, was used for purposes of system training and testing. Ten speaker models and the universal background model (UBM) were trained.","PeriodicalId":207630,"journal":{"name":"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Comparison of the automatic speaker recognition performance over standard features\",\"authors\":\"Milan M. Dobrovic, V. Delić, N. Jakovljević, I. Jokic\",\"doi\":\"10.1109/SISY.2012.6339541\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a study of speaker recognition accuracy depending on the choice of features, window width and model complexity. The standard features were considered, such as linear and perceptual prediction coefficients (LPC and PLP) and mel-frequency cepstral coefficients (MFCC). Gaussian mixture model (GMM), with the use of HTK tools, was chosen for speaker modelling. Speech database S70W100s120, recorded at the Electrical Engineering Department of Belgrade University, was used for purposes of system training and testing. Ten speaker models and the universal background model (UBM) were trained.\",\"PeriodicalId\":207630,\"journal\":{\"name\":\"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SISY.2012.6339541\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SISY.2012.6339541","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparison of the automatic speaker recognition performance over standard features
This paper presents a study of speaker recognition accuracy depending on the choice of features, window width and model complexity. The standard features were considered, such as linear and perceptual prediction coefficients (LPC and PLP) and mel-frequency cepstral coefficients (MFCC). Gaussian mixture model (GMM), with the use of HTK tools, was chosen for speaker modelling. Speech database S70W100s120, recorded at the Electrical Engineering Department of Belgrade University, was used for purposes of system training and testing. Ten speaker models and the universal background model (UBM) were trained.