Na Li, Xiangyang Zeng, Zhifeng Li, Y. Qiao, W. Jiang
{"title":"Using Fishervoice to enhance the performance of I-vector based speaker verification system","authors":"Na Li, Xiangyang Zeng, Zhifeng Li, Y. Qiao, W. Jiang","doi":"10.1109/ICIST.2014.6920544","DOIUrl":null,"url":null,"abstract":"I-vector is a popular feature representation technique in speaker verification systems. In this paper, we use Fishervoice algorithm in combination with i-vector feature representation to improve speaker verification performance. By applying the Fishervoice model to map the i-vector into a low-dimensional discriminant subspace, the intra-speaker variability can be reduced and the discriminative class boundary information can be emphasized for enhanced recognition performance. Experiments on NIST SRE 2008 core test task show that the proposed framework achieves 19.9% and 8.5% dramatic relative decrease in EER and minDCF metrics respectively compared to the state-of-the-art PLDA based method.","PeriodicalId":306383,"journal":{"name":"2014 4th IEEE International Conference on Information Science and Technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th IEEE International Conference on Information Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIST.2014.6920544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
I-vector is a popular feature representation technique in speaker verification systems. In this paper, we use Fishervoice algorithm in combination with i-vector feature representation to improve speaker verification performance. By applying the Fishervoice model to map the i-vector into a low-dimensional discriminant subspace, the intra-speaker variability can be reduced and the discriminative class boundary information can be emphasized for enhanced recognition performance. Experiments on NIST SRE 2008 core test task show that the proposed framework achieves 19.9% and 8.5% dramatic relative decrease in EER and minDCF metrics respectively compared to the state-of-the-art PLDA based method.