2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Latest Publications

Normalization of total variability matrix for i-vector/PLDA speaker verification
Wei Rao, M. Mak, Kong-Aik Lee
{"title":"Normalization of total variability matrix for i-vector/PLDA speaker verification","authors":"Wei Rao, M. Mak, Kong-Aik Lee","doi":"10.1109/ICASSP.2015.7178758","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178758","url":null,"abstract":"Gaussian PLDA with uncertainty propagation is effective for i-vector based speaker verification. The idea is to propagate the uncertainty of i-vectors caused by the duration variability of utterances to the PLDA model. However, a limitation of the method is the difficulty of performing length normalization on the posterior covariance matrix of an i-vector. This paper proposes a method to avoid performing length normalization on i-vectors in Gaussian PLDA modeling so that uncertainty propagation can be directly applied without transforming the posterior covariance matrices of i-vectors. Instead of performing length normalization on i-vectors independently, the proposed method normalizes the column vectors of the total variability matrix. Because the i-vectors of all utterances are derived from the same normalized total variability matrix, they will be subject to the same degree of normalization, thereby avoiding the undesirable distortion introduced by the utterance-dependent length-normalization process. Experimental results on both NIST 2010 and 2012 SREs demonstrate that the proposed method achieves a performance similar to (and in some situations better than) that of Gaussian PLDA with length normalization. The method has the potential of improving the performance of uncertainty propagation for i-vector/PLDA speaker verification.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114971306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 14
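The abstract pins the method on one concrete step: normalizing the columns of the total variability matrix T instead of length-normalizing each i-vector. As a rough illustration only, here is a minimal numpy sketch under the assumption that "normalization" means scaling each column of T to unit Euclidean norm; the abstract does not give the exact scaling rule, so the paper's formula may differ.

```python
import numpy as np

def normalize_total_variability(T):
    """Column-wise normalization of the total variability matrix T.

    Hypothetical reading of the abstract: each column of T is scaled
    to unit Euclidean norm, so every i-vector extracted with the
    normalized T receives the same degree of normalization.
    """
    norms = np.linalg.norm(T, axis=0, keepdims=True)
    return T / np.maximum(norms, 1e-12)  # guard against zero columns

# Toy usage: a random 60x10 total variability matrix
rng = np.random.default_rng(0)
T = rng.standard_normal((60, 10))
T_norm = normalize_total_variability(T)
print(np.linalg.norm(T_norm, axis=0))  # all ~1.0
```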
An Encryption-then-Compression system for JPEG 2000 standard
Osamu Watanabe, Akira Uchida, T. Fukuhara, H. Kiya
{"title":"An Encryption-then-Compression system for JPEG 2000 standard","authors":"Osamu Watanabe, Akira Uchida, T. Fukuhara, H. Kiya","doi":"10.1109/ICASSP.2015.7178165","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178165","url":null,"abstract":"A new Encryption-then-Compression (ETC) system for the JPEG 2000 standard is proposed in this paper. An ETC system is known as a system that makes image communication secure and efficient by using perceptual encryption and image compression. The proposed system uses the sign-scrambling and block-shuffling of discrete wavelet transform (DWT) coefficients as perceptual encryption. Unlike conventional ETC systems, the proposed system is compatible with the JPEG 2000 standard because the perceptually encrypted coefficients can be efficiently compressed by the JPEG 2000. The experimental results demonstrated that the proposed system achieved both acceptable compression performance and enough key-space for secure image communication while remaining compatible with the JPEG 2000 standard.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114974018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 50
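The two encryption primitives named in the abstract, sign-scrambling and block-shuffling of DWT coefficients, are simple to sketch. The following Python sketch (using the PyWavelets package) applies both to the subbands of a single-level 2-D DWT; the wavelet ('haar'), block size, and per-subband treatment are illustrative assumptions, not the authors' parameters. Decryption would regenerate the same key-seeded random signs and permutations and invert them.

```python
import numpy as np
import pywt  # PyWavelets

def scramble_subband(sb, rng, block=8):
    """Sign-scramble all coefficients, then shuffle square blocks.
    Block size and treating every subband alike are illustrative
    assumptions."""
    sb = sb * rng.choice([-1.0, 1.0], size=sb.shape)       # sign scrambling
    h = sb.shape[0] // block * block
    w = sb.shape[1] // block * block
    blocks = (sb[:h, :w].reshape(h // block, block, w // block, block)
                        .swapaxes(1, 2).reshape(-1, block, block))
    blocks = blocks[rng.permutation(len(blocks))]          # block shuffling
    sb[:h, :w] = (blocks.reshape(h // block, w // block, block, block)
                        .swapaxes(1, 2).reshape(h, w))
    return sb

def perceptual_encrypt(img, key):
    cA, (cH, cV, cD) = pywt.dwt2(img.astype(float), 'haar')
    rng = np.random.default_rng(key)                       # key-seeded stream
    return [scramble_subband(sb, rng) for sb in (cA, cH, cV, cD)]

# Toy usage on a random 64x64 "image"
subbands = perceptual_encrypt(np.random.default_rng(7).random((64, 64)), key=1234)
```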
Efficient spectrogram-based binary image feature for audio copy detection
Chahid Ouali, P. Dumouchel, Vishwa Gupta
{"title":"Efficient spectrogram-based binary image feature for audio copy detection","authors":"Chahid Ouali, P. Dumouchel, Vishwa Gupta","doi":"10.1109/ICASSP.2015.7178279","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178279","url":null,"abstract":"This paper presents the latest improvements on our Spectro system that detects transformed duplicate audio content. We propose a new binary image feature derived from a spectrogram matrix by using a threshold based on the average of the spectral values. We quantize this binary image by applying a tile of fixed size and computing the sum of each small square in the tile. Fingerprints of each binary image encode the positions of the selected tiles. Evaluation on TRECVID 2010 CBCD data shows that this new feature improves significantly the Spectro system for transformations that add irrelevant speech to the audio. Compared to a state-of-the-art audio fingerprinting system, the proposed method reduces the minimal Normalized Detection Cost Rate (min NDCR) by 33%, improves localization accuracy by 28% and results in 40% fewer missed queries.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115127581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 13
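The fingerprinting pipeline described in the abstract (binarize the spectrogram against its mean, tile it, keep the positions of the strongest tiles) can be sketched in a few lines of Python. The STFT parameters, tile size, and number of selected tiles below are illustrative guesses; the abstract does not specify them.

```python
import numpy as np
from scipy.signal import spectrogram

def binary_fingerprint(x, fs, tile=8, n_select=32):
    """Sketch of the abstract's pipeline: binarize the spectrogram
    against its mean, sum each fixed-size tile, and encode the
    positions of the strongest tiles. tile/n_select are assumptions."""
    _, _, S = spectrogram(x, fs=fs, nperseg=256, noverlap=128)
    B = (S > S.mean()).astype(np.uint8)            # binary image feature
    F = B.shape[0] // tile * tile
    T = B.shape[1] // tile * tile
    sums = (B[:F, :T]
            .reshape(F // tile, tile, T // tile, tile)
            .sum(axis=(1, 3)))                     # per-tile occupancy
    flat = np.argsort(sums.ravel())[::-1][:n_select]
    return np.unravel_index(flat, sums.shape)      # tile positions = fingerprint

# Toy usage on one second of noise
rng = np.random.default_rng(1)
rows, cols = binary_fingerprint(rng.standard_normal(16000), fs=16000)
```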
Paraphrastic recurrent neural network language models
Xunying Liu, Xie Chen, M. Gales, P. Woodland
{"title":"Paraphrastic recurrent neural network language models","authors":"Xunying Liu, Xie Chen, M. Gales, P. Woodland","doi":"10.1109/ICASSP.2015.7179004","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7179004","url":null,"abstract":"Recurrent neural network language models (RNNLM) have become an increasingly popular choice for state-of-the-art speech recognition systems. Linguistic factors in??uencing the realization of surface word sequences, for example, expressive richness, are only implicitly learned by RNNLMs. Observed sentences and their associated alternative paraphrases representing the same meaning are not explicitly related during training. In order to improve context coverage and generalization, paraphrastic RNNLMs are investigated in this paper. Multiple paraphrase variants were automatically generated and used in paraphrastic RNNLM training. Using a paraphrastic multi-level RNNLM modelling both word and phrase sequences, signi??cant error rate reductions of 0.6% absolute and perplexity reduction of 10% relative were obtained over the baseline RNNLM on a large vocabulary conversational telephone speech recognition system trained on 2000 hours of audio and 545 million words of texts. The overall improvement over the baseline n-gram LM was increased from 8.4% to 11.6% relative.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115618485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 10
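The core training trick, pooling each observed sentence with automatically generated paraphrases of it, does not depend on the LM family. The toy sketch below substitutes a count-based bigram LM for the paper's RNNLM, purely to stay short and self-contained, and uses hand-written paraphrases where the paper generates them with a statistical paraphrase model.

```python
from collections import Counter, defaultdict

def train_bigram_lm(sentences):
    """Toy count-based bigram LM with add-one smoothing.
    Stands in for the paper's RNNLM only to keep the sketch small."""
    counts, context, vocab = defaultdict(Counter), Counter(), set()
    for s in sentences:
        toks = ['<s>'] + s.split() + ['</s>']
        vocab.update(toks)
        for a, b in zip(toks, toks[1:]):
            counts[a][b] += 1
            context[a] += 1
    V = len(vocab)
    return lambda a, b: (counts[a][b] + 1) / (context[a] + V)

# Paraphrastic training: pool each observed sentence with its
# paraphrase variants (hypothetical examples, hand-written here).
observed    = ["please book a flight to boston"]
paraphrases = ["could you book a flight to boston",
               "please reserve a flight to boston"]
p = train_bigram_lm(observed + paraphrases)
print(p('please', 'reserve'))  # probability mass now covers paraphrase contexts
```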
Combining two phase codes to extend the radar unambiguous range and get a trade-off in terms of performance for any clutter
T. Rouffet, É. Grivel, P. Vallet, C. Enderli, S. Kemkemian
{"title":"Combining two phase codes to extend the radar unambiguous range and get a trade-off in terms of performance for any clutter","authors":"T. Rouffet, É. Grivel, P. Vallet, C. Enderli, S. Kemkemian","doi":"10.1109/ICASSP.2015.7178285","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178285","url":null,"abstract":"This paper deals with a phase-coded waveform which combines two binary phase codes, each impacting on specific properties of the radar receiving channel. After giving a detailed analysis of the expression of the received signal after processing when the Gaussian clutter is modeled by a pth-order autoregressive process, we focus our attention on the choice of the two phase codes: one aims at increasing the unambiguous range whereas the other is chosen by taking into account several criteria such as the detection performance. For this purpose, we suggest determining the Pareto fronts of 1st, 2nd and 3rd orders by means of an exhaustive search. Given the three Pareto fronts for different types of clutters simulated by making the AR parameters vary, we provide an automatic way to determine, before embedding it, the most robust phase codes using a fuzzy logic operator.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117118065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 3
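The selection machinery, an exhaustive search over binary phase codes followed by Pareto-front extraction, can be illustrated compactly. The sketch below scores each length-10 code on two stand-in objectives (peak and integrated autocorrelation sidelobe level); the paper's actual criteria involve detection performance under AR-modeled clutter, which is not reproduced here.

```python
import numpy as np
from itertools import product

def sidelobes(code):
    """Peak and integrated sidelobe level of the aperiodic
    autocorrelation of a +/-1 phase code."""
    c = np.asarray(code, float)
    ac = np.correlate(c, c, mode='full')
    sl = np.abs(np.delete(ac, len(c) - 1))  # drop the zero-lag peak
    return (sl.max(), sl.sum())

def pareto_front(points):
    """First-order Pareto front, both objectives minimized."""
    front = []
    for i, p in enumerate(points):
        dominated = any(all(q[k] <= p[k] for k in range(2)) and q != p
                        for j, q in enumerate(points) if j != i)
        if not dominated:
            front.append(p)
    return front

# Exhaustive search over all 2^10 binary codes of length 10
scores = [sidelobes(c) for c in product([-1, 1], repeat=10)]
print(pareto_front(scores)[:5])
```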
Single underwater image descattering and color correction
Huimin Lu, Yujie Li, S. Serikawa
{"title":"Single underwater image descattering and color correction","authors":"Huimin Lu, Yujie Li, S. Serikawa","doi":"10.1109/ICASSP.2015.7178245","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178245","url":null,"abstract":"Absorption, scattering, and color distortion are three major issues in underwater optical imaging. Light rays traveling through water are scattered and absorbed according to their wavelength. Scattering is caused by large suspended particles that degrade optical images captured underwater. Color distortion occurs because different wavelengths are attenuated to different degrees in water; consequently, images of ambient underwater environments are dominated by a bluish tone. In the present paper, we propose a novel underwater imaging model that compensates for the attenuation discrepancy along the propagation path. In addition, we develop a fast weighted guided normalized convolution domain filtering algorithm for enhancing underwater optical images in shallow oceans. The enhanced images are characterized by a reduced noised level, better exposure in dark regions, and improved global contrast, by which the finest details and edges are enhanced significantly.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117181531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 17
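As a very rough stand-in for the paper's attenuation-compensation step, the sketch below applies a gray-world-style per-channel gain, so the red channel (most attenuated underwater) is boosted relative to blue. The paper's actual imaging model and its weighted guided normalized convolution filter are not reproduced here.

```python
import numpy as np

def compensate_attenuation(img):
    """Crude wavelength-dependent compensation: rescale each channel
    toward a common mean. A simplified stand-in for the paper's
    attenuation model, used only to illustrate the idea."""
    img = img.astype(float)
    means = img.reshape(-1, 3).mean(axis=0)        # per-channel means (R, G, B)
    gain = means.mean() / np.maximum(means, 1e-6)  # weak channels get large gains
    return np.clip(img * gain, 0, 255).astype(np.uint8)

# Toy usage: a synthetic bluish frame
rng = np.random.default_rng(2)
frame = (rng.random((64, 64, 3)) * [80, 150, 220]).astype(np.uint8)
balanced = compensate_attenuation(frame)
```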
Improved linear least squares estimation using bounded data uncertainty
Tarig Ballal, T. Al-Naffouri
{"title":"Improved linear least squares estimation using bounded data uncertainty","authors":"Tarig Ballal, T. Al-Naffouri","doi":"10.1109/ICASSP.2015.7178607","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178607","url":null,"abstract":"This paper addresses the problemof linear least squares (LS) estimation of a vector x from linearly related observations. In spite of being unbiased, the original LS estimator suffers from high mean squared error, especially at low signal-to-noise ratios. The mean squared error (MSE) of the LS estimator can be improved by introducing some form of regularization based on certain constraints. We propose an improved LS (ILS) estimator that approximately minimizes the MSE, without imposing any constraints. To achieve this, we allow for perturbation in the measurement matrix. Then we utilize a bounded data uncertainty (BDU) framework to derive a simple iterative procedure to estimate the regularization parameter. Numerical results demonstrate that the proposed BDU-ILS estimator is superior to the original LS estimator, and it converges to the best linear estimator, the linear-minimum-mean-squared error estimator (LMMSE), when the elements of x are statistically white.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"186 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120864848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 8
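The abstract describes an iterative procedure for choosing the regularization parameter from a bound on the perturbation of the measurement matrix. One plausible fixed-point reading, consistent with the BDU literature but not necessarily the authors' exact update, is sketched below: alternate a ridge-type solve with re-estimating lambda from the residual-to-solution norm ratio scaled by the perturbation bound eta.

```python
import numpy as np

def bdu_ils(A, y, eta, iters=50):
    """Regularized LS with an iteratively estimated parameter.
    The fixed-point rule lam = eta * ||y - A x|| / ||x|| is an
    assumption; the paper's derived update may differ."""
    n = A.shape[1]
    AtA, Aty, lam = A.T @ A, A.T @ y, 0.0
    for _ in range(iters):
        x = np.linalg.solve(AtA + lam * np.eye(n), Aty)
        lam_new = eta * np.linalg.norm(y - A @ x) / max(np.linalg.norm(x), 1e-12)
        if abs(lam_new - lam) < 1e-9:
            break
        lam = lam_new
    return x, lam

# Toy usage at low SNR
rng = np.random.default_rng(3)
A = rng.standard_normal((30, 10))
x0 = rng.standard_normal(10)
y = A @ x0 + 2.0 * rng.standard_normal(30)
x_hat, lam = bdu_ils(A, y, eta=0.5)
```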
Learning the sparsity basis in low-rank plus sparse model for dynamic MRI reconstruction
A. Majumdar, R. Ward
{"title":"Learning the sparsity basis in low-rank plus sparse model for dynamic MRI reconstruction","authors":"A. Majumdar, R. Ward","doi":"10.1109/ICASSP.2015.7178075","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178075","url":null,"abstract":"Modeling a temporal image sequence as a super-position of sparse and low-rank component stems from studies in principal component pursuit (PCP). Recently this technique was applied for dynamic MRI reconstruction with two modifications. First, unlike the original PCP, the problem was to recover the image sequence from under-sampled measurements. Second, the sparse component of the signal was not sparse in itself but in a transform domain. Recent studies in dynamic MRI reconstruction showed that, instead of using a fixed sparsity basis, better recovery results can be achieved when the sparsifying dictionary is adaptively learned from the data using Blind Compressed Sensing (BCS) framework. In this work, we demonstrate that learning the sparsity basis using BCS like techniques improve the recovery accuracy from PCP when applied to dynamic MRI reconstruction problems.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121037832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
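The underlying low-rank-plus-sparse split is easy to demonstrate in the fully sampled, fixed-basis case. The sketch below alternates singular-value thresholding (low-rank part) with soft thresholding (sparse part); the paper's additional ingredients, recovery from undersampled k-space and a BCS-learned sparsity basis for the sparse component, are deliberately omitted, and the thresholds are toy values.

```python
import numpy as np

def svt(X, tau):
    """Singular value thresholding (proximal operator of the nuclear norm)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0)) @ Vt

def soft(X, tau):
    """Soft thresholding (proximal operator of the l1 norm)."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0)

def low_rank_plus_sparse(M, tau_l=1.0, tau_s=0.1, iters=100):
    """Alternately split the (frames x pixels) Casorati matrix M into
    a low-rank L and a sparse S, PCP style."""
    L = np.zeros_like(M)
    S = np.zeros_like(M)
    for _ in range(iters):
        L = svt(M - S, tau_l)
        S = soft(M - L, tau_s)
    return L, S

# Toy dynamic sequence: static background (low rank) + moving blob (sparse)
rng = np.random.default_rng(4)
bg = np.outer(np.ones(20), rng.random(100))   # 20 frames x 100 pixels
dyn = np.zeros((20, 100))
dyn[np.arange(20), np.arange(20) * 3] = 5.0
L, S = low_rank_plus_sparse(bg + dyn)
```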
Face hallucination via Cauchy regularized sparse representation
Shenming Qu, R. Hu, Shihong Chen, Zhongyuan Wang, Junjun Jiang, Cheng Yang
{"title":"Face hallucination via Cauchy regularized sparse representation","authors":"Shenming Qu, R. Hu, Shihong Chen, Zhongyuan Wang, Junjun Jiang, Cheng Yang","doi":"10.1109/ICASSP.2015.7178163","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7178163","url":null,"abstract":"In dictionary-learning-based face hallucination, the testing image is represented as a linear combination of the training samples, and how to obtain the optimal coefficients is the primary issue. Sparse representation (SR) has ever been widely used in face hallucination, however, due to the fact that SR overemphasizes the sparsity, the obtained linear combination coefficients turn out far aggressively sparse, then leading to unsatisfactory hallucinated results. In this paper, we present a moderately sparse prior model for face hallucination problem with the L1 norm penalty in classic SR replaced by a Cauchy penalty term. An iterative optimization is further presented to solve the minimization of Cauchy regularized objective function. The experimental results on public face database demonstrate that our method is much more effective than state-of-the-art methods.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121250226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 7
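Replacing the L1 penalty with a Cauchy penalty, a sum of log(1 + (a_i/gamma)^2) terms, admits a standard iteratively reweighted least squares optimizer, since the penalty is majorized by a weighted quadratic. The sketch below uses that generic scheme; the paper derives its own iterative optimization, which may differ in detail, and lam/gamma here are arbitrary toy values.

```python
import numpy as np

def cauchy_sparse_code(D, y, lam=0.1, gamma=0.1, iters=50):
    """IRLS minimization of ||y - D a||^2 + lam * sum(log(1 + (a/gamma)^2)).
    A standard majorization scheme for the Cauchy penalty, used here
    as a generic stand-in for the paper's optimizer."""
    a = np.zeros(D.shape[1])
    DtD, Dty = D.T @ D, D.T @ y
    for _ in range(iters):
        w = 1.0 / (gamma**2 + a**2)              # reweighting from current estimate
        a = np.linalg.solve(DtD + lam * np.diag(w), Dty)
    return a

# Toy usage: code a test vector over 50 training atoms
rng = np.random.default_rng(5)
D = rng.standard_normal((30, 50))
y = D[:, 3] + 0.01 * rng.standard_normal(30)
a = cauchy_sparse_code(D, y)
print(np.argmax(np.abs(a)))                      # expect atom 3 to dominate
```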
Individualizing a monaural beamformer for cochlear implant users
Waldo Nogueira, Marta Lopez, Thilo Rode, S. Doclo, A. Büchner
{"title":"Individualizing a monaural beamformer for cochlear implant users","authors":"Waldo Nogueira, Marta Lopez, Thilo Rode, S. Doclo, A. Büchner","doi":"10.1109/ICASSP.2015.7179071","DOIUrl":"https://doi.org/10.1109/ICASSP.2015.7179071","url":null,"abstract":"Speech intelligibility in noisy environments is still quite limited for cochlear implant (CI) users. Classical beamformers such as the Generalized Sidelobe Canceller (GSC) can provide large improvements in speech intelligibility for CI users. These algorithms have been adopted from hearing aids and multimedia applications into the CI field. However, their optimization taking into consideration the peculiarities of electrical hearing with a CI has not yet been completely investigated. This paper presents a novel method to optimize the performance of a GSC for each individual CI user. We show through a combination of objective and novel subjective measures, how much distortion can be tolerated by a CI user without decreasing speech intelligibility. Experimental results with 5 CI users show that a GSC delivering just noticeable distortion is the one maximizing speech intelligibility for CI users.","PeriodicalId":117666,"journal":{"name":"2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)","volume":"1119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127107386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 5
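For readers unfamiliar with the GSC structure being individualized, here is a minimal two-microphone sketch: a delay-and-sum fixed beamformer, a difference blocking matrix, and an NLMS adaptive noise canceller. A broadside target is assumed (no steering delays), and the paper's subject-specific distortion tuning is not modelled.

```python
import numpy as np

def gsc_two_mic(x1, x2, taps=16, mu=0.1):
    """Minimal two-microphone Generalized Sidelobe Canceller."""
    d = 0.5 * (x1 + x2)                  # fixed beamformer (delay-and-sum)
    u = x1 - x2                          # blocking matrix output (noise reference)
    w = np.zeros(taps)
    y = np.zeros_like(d)
    for n in range(taps, len(d)):
        u_vec = u[n - taps:n][::-1]
        y[n] = d[n] - w @ u_vec          # sidelobe-cancelled output
        w += mu * y[n] * u_vec / (u_vec @ u_vec + 1e-8)  # NLMS update
    return y

# Toy usage: a common target plus a directional interferer that leaks
# into the two microphones with different gains
rng = np.random.default_rng(6)
s = np.sin(2 * np.pi * 0.01 * np.arange(4000))
v = rng.standard_normal(4000)
out = gsc_two_mic(s + v, s - 0.5 * v)    # interferer is adaptively cancelled
```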