Variants of cepstrum based speaker identity verification

ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing Pub Date : 1988-04-11 DOI:10.1109/ICASSP.1988.196652

G. Velius

引用次数: 23

Abstract

Analysis parameters and various distance measures are investigated for a template matching scheme for speaker identity verification (SIV). Two parameters are systematically varied-the length of the signal analysis window, and the order of the linear predictive coding/-cepstrum analysis. Computational costs associated with the choice of parameters are also considered. The distance measures tested are the Euclidean, inverse variance weighting, differential mean weighting, Kahn's simplified weighting, the Mahalanobis distance, and the Fisher linear discriminant. Using the equal error rate (EER) of pairwise utterance dissimilarity distributions, performance is estimated for prespecified and (a simulation of) user-determined input vocabulary. Performance varies significantly across vocabulary, and average performance is approximately 5% EER for the better algorithms on telephone speech.<>

查看原文本刊更多论文

基于倒频谱的说话人身份验证变体

研究了一种说话人身份验证模板匹配方案的分析参数和各种距离度量。系统地改变了两个参数——信号分析窗口的长度和线性预测编码/倒谱分析的阶数。还考虑了与参数选择相关的计算成本。测试的距离度量有欧几里得加权、方差逆加权、微分均值加权、Kahn简化加权、马氏距离和Fisher线性判别。使用相等错误率(EER)的两两话语不相似分布，性能估计预先指定和(模拟)用户确定的输入词汇。不同词汇的表现差异很大，较好的算法在电话语音上的平均表现约为5% EER

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing

自引率

0.00%

发文量