A. M. T. S. B. Adikari, S. Devadithya, A. Bandara, K. V. Dharmawardane, K. Wavegedara
{"title":"自动说话人验证技术在法医证据评估中的应用","authors":"A. M. T. S. B. Adikari, S. Devadithya, A. Bandara, K. V. Dharmawardane, K. Wavegedara","doi":"10.1109/ICDSP.2014.6900703","DOIUrl":null,"url":null,"abstract":"When applying speaker verification techniques for forensic evidence evaluation, many challenges arise. One such challenge is the interpretation of the verification results in terms of probability. The output of the existing speaker verification is a ratio between two likelihood values and hence does not have a very meaningful representation. In this paper, we introduce a method which outputs a posterior probability function, such that given the evidence, the probability it came from a certain suspect can be calculated. Another key challenge in forensic evidence evaluation is the practical difficulty in recreating the evidence environment in order to obtain the training voice samples from the suspect. For this we propose a novel approach based on the Baysean probability framework, which uses a pre-computed Universal Background Model (UBM). The speaker verification system is built around Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Gaussian Mixture Models (GMM) for speaker modeling. The optimum values for different parameters of these techniques are found experimentally. Our results demonstrate that a model with 32 mixtures and a dimension of size 13 gives the best performance, as a trade-off between the accuracy and the computational efficiency.","PeriodicalId":301856,"journal":{"name":"2014 19th International Conference on Digital Signal Processing","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Application of automatic speaker verification techniques for forensic evidence evaluation\",\"authors\":\"A. M. T. S. B. Adikari, S. Devadithya, A. Bandara, K. V. Dharmawardane, K. Wavegedara\",\"doi\":\"10.1109/ICDSP.2014.6900703\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When applying speaker verification techniques for forensic evidence evaluation, many challenges arise. One such challenge is the interpretation of the verification results in terms of probability. The output of the existing speaker verification is a ratio between two likelihood values and hence does not have a very meaningful representation. In this paper, we introduce a method which outputs a posterior probability function, such that given the evidence, the probability it came from a certain suspect can be calculated. Another key challenge in forensic evidence evaluation is the practical difficulty in recreating the evidence environment in order to obtain the training voice samples from the suspect. For this we propose a novel approach based on the Baysean probability framework, which uses a pre-computed Universal Background Model (UBM). The speaker verification system is built around Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Gaussian Mixture Models (GMM) for speaker modeling. The optimum values for different parameters of these techniques are found experimentally. Our results demonstrate that a model with 32 mixtures and a dimension of size 13 gives the best performance, as a trade-off between the accuracy and the computational efficiency.\",\"PeriodicalId\":301856,\"journal\":{\"name\":\"2014 19th International Conference on Digital Signal Processing\",\"volume\":\"58 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 19th International Conference on Digital Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2014.6900703\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 19th International Conference on Digital Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2014.6900703","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of automatic speaker verification techniques for forensic evidence evaluation
When applying speaker verification techniques for forensic evidence evaluation, many challenges arise. One such challenge is the interpretation of the verification results in terms of probability. The output of the existing speaker verification is a ratio between two likelihood values and hence does not have a very meaningful representation. In this paper, we introduce a method which outputs a posterior probability function, such that given the evidence, the probability it came from a certain suspect can be calculated. Another key challenge in forensic evidence evaluation is the practical difficulty in recreating the evidence environment in order to obtain the training voice samples from the suspect. For this we propose a novel approach based on the Baysean probability framework, which uses a pre-computed Universal Background Model (UBM). The speaker verification system is built around Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Gaussian Mixture Models (GMM) for speaker modeling. The optimum values for different parameters of these techniques are found experimentally. Our results demonstrate that a model with 32 mixtures and a dimension of size 13 gives the best performance, as a trade-off between the accuracy and the computational efficiency.