基于网格密度的说话人特征分类

Anti-counterfeiting, Security, and Identification Pub Date : 2012-10-11 DOI:10.1109/ICASID.2012.6325282

Lin Li, Wei Wang, Shan He

{"title":"基于网格密度的说话人特征分类","authors":"Lin Li, Wei Wang, Shan He","doi":"10.1109/ICASID.2012.6325282","DOIUrl":null,"url":null,"abstract":"A new strategy of feature classification method for speaker recognition based on the grid-density clustering is presented. According to the concept of density-based and grid-distance-based distribution in the Mel-frequency cepstrum domain, the feature vectors of each speaker were self-adaptively classified into L clusters with less overlapped. With these convex and non-interwoven clusters, the Gaussian Mixture Model could statistically analyze and estimate the distinct feature classification for each speaker. Moreover, a new speaker recognition system was established by using GMM-UBM model. The experimental results showed that the clustering effect of the proposed method was superior to the K-means plus EM clustering method, and the proposed speaker recognition system achieves better classification performance in terms of verification accuracy and computational complexity.","PeriodicalId":408223,"journal":{"name":"Anti-counterfeiting, Security, and Identification","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Grid-density based feature classification for speaker recognition\",\"authors\":\"Lin Li, Wei Wang, Shan He\",\"doi\":\"10.1109/ICASID.2012.6325282\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A new strategy of feature classification method for speaker recognition based on the grid-density clustering is presented. According to the concept of density-based and grid-distance-based distribution in the Mel-frequency cepstrum domain, the feature vectors of each speaker were self-adaptively classified into L clusters with less overlapped. With these convex and non-interwoven clusters, the Gaussian Mixture Model could statistically analyze and estimate the distinct feature classification for each speaker. Moreover, a new speaker recognition system was established by using GMM-UBM model. The experimental results showed that the clustering effect of the proposed method was superior to the K-means plus EM clustering method, and the proposed speaker recognition system achieves better classification performance in terms of verification accuracy and computational complexity.\",\"PeriodicalId\":408223,\"journal\":{\"name\":\"Anti-counterfeiting, Security, and Identification\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Anti-counterfeiting, Security, and Identification\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASID.2012.6325282\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Anti-counterfeiting, Security, and Identification","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASID.2012.6325282","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

提出了一种基于网格密度聚类的说话人识别特征分类方法。根据mel频率倒谱域中基于密度和基于网格距离的分布概念，将每个说话人的特征向量自适应地划分为重叠较少的L类。利用这些凸聚类和非交织聚类，高斯混合模型可以统计分析和估计每个说话人的不同特征分类。在此基础上，利用GMM-UBM模型建立了新的说话人识别系统。实验结果表明，所提方法的聚类效果优于K-means + EM聚类方法，所提说话人识别系统在验证精度和计算复杂度方面都取得了更好的分类性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Grid-density based feature classification for speaker recognition

A new strategy of feature classification method for speaker recognition based on the grid-density clustering is presented. According to the concept of density-based and grid-distance-based distribution in the Mel-frequency cepstrum domain, the feature vectors of each speaker were self-adaptively classified into L clusters with less overlapped. With these convex and non-interwoven clusters, the Gaussian Mixture Model could statistically analyze and estimate the distinct feature classification for each speaker. Moreover, a new speaker recognition system was established by using GMM-UBM model. The experimental results showed that the clustering effect of the proposed method was superior to the K-means plus EM clustering method, and the proposed speaker recognition system achieves better classification performance in terms of verification accuracy and computational complexity.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Anti-counterfeiting, Security, and Identification

自引率

0.00%

发文量