Source and system features for text independent speaker identification using iterative clustering approach

2009 IEEE International Conference on Signal and Image Processing Applications Pub Date : 2009-11-01 DOI:10.1109/ICSIPA.2009.5478637

A. Revathi, Y. Venkataramani

{"title":"Source and system features for text independent speaker identification using iterative clustering approach","authors":"A. Revathi, Y. Venkataramani","doi":"10.1109/ICSIPA.2009.5478637","DOIUrl":null,"url":null,"abstract":"The main objective of this paper is to explore the effectiveness of perceptual features combined with pitch for text independent speaker recognition. The proposed combined features are captured and training models are developed by K-means clustering procedure. Speaker recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in performing speaker recognition based on minimum distance between test features and clusters. This algorithm gives the overall accuracy of 99.675% and 98.75% for the combined features and perceptual features respectively for identifying speaker among 8 speakers chosen randomly from 8 different dialect regions in “TIMIT” database. It also gives average accuracy of 96.375% and 95.625% for perceptual linear predictive cepstrum combined with pitch and perceptual linear predictive cepstrum respectively for 8 speakers chosen randomly from the same dialect region. The noteworthy feature of speaker identification algorithm is to evaluate the testing procedure on identical messages for all speakers. In this work, Fratio is computed as a theoretical measure to validate the experimental results on speaker recognition.","PeriodicalId":400165,"journal":{"name":"2009 IEEE International Conference on Signal and Image Processing Applications","volume":"400 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE International Conference on Signal and Image Processing Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSIPA.2009.5478637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The main objective of this paper is to explore the effectiveness of perceptual features combined with pitch for text independent speaker recognition. The proposed combined features are captured and training models are developed by K-means clustering procedure. Speaker recognition system is evaluated on clean test speeches and the experimental results reveal the performance of the proposed algorithm in performing speaker recognition based on minimum distance between test features and clusters. This algorithm gives the overall accuracy of 99.675% and 98.75% for the combined features and perceptual features respectively for identifying speaker among 8 speakers chosen randomly from 8 different dialect regions in “TIMIT” database. It also gives average accuracy of 96.375% and 95.625% for perceptual linear predictive cepstrum combined with pitch and perceptual linear predictive cepstrum respectively for 8 speakers chosen randomly from the same dialect region. The noteworthy feature of speaker identification algorithm is to evaluate the testing procedure on identical messages for all speakers. In this work, Fratio is computed as a theoretical measure to validate the experimental results on speaker recognition.

查看原文本刊更多论文

使用迭代聚类方法进行文本独立说话人识别的源和系统特征

本文的主要目的是探讨将感知特征与音高相结合用于文本独立说话人识别的有效性。通过K-means聚类过程捕获所提出的组合特征并建立训练模型。用干净的测试语音对说话人识别系统进行了评估，实验结果表明了该算法在基于测试特征与聚类之间最小距离的说话人识别方面的性能。该算法对从“TIMIT”数据库中8个不同方言区域随机抽取的8个说话人进行识别，组合特征和感知特征的总体准确率分别为99.675%和98.75%。从同一方言区域随机选取8名说话人，结合音高的感知线性预测倒谱和感知线性预测倒谱的平均准确率分别为96.375%和95.625%。说话人识别算法值得注意的特点是对所有说话人对相同消息的测试过程进行评估。在本工作中，计算了比率作为理论度量来验证说话人识别的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE International Conference on Signal and Image Processing Applications

自引率

0.00%

发文量