{"title":"Robust speaker identification system using multi-band dominant features with empirical mode decomposition","authors":"M. Molla, K. Hirose, N. Minematsu","doi":"10.1109/ICCITECHN.2007.4579395","DOIUrl":null,"url":null,"abstract":"This paper presents a text independent speaker identification system using multi-band features with artificial neural network. Linear predictive cepstrum coefficients (LPCCs) computed from sub-band signals with higher order statistics (HOS) are employed as the main features to represent the speaker characteristics. The multi-band representation of the speech signal is implemented by empirical mode decomposition (EMD). Dominant feature vectors are derived by applying principal component analysis (PCA) on LPCC space computed from the speech signal. The experimental results show that the proposed system improves the speaker identification performance. The efficiency is also compared for different features with noisy speech signals.","PeriodicalId":338170,"journal":{"name":"2007 10th international conference on computer and information technology","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 10th international conference on computer and information technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCITECHN.2007.4579395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper presents a text independent speaker identification system using multi-band features with artificial neural network. Linear predictive cepstrum coefficients (LPCCs) computed from sub-band signals with higher order statistics (HOS) are employed as the main features to represent the speaker characteristics. The multi-band representation of the speech signal is implemented by empirical mode decomposition (EMD). Dominant feature vectors are derived by applying principal component analysis (PCA) on LPCC space computed from the speech signal. The experimental results show that the proposed system improves the speaker identification performance. The efficiency is also compared for different features with noisy speech signals.