Robust speaker recognition based on biologically inspired features
Authors: Youssef Zouhir, I. Fredj, K. Ouni, Mohamed Zarka
Journal: International Journal of Signal and Imaging Systems Engineering
Published: 2020-01-01
DOI: 10.1504/IJSISE.2020.10036131 (https://doi.org/10.1504/IJSISE.2020.10036131)
This paper proposes two speech parameterisation techniques for noise-robust speaker recognition: normalised gammachirp cepstral coefficients (NGCC) and perceptual linear predictive normalised gammachirp (PLPnGc). Both techniques employ a biologically inspired auditory model that simulates the spectral behaviour of the cochlea. Within the automatic speaker recognition (ASR) system, speakers are modelled with a Gaussian mixture model-universal background model (GMM-UBM). Performance is evaluated in clean and noisy environments using the TIMIT, Aurora, and DEMAND databases. In noisy environments, the experimental results show that the biologically inspired feature extraction techniques achieve a higher recognition rate than state-of-the-art methods.
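
To make the auditory-model idea more concrete, the following is a minimal sketch of a sampled gammachirp impulse response, the filter shape commonly used to approximate cochlear frequency selectivity. It follows the widely used textbook form g(t) = t^(n-1) exp(-2*pi*b*ERB(fc)*t) cos(2*pi*fc*t + c*ln(t) + phase), with the Glasberg and Moore ERB approximation. The function names, the parameter defaults (n=4, b=1.019, c=-2.0) and the peak normalisation are illustrative assumptions; the abstract does not specify the normalisation or filterbank settings behind NGCC and PLPnGc, so this should not be read as the authors' implementation.

```python
import math


def erb(fc_hz):
    """Equivalent rectangular bandwidth in Hz (Glasberg and Moore approximation)."""
    return 24.7 + 0.108 * fc_hz


def gammachirp(fc_hz, fs_hz, duration_s=0.025, n=4, b=1.019, c=-2.0, phase=0.0):
    """Return a peak-normalised, sampled gammachirp impulse response.

    All default parameter values are illustrative assumptions, not values
    taken from the paper.
    """
    num_samples = round(duration_s * fs_hz)
    samples = []
    # Start at k = 1 so that ln(t) in the chirp term is defined (t > 0).
    for k in range(1, num_samples + 1):
        t = k / fs_hz
        envelope = t ** (n - 1) * math.exp(-2.0 * math.pi * b * erb(fc_hz) * t)
        carrier = math.cos(2.0 * math.pi * fc_hz * t + c * math.log(t) + phase)
        samples.append(envelope * carrier)
    # Scale so the largest absolute sample equals 1 (an assumed normalisation).
    peak = max(abs(s) for s in samples)
    return [s / peak for s in samples]


if __name__ == "__main__":
    # One channel centred at 1 kHz, sampled at 16 kHz.
    h = gammachirp(fc_hz=1000.0, fs_hz=16000.0)
    print(len(h), "samples; ERB at 1 kHz =", erb(1000.0), "Hz")
```

In a cepstral front end of this kind, a bank of such filters at different centre frequencies would typically replace the mel filterbank before the log and discrete cosine transform steps; the number and spacing of channels is left open here because the abstract does not state them.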