{"title":"利用感知线谱对频率进行说话人识别","authors":"Md. Sahidullah, G. Saha","doi":"10.1109/NCC.2010.5430208","DOIUrl":null,"url":null,"abstract":"Line Spectral Pairs Frequencies (LSFs) provide an alternative representation of the linear prediction coefficients. In this paper an investigation is carried out for extracting feature for speaker identification task which is based on perceptual analysis of speech signal and LSF. A modified version of the standard perceptual analysis is applied to obtain better performance. We have extracted the conventional LSF from the perceptually modified speech signal. State-of-the art Gaussian Mixture Model (GMM) based classifier is employed to design the closed set speaker identification system. The proposed method shows significant performance improvement over existing techniques in three different speech corpuses.","PeriodicalId":130953,"journal":{"name":"2010 National Conference On Communications (NCC)","volume":"20 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"On the use of perceptual Line Spectral Pairs Frequencies for speaker identification\",\"authors\":\"Md. Sahidullah, G. Saha\",\"doi\":\"10.1109/NCC.2010.5430208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Line Spectral Pairs Frequencies (LSFs) provide an alternative representation of the linear prediction coefficients. In this paper an investigation is carried out for extracting feature for speaker identification task which is based on perceptual analysis of speech signal and LSF. A modified version of the standard perceptual analysis is applied to obtain better performance. We have extracted the conventional LSF from the perceptually modified speech signal. State-of-the art Gaussian Mixture Model (GMM) based classifier is employed to design the closed set speaker identification system. The proposed method shows significant performance improvement over existing techniques in three different speech corpuses.\",\"PeriodicalId\":130953,\"journal\":{\"name\":\"2010 National Conference On Communications (NCC)\",\"volume\":\"20 2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-03-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 National Conference On Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2010.5430208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 National Conference On Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2010.5430208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the use of perceptual Line Spectral Pairs Frequencies for speaker identification
Line Spectral Pairs Frequencies (LSFs) provide an alternative representation of the linear prediction coefficients. In this paper an investigation is carried out for extracting feature for speaker identification task which is based on perceptual analysis of speech signal and LSF. A modified version of the standard perceptual analysis is applied to obtain better performance. We have extracted the conventional LSF from the perceptually modified speech signal. State-of-the art Gaussian Mixture Model (GMM) based classifier is employed to design the closed set speaker identification system. The proposed method shows significant performance improvement over existing techniques in three different speech corpuses.