{"title":"使用多锥度MFCC和高斯混合模型的说话人识别和噪声语音验证","authors":"Dominic Mathew","doi":"10.1109/PICC.2015.7455806","DOIUrl":null,"url":null,"abstract":"The two major applications of speaker recognition applications are speaker verification and speaker identification. But in most of the cases the signal is corrupted with background interferences such as noise and echo. This paper proposes the method of speaker recognition and identification after the noise separation. Support Vector Machine(SVM) classification based signal separation is adopted here. MFCC and Multitaper MFCC are used for feature extraction. Despite having low bias, MFCC has large variance. One promising technique for reducing the variance is to replace Hamming windowed spectrum with a multi-taper spectrum estimate. Gaussian Mixture models along with Universal Background Model(UBM) is used for speaker verification and identification tasks.","PeriodicalId":373395,"journal":{"name":"2015 International Conference on Power, Instrumentation, Control and Computing (PICC)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian Mixture models\",\"authors\":\"Dominic Mathew\",\"doi\":\"10.1109/PICC.2015.7455806\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The two major applications of speaker recognition applications are speaker verification and speaker identification. But in most of the cases the signal is corrupted with background interferences such as noise and echo. This paper proposes the method of speaker recognition and identification after the noise separation. Support Vector Machine(SVM) classification based signal separation is adopted here. MFCC and Multitaper MFCC are used for feature extraction. Despite having low bias, MFCC has large variance. One promising technique for reducing the variance is to replace Hamming windowed spectrum with a multi-taper spectrum estimate. Gaussian Mixture models along with Universal Background Model(UBM) is used for speaker verification and identification tasks.\",\"PeriodicalId\":373395,\"journal\":{\"name\":\"2015 International Conference on Power, Instrumentation, Control and Computing (PICC)\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Power, Instrumentation, Control and Computing (PICC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PICC.2015.7455806\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Power, Instrumentation, Control and Computing (PICC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PICC.2015.7455806","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speaker identification and verification of noisy speech using multitaper MFCC and Gaussian Mixture models
The two major applications of speaker recognition applications are speaker verification and speaker identification. But in most of the cases the signal is corrupted with background interferences such as noise and echo. This paper proposes the method of speaker recognition and identification after the noise separation. Support Vector Machine(SVM) classification based signal separation is adopted here. MFCC and Multitaper MFCC are used for feature extraction. Despite having low bias, MFCC has large variance. One promising technique for reducing the variance is to replace Hamming windowed spectrum with a multi-taper spectrum estimate. Gaussian Mixture models along with Universal Background Model(UBM) is used for speaker verification and identification tasks.