{"title":"宽带语音中说话人信息的意义","authors":"G. Pradhan, S. R. Mahadeva Prasanna","doi":"10.1109/NCC.2011.5734710","DOIUrl":null,"url":null,"abstract":"In this work, speech signal having information up to 4 kHz is termed as narrowband (NB) speech and the other having information up to 8 kHz is termed as wideband (WB) speech. The objective is to demonstrate the significance of speaker information present in the WB speech. A speaker verification (SV) system is developed using the mel-frequency cepstral coefficients (MFCCs) computed from the WB speech and modeled using Gaussian mixture models (GMM). For comparison, a SV system is also developed from the corresponding NB speech. The experimental results show that the SV performance improves for WB speech and the improvement is significant under degraded conditions. Further, the performance improvement is better for female speakers.","PeriodicalId":158295,"journal":{"name":"2011 National Conference on Communications (NCC)","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Significance of speaker information in wideband speech\",\"authors\":\"G. Pradhan, S. R. Mahadeva Prasanna\",\"doi\":\"10.1109/NCC.2011.5734710\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work, speech signal having information up to 4 kHz is termed as narrowband (NB) speech and the other having information up to 8 kHz is termed as wideband (WB) speech. The objective is to demonstrate the significance of speaker information present in the WB speech. A speaker verification (SV) system is developed using the mel-frequency cepstral coefficients (MFCCs) computed from the WB speech and modeled using Gaussian mixture models (GMM). For comparison, a SV system is also developed from the corresponding NB speech. The experimental results show that the SV performance improves for WB speech and the improvement is significant under degraded conditions. Further, the performance improvement is better for female speakers.\",\"PeriodicalId\":158295,\"journal\":{\"name\":\"2011 National Conference on Communications (NCC)\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2011.5734710\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2011.5734710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Significance of speaker information in wideband speech
In this work, speech signal having information up to 4 kHz is termed as narrowband (NB) speech and the other having information up to 8 kHz is termed as wideband (WB) speech. The objective is to demonstrate the significance of speaker information present in the WB speech. A speaker verification (SV) system is developed using the mel-frequency cepstral coefficients (MFCCs) computed from the WB speech and modeled using Gaussian mixture models (GMM). For comparison, a SV system is also developed from the corresponding NB speech. The experimental results show that the SV performance improves for WB speech and the improvement is significant under degraded conditions. Further, the performance improvement is better for female speakers.