Md Foezur Rahman Chowdhury, S. Selouani, D. O'Shaughnessy
{"title":"使用GMM-UBM扬声器模型的分布式自动文本独立扬声器识别","authors":"Md Foezur Rahman Chowdhury, S. Selouani, D. O'Shaughnessy","doi":"10.1109/CCECE.2009.5090157","DOIUrl":null,"url":null,"abstract":"The ETSI “Aurora” is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digit-based text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorporated into this DSID system to reduce the computational complexities significantly. Experiments on the Aurora2 speech recognition corpus show that GMM-UBM yields excellent performance for speaker recognition over telephone channels. Compared to the baseline system, we got 100% recognition accuracy for this proposed DSID within the ETSI DSR framework.","PeriodicalId":153464,"journal":{"name":"2009 Canadian Conference on Electrical and Computer Engineering","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Distributed automatic text-independent speaker identification using GMM-UBM speaker models\",\"authors\":\"Md Foezur Rahman Chowdhury, S. Selouani, D. O'Shaughnessy\",\"doi\":\"10.1109/CCECE.2009.5090157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The ETSI “Aurora” is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digit-based text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorporated into this DSID system to reduce the computational complexities significantly. Experiments on the Aurora2 speech recognition corpus show that GMM-UBM yields excellent performance for speaker recognition over telephone channels. Compared to the baseline system, we got 100% recognition accuracy for this proposed DSID within the ETSI DSR framework.\",\"PeriodicalId\":153464,\"journal\":{\"name\":\"2009 Canadian Conference on Electrical and Computer Engineering\",\"volume\":\"162 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Canadian Conference on Electrical and Computer Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCECE.2009.5090157\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Canadian Conference on Electrical and Computer Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCECE.2009.5090157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Distributed automatic text-independent speaker identification using GMM-UBM speaker models
The ETSI “Aurora” is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digit-based text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorporated into this DSID system to reduce the computational complexities significantly. Experiments on the Aurora2 speech recognition corpus show that GMM-UBM yields excellent performance for speaker recognition over telephone channels. Compared to the baseline system, we got 100% recognition accuracy for this proposed DSID within the ETSI DSR framework.