Md Foezur Rahman Chowdhury, S. Selouani, D. O'Shaughnessy
{"title":"Distributed automatic text-independent speaker identification using GMM-UBM speaker models","authors":"Md Foezur Rahman Chowdhury, S. Selouani, D. O'Shaughnessy","doi":"10.1109/CCECE.2009.5090157","DOIUrl":null,"url":null,"abstract":"The ETSI “Aurora” is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digit-based text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorporated into this DSID system to reduce the computational complexities significantly. Experiments on the Aurora2 speech recognition corpus show that GMM-UBM yields excellent performance for speaker recognition over telephone channels. Compared to the baseline system, we got 100% recognition accuracy for this proposed DSID within the ETSI DSR framework.","PeriodicalId":153464,"journal":{"name":"2009 Canadian Conference on Electrical and Computer Engineering","volume":"162 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Canadian Conference on Electrical and Computer Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCECE.2009.5090157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
The ETSI “Aurora” is a digit-based standard developed for distributed speech recognition (DSR) over telephone communication channels. This paper introduces a digit-based text-independent distributed speaker identification (DSID) system over telephone channels within the DSR framework. In this DSID system, the hypothesized speaker model is derived by GMM-UBM model training using Aurora2 connected digit training speech data and maximum a posteriori (MAP) adaptation. The UBM technique for speaker models is incorporated into this DSID system to reduce the computational complexities significantly. Experiments on the Aurora2 speech recognition corpus show that GMM-UBM yields excellent performance for speaker recognition over telephone channels. Compared to the baseline system, we got 100% recognition accuracy for this proposed DSID within the ETSI DSR framework.