{"title":"A study of applying subspace based pronunciation modeling in verifying pronunciation accuracy","authors":"Shou-Chun Yin, R. Rose, Yun Tang","doi":"10.1109/ISSPA.2012.6310622","DOIUrl":null,"url":null,"abstract":"This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers' utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.","PeriodicalId":248763,"journal":{"name":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPA.2012.6310622","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates a new approach for detecting phoneme level mispronunciations from utterances obtained from impaired children with neuromuscular disorders. This new pronunciation verification (PV) approach is obtained from the subspace based Gaussian mixture model (SGMM) based pronunciation model, where a set of state level projection vectors is applied for representing phonetic variability. SGMM models are trained from disabled speakers' utterances and PV scores are computed directly from distances between disabled and reference speaker projection vectors. An experimental study was performed to evaluate the performance of the SGMM based approach with respect to an approach based on the lattice posterior probabilities. A reduction in equal error rate (EER) of approximately 15% was obtained when the SGMM based scores were combined with lattice posterior probabilities.