{"title":"Phone-dependent transformation of posterior probability measure for automatic pronunciation quality evaluation","authors":"Ke Yan","doi":"10.1109/IWECA.2014.6845702","DOIUrl":null,"url":null,"abstract":"Posterior probability measure is widely accepted as the most promising feature for automatic pronunciation quality evaluation. However, this measure is not phonetically consistent. This work presents a novel trainable phone-dependent transformation of posterior probability to deal with the problem. Both linear and non-linear transforms are investigated. Close form solution is found for linear transformation and gradient-based method is derived for nonlinear transformation. Experimental results on the database of 3685 people showed significant improvement. The cross-correlation between human and machine scores increases from 0.582 to 0.760.","PeriodicalId":383024,"journal":{"name":"2014 IEEE Workshop on Electronics, Computer and Applications","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Workshop on Electronics, Computer and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWECA.2014.6845702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Posterior probability measure is widely accepted as the most promising feature for automatic pronunciation quality evaluation. However, this measure is not phonetically consistent. This work presents a novel trainable phone-dependent transformation of posterior probability to deal with the problem. Both linear and non-linear transforms are investigated. Close form solution is found for linear transformation and gradient-based method is derived for nonlinear transformation. Experimental results on the database of 3685 people showed significant improvement. The cross-correlation between human and machine scores increases from 0.582 to 0.760.