{"title":"Using output probability distribution for oov word rejection","authors":"Shilei Huang, Xiang Xie, Pascale Fung","doi":"10.1109/SLT.2008.4777880","DOIUrl":null,"url":null,"abstract":"This paper proposes a method to calculate the confidence score for out-of-vocabulary (OOV) word verification based on the Output Probability Distribution (OPD) of phoneme HMMs. Compared with input vector for dynamic garbage model, OPD vector contains more information than the sorted probabilities. Confidence score of each phoneme is calculated by SVM with OPD vectors as input. Hypotheses are accepted or rejected based on this confidence score. Experimental results showed that the proposed method achieved lower EER in word verification task than the conventional dynamic garbage model.","PeriodicalId":186876,"journal":{"name":"2008 IEEE Spoken Language Technology Workshop","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE Spoken Language Technology Workshop","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SLT.2008.4777880","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This paper proposes a method to calculate the confidence score for out-of-vocabulary (OOV) word verification based on the Output Probability Distribution (OPD) of phoneme HMMs. Compared with input vector for dynamic garbage model, OPD vector contains more information than the sorted probabilities. Confidence score of each phoneme is calculated by SVM with OPD vectors as input. Hypotheses are accepted or rejected based on this confidence score. Experimental results showed that the proposed method achieved lower EER in word verification task than the conventional dynamic garbage model.