{"title":"A comparison study on contextual modeling for estimating functional loads of phonological contrasts","authors":"Bin Wu, Yanlu Xie, Jinsong Zhang","doi":"10.1109/ICSDA.2015.7357886","DOIUrl":null,"url":null,"abstract":"Functional load (FL) is the quantitative measure of the importance of phonological contrasts, which stand for the differentiation of communicative linguistic units. Correct estimate of FLs is useful for the studies of speech recognition, language evolution, language teaching and etc. Conventional approaches use phonological transcriptions and unigram probabilities for the estimation, hence weak in contextual modeling. Based on the measurement of mutual information (MI) between the text and its phonological transcription, we previously proposed a novel FL measurement which utilizes n-gram word probabilities, hence owing better context modeling power. In this study, we compare the effects of different context on the estimation of FL: syllable, word, n-gram word model, and open data. Experimental results show: the wider the context modeling, the smaller the FL; FL based on MI with the trigram model achieves the best performance in modeling the context in our experiments. Compared with FL based on entropy, FL based on MI showed smaller value and is applicable to open data.","PeriodicalId":290790,"journal":{"name":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2015.7357886","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Functional load (FL) is the quantitative measure of the importance of phonological contrasts, which stand for the differentiation of communicative linguistic units. Correct estimate of FLs is useful for the studies of speech recognition, language evolution, language teaching and etc. Conventional approaches use phonological transcriptions and unigram probabilities for the estimation, hence weak in contextual modeling. Based on the measurement of mutual information (MI) between the text and its phonological transcription, we previously proposed a novel FL measurement which utilizes n-gram word probabilities, hence owing better context modeling power. In this study, we compare the effects of different context on the estimation of FL: syllable, word, n-gram word model, and open data. Experimental results show: the wider the context modeling, the smaller the FL; FL based on MI with the trigram model achieves the best performance in modeling the context in our experiments. Compared with FL based on entropy, FL based on MI showed smaller value and is applicable to open data.