{"title":"用类比法改进发音概率估计","authors":"J. Kujala, A. Nandi","doi":"10.5281/ZENODO.43156","DOIUrl":null,"url":null,"abstract":"Pronunciation by Analogy is a method for generating phonetic transcriptions for previously unseen written words based on matching substrings of known words and their pronunciations. The method inherently generates several candidate pronunciations and a multitude of heuristics have been proposed for choosing the best one. In [1], a theoretically justified probabilistic approach for scoring the pronunciations was proposed, with performance on par with the best heuristic methods. However, a certain ad hoc modification - a fractional power applied to the estimated probabilities of the substring pronunciations - was also found to improve performance. In this article, we give an explanation for this unexpected improvement. We show that the fractional power in fact improves the estimates of the candidate pronunciation probabilities. This also gives an indirect explanation of the good performance of the current best heuristic proposed in [2].","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improved estimation of probabilities in pronunciation by Analogy\",\"authors\":\"J. Kujala, A. Nandi\",\"doi\":\"10.5281/ZENODO.43156\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pronunciation by Analogy is a method for generating phonetic transcriptions for previously unseen written words based on matching substrings of known words and their pronunciations. The method inherently generates several candidate pronunciations and a multitude of heuristics have been proposed for choosing the best one. In [1], a theoretically justified probabilistic approach for scoring the pronunciations was proposed, with performance on par with the best heuristic methods. However, a certain ad hoc modification - a fractional power applied to the estimated probabilities of the substring pronunciations - was also found to improve performance. In this article, we give an explanation for this unexpected improvement. We show that the fractional power in fact improves the estimates of the candidate pronunciation probabilities. This also gives an indirect explanation of the good performance of the current best heuristic proposed in [2].\",\"PeriodicalId\":201182,\"journal\":{\"name\":\"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5281/ZENODO.43156\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.43156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved estimation of probabilities in pronunciation by Analogy
Pronunciation by Analogy is a method for generating phonetic transcriptions for previously unseen written words based on matching substrings of known words and their pronunciations. The method inherently generates several candidate pronunciations and a multitude of heuristics have been proposed for choosing the best one. In [1], a theoretically justified probabilistic approach for scoring the pronunciations was proposed, with performance on par with the best heuristic methods. However, a certain ad hoc modification - a fractional power applied to the estimated probabilities of the substring pronunciations - was also found to improve performance. In this article, we give an explanation for this unexpected improvement. We show that the fractional power in fact improves the estimates of the candidate pronunciation probabilities. This also gives an indirect explanation of the good performance of the current best heuristic proposed in [2].