{"title":"A statistical approach to multilingual phonetic transcription","authors":"Stefan Besling","doi":"10.1016/0165-5817(96)81586-5","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper we present a statistical method for generating phonetic transcriptions from written words. It uses Bayes' decision rule to find the most likely phonetic transcription. The method is illustrated by applying it to English, French and German. For these three languages it produces transcriptions that differ from the correct ones in at most two phonemes for more than 97% of all words. An advantage of the statistical approach lies in the fact that phonotactical knowledge is automatically learned from background lexica and does not have to be explicitly coded. Thus, the system is basically language independent.</p></div>","PeriodicalId":101018,"journal":{"name":"Philips Journal of Research","volume":"49 4","pages":"Pages 367-379"},"PeriodicalIF":0.0000,"publicationDate":"1995-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/0165-5817(96)81586-5","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Philips Journal of Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/0165581796815865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper we present a statistical method for generating phonetic transcriptions from written words. It uses Bayes' decision rule to find the most likely phonetic transcription. The method is illustrated by applying it to English, French and German. For these three languages it produces transcriptions that differ from the correct ones in at most two phonemes for more than 97% of all words. An advantage of the statistical approach lies in the fact that phonotactical knowledge is automatically learned from background lexica and does not have to be explicitly coded. Thus, the system is basically language independent.