{"title":"音译和机器翻译的自动机","authors":"Kevin Knight","doi":"10.3115/1699705.1699710","DOIUrl":null,"url":null,"abstract":"Automata theory, transliteration, and machine translation (MT) have an interesting and intertwined history. \n \nFinite-state string automata theory became a powerful tool for speech and language after the introduction of the ATT furthermore, these machines can be pipelined to attack complex problems like speech recognition. Likewise, n-gram models can be captured by finite-state acceptors, which can be reused across applications. \n \nIt is possible to mix, match, and compose transducers to flexibly solve all kinds of problems. One such problem is transliteration, which can be modeled as a pipeline of string transformations. MT has also been modeled with transducers, and descendants of the FSM toolkit are now used to implement phrase-based machine translation. Even speech recognizers and MT systems can themselves be composed to deliver speech-to-speech MT. \n \nThe main rub with finite-state string MT is word re-ordering. Tree transducers offer a natural mechanism to solve this problem, and they have recently been employed with some success. \n \nIn this talk, we will survey these ideas (and their origins), and we will finish with a discussion of how transliteration and MT can work together.","PeriodicalId":262513,"journal":{"name":"NEWS@IJCNLP","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automata for Transliteration and Machine Translation\",\"authors\":\"Kevin Knight\",\"doi\":\"10.3115/1699705.1699710\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automata theory, transliteration, and machine translation (MT) have an interesting and intertwined history. \\n \\nFinite-state string automata theory became a powerful tool for speech and language after the introduction of the ATT furthermore, these machines can be pipelined to attack complex problems like speech recognition. Likewise, n-gram models can be captured by finite-state acceptors, which can be reused across applications. \\n \\nIt is possible to mix, match, and compose transducers to flexibly solve all kinds of problems. One such problem is transliteration, which can be modeled as a pipeline of string transformations. MT has also been modeled with transducers, and descendants of the FSM toolkit are now used to implement phrase-based machine translation. Even speech recognizers and MT systems can themselves be composed to deliver speech-to-speech MT. \\n \\nThe main rub with finite-state string MT is word re-ordering. Tree transducers offer a natural mechanism to solve this problem, and they have recently been employed with some success. \\n \\nIn this talk, we will survey these ideas (and their origins), and we will finish with a discussion of how transliteration and MT can work together.\",\"PeriodicalId\":262513,\"journal\":{\"name\":\"NEWS@IJCNLP\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-08-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NEWS@IJCNLP\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3115/1699705.1699710\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NEWS@IJCNLP","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3115/1699705.1699710","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automata for Transliteration and Machine Translation
Automata theory, transliteration, and machine translation (MT) have an interesting and intertwined history.
Finite-state string automata theory became a powerful tool for speech and language after the introduction of the ATT furthermore, these machines can be pipelined to attack complex problems like speech recognition. Likewise, n-gram models can be captured by finite-state acceptors, which can be reused across applications.
It is possible to mix, match, and compose transducers to flexibly solve all kinds of problems. One such problem is transliteration, which can be modeled as a pipeline of string transformations. MT has also been modeled with transducers, and descendants of the FSM toolkit are now used to implement phrase-based machine translation. Even speech recognizers and MT systems can themselves be composed to deliver speech-to-speech MT.
The main rub with finite-state string MT is word re-ordering. Tree transducers offer a natural mechanism to solve this problem, and they have recently been employed with some success.
In this talk, we will survey these ideas (and their origins), and we will finish with a discussion of how transliteration and MT can work together.