NEWS@IJCNLP最新文献_第4页

Phonological Context Approximation and Homophone Treatment for NEWS 2009 English-Chinese Transliteration Shared Task NEWS 2009中英文音译共享任务的语音语境逼近与同音字处理

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699725

O. Kwong

引用次数: 3

Automata for Transliteration and Machine Translation 音译和机器翻译的自动机

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699710

Kevin Knight

{"title":"Automata for Transliteration and Machine Translation","authors":"Kevin Knight","doi":"10.3115/1699705.1699710","DOIUrl":"https://doi.org/10.3115/1699705.1699710","url":null,"abstract":"Automata theory, transliteration, and machine translation (MT) have an interesting and intertwined history. \u0000 \u0000Finite-state string automata theory became a powerful tool for speech and language after the introduction of the ATT furthermore, these machines can be pipelined to attack complex problems like speech recognition. Likewise, n-gram models can be captured by finite-state acceptors, which can be reused across applications. \u0000 \u0000It is possible to mix, match, and compose transducers to flexibly solve all kinds of problems. One such problem is transliteration, which can be modeled as a pipeline of string transformations. MT has also been modeled with transducers, and descendants of the FSM toolkit are now used to implement phrase-based machine translation. Even speech recognizers and MT systems can themselves be composed to deliver speech-to-speech MT. \u0000 \u0000The main rub with finite-state string MT is word re-ordering. Tree transducers offer a natural mechanism to solve this problem, and they have recently been employed with some success. \u0000 \u0000In this talk, we will survey these ideas (and their origins), and we will finish with a discussion of how transliteration and MT can work together.","PeriodicalId":262513,"journal":{"name":"NEWS@IJCNLP","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116278892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A Hybrid Approach to English-Korean Name Transliteration 英韩姓名音译的混合方法

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699733

Gumwon Hong, Min-Jeong Kim, Do-Gil Lee, Hae-Chang Rim

引用次数: 19

Report of NEWS 2009 Machine Transliteration Shared Task NEWS 2009机器音译共享任务报告

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699707

Haizhou Li, A. Kumaran, V. Pervouchine, Min Zhang

{"title":"Report of NEWS 2009 Machine Transliteration Shared Task","authors":"Haizhou Li, A. Kumaran, V. Pervouchine, Min Zhang","doi":"10.3115/1699705.1699707","DOIUrl":"https://doi.org/10.3115/1699705.1699707","url":null,"abstract":"This report documents the details of the Machine Transliteration Shared Task conducted as a part of the Named Entities Workshop (NEWS), an ACL-IJCNLP 2009 workshop. The shared task features machine transliteration of proper names from English to a set of languages. This shared task has witnessed enthusiastic participation of 31 teams from all over the world, with diversity of participation for a given system and wide coverage for a given language pair (more than a dozen participants per language pair). Diverse transliteration methodologies are represented adequately in the shared task for a given language pair, thus underscoring the fact that the workshop may truly indicate the state of the art in machine transliteration in these language pairs. We measure and report 6 performance metrics on the submitted results. We believe that the shared task has successfully achieved the following objectives: (i) bringing together the community of researchers in the area of Machine Transliteration to focus on various research avenues, (ii) Calibrating systems on common corpora, using common metrics, thus creating a reasonable baseline for the state-of-the-art of transliteration systems, and (iii) providing a quantitative basis for meaningful comparison and analysis between various algorithmic approaches used in machine transliteration. We believe that the results of this shared task would uncover a host of interesting research problems, giving impetus to research in this significant research area.","PeriodicalId":262513,"journal":{"name":"NEWS@IJCNLP","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131964729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 93

English-Hindi Transliteration Using Context-Informed PB-SMT: the DCU System for NEWS 2009 使用上下文信息PB-SMT的英语-印地语音译:NEWS 2009的DCU系统

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699732

Rejwanul Haque, Sandipan Dandapat, Ankit K. Srivastava, S. Naskar, Andy Way

引用次数: 33

Tag Confidence Measure for Semi-Automatically Updating Named Entity Recognition 半自动更新命名实体识别的标签置信度度量

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699745

Kuniko Saito, Kenji Imamura

引用次数: 1

NEWS 2009 Machine Transliteration Shared Task System Description: Transliteration with Letter-to-Phoneme Technology NEWS 2009机器音译共享任务系统描述:字母到音素技术的音译

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699723

Colin Cherry, Hisami Suzuki