Transliteration of Proper Names in Cross-Lingual Information Retrieval

NER@ACL | Pub Date: 2003-07-12 | DOI: 10.3115/1119384.1119392

Paola Virga, S. Khudanpur

Citations: 192

Abstract

We address the problem of transliterating English names using Chinese orthography in support of cross-lingual speech and text processing applications. We demonstrate the application of statistical machine translation techniques to "translate" the phonemic representation of an English name, obtained by using an automatic text-to-speech system, to a sequence of initials and finals, commonly used sub-word units of pronunciation for Chinese. We then use another statistical translation model to map the initial/final sequence to Chinese characters. We also present an evaluation of this module in retrieval of Mandarin spoken documents from the TDT corpus using English text queries.
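The two-stage mapping described in the abstract (phonemes to initials/finals, then initials/finals to characters) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the lookup tables, their probabilities, and the function names are hypothetical and do not come from the paper. Each statistical translation model is replaced by a per-symbol argmax over a toy table, so alignment, reordering, and language-model scoring are all omitted.

```python
from typing import Dict, List

# Hypothetical stand-in for the text-to-speech front end: name -> phonemes.
TOY_PHONEMES: Dict[str, List[str]] = {
    "lina": ["L", "IY", "N", "AA"],
}

# Hypothetical P(initial or final | English phoneme); values are made up.
TOY_PHONE_TO_UNIT: Dict[str, Dict[str, float]] = {
    "L": {"l": 0.9, "r": 0.1},
    "IY": {"i": 0.8, "ei": 0.2},
    "N": {"n": 1.0},
    "AA": {"a": 0.9, "e": 0.1},
}

# Hypothetical P(Chinese character | syllable); values are made up.
TOY_SYLLABLE_TO_CHAR: Dict[str, Dict[str, float]] = {
    "li": {"丽": 0.5, "利": 0.3, "里": 0.2},
    "na": {"娜": 0.7, "纳": 0.3},
}


def best(dist: Dict[str, float]) -> str:
    """Return the highest-probability outcome of a toy distribution."""
    return max(dist, key=dist.get)


def phonemes_to_units(phonemes: List[str]) -> List[str]:
    """Stage 1: map each phoneme to its most likely initial or final."""
    return [best(TOY_PHONE_TO_UNIT[p]) for p in phonemes]


def units_to_syllables(units: List[str]) -> List[str]:
    """Pair units into syllables, assuming strict initial/final alternation."""
    if len(units) % 2 != 0:
        raise ValueError("expected alternating initial/final units")
    return [units[i] + units[i + 1] for i in range(0, len(units), 2)]


def syllables_to_chars(syllables: List[str]) -> str:
    """Stage 2: map each syllable to its most likely character."""
    return "".join(best(TOY_SYLLABLE_TO_CHAR[s]) for s in syllables)


def transliterate(name: str) -> str:
    """Run the toy pipeline: name -> phonemes -> initials/finals -> characters."""
    phonemes = TOY_PHONEMES[name.lower()]
    units = phonemes_to_units(phonemes)
    return syllables_to_chars(units_to_syllables(units))


if __name__ == "__main__":
    print(transliterate("Lina"))
```

With these toy tables, `transliterate("Lina")` passes through the units `l, i, n, a` and the syllables `li, na`. A real system along the lines of the abstract would instead search over many candidate sequences and score them with trained translation and language models.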

