NEWS@IJCNLP Latest Publications, Page 2

Czech Named Entity Corpus and SVM-based Recognizer

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699748

Jana Kravalova, Z. Žabokrtský

Citations: 44

Named Entity Transcription with Pair n-Gram Models

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699713

Martin Jansche, R. Sproat

Citations: 14

A Hybrid Model for Urdu Hindi Transliteration

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699746

M. G. A. Malik, L. Besacier, C. Boitet, P. Bhattacharyya

Citations: 41

Language Independent Transliteration System Using Phrase-based SMT Approach on Substrings

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699734

Sara Noeman

Abstract: Every day the newswire introduces events from all over the world, highlighting new names of persons, locations, and organizations with different origins. These names appear as out-of-vocabulary (OOV) words for machine translation, cross-lingual information retrieval, and many other NLP applications. One way to deal with OOV words is to transliterate the unknown words, that is, to render them in the orthography of the second language. We introduce a statistical approach for transliteration using only the bilingual resources released in the shared task and without any previous knowledge of the target languages. Mapping the transliteration problem to the machine translation problem, we make use of the phrase-based SMT approach and apply it to substrings of names. In the English to Russian task, we report ACC (accuracy in top-1) of 0.545, Mean F-score of 0.917, and MRR (Mean Reciprocal Rank) of 0.596. Due to time constraints, we ran a single experiment in the English to Chinese task, reporting ACC, Mean F-score, and MRR of 0.411, 0.737, and 0.464 respectively. Finally, it is worth mentioning that the system is language independent, since the author is not familiar with either of the languages used in the experiments.
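The ACC and MRR figures quoted in this abstract can be illustrated with a minimal Python sketch. It assumes each source name has a ranked candidate list and a set of acceptable references. The function names and the toy data are hypothetical, and this is not the shared task's official scoring script (Mean F-score is omitted here).

```python
from typing import Sequence, Set


def top1_accuracy(candidates: Sequence[Sequence[str]],
                  references: Sequence[Set[str]]) -> float:
    """Fraction of names whose first-ranked candidate matches any reference."""
    hits = sum(
        1 for cands, refs in zip(candidates, references)
        if cands and cands[0] in refs
    )
    return hits / len(references)


def mean_reciprocal_rank(candidates: Sequence[Sequence[str]],
                         references: Sequence[Set[str]]) -> float:
    """Average of 1/rank of the first correct candidate (0 if none is correct)."""
    total = 0.0
    for cands, refs in zip(candidates, references):
        for rank, cand in enumerate(cands, start=1):
            if cand in refs:
                total += 1.0 / rank
                break
    return total / len(references)


# Hypothetical toy data: placeholder strings, not real system output.
candidates = [["a", "b"], ["x", "y"], ["p", "q"]]
references = [{"a"}, {"y"}, {"z"}]

acc = top1_accuracy(candidates, references)         # only the first name is correct at rank 1
mrr = mean_reciprocal_rank(candidates, references)  # (1 + 1/2 + 0) / 3
print(f"ACC={acc:.3f} MRR={mrr:.3f}")
```

Under these definitions MRR can never be lower than ACC, which is consistent with the reported pairs (0.545 and 0.596, 0.411 and 0.464).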

Citations: 15

Analysis and Robust Extraction of Changing Named Entities

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699743

Masatoshi Tsuchiya, Shoko Endo, S. Nakagawa

Citations: 5

Transliteration by Bidirectional Statistical Machine Translation

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699719

A. Finch, E. Sumita

Citations: 16

Learning Multi Character Alignment Rules and Classification of Training Data for Transliteration

NEWS@IJCNLP Pub Date : 2009-08-07 DOI: 10.3115/1699705.1699721

Dipankar Bose, S. Sarkar