Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019最新文献

Comparing a Hand-crafted to an Automatically Generated Feature Set for Deep Learning: Pairwise Translation Evaluation 比较手工制作的深度学习特征集和自动生成的特征集:两两翻译评估

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_008

Despoina Mouratidis, Katia Lida Kermanidis

{"title":"Comparing a Hand-crafted to an Automatically Generated Feature Set for Deep Learning: Pairwise Translation Evaluation","authors":"Despoina Mouratidis, Katia Lida Kermanidis","doi":"10.26615/issn.2683-0078.2019_008","DOIUrl":"https://doi.org/10.26615/issn.2683-0078.2019_008","url":null,"abstract":"The automatic evaluation of machine translation (MT) has proven to be a very significant research topic. Most automatic evaluation methods focus on the evaluation of the output of MT as they compute similarity scores that represent translation quality. This work targets on the performance of MT evaluation. We present a general scheme for learning to classify parallel translations, using linguistic information, of two MT model outputs and one human (reference) translation. We present three experiments to this scheme using neural networks (NN). One using string based hand-crafted features (Exp1), the second using automatically trained embeddings from the reference and the two MT outputs (one from a statistical machine translation (SMT) model and the other from a neural ma-chine translation (NMT) model), which are learned using NN (Exp2), and the third experiment (Exp3) that combines information from the other two experiments. The languages involved are English (EN), Greek (GR) and Italian (IT) segments are educational in domain. The proposed language-independent learning scheme which combines information from the two experiments (experiment 3) achieves higher classification accuracy compared with models using BLEU score information as well as other classification approaches, such as Random Forest (RF) and Support Vector Machine (SVM).","PeriodicalId":313947,"journal":{"name":"Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116665582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Towards a Proactive MWE Terminological Platform for Cross-Lingual Mediation in the Age of Big Data 迈向大数据时代跨语言调解的主动MWE术语平台

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_014

Benjamin Ka-Yin T'sou, Ka-Po Chow, Junru Nie, Yuan Yuan, Hong Kong Chilin Ltd.

{"title":"Towards a Proactive MWE Terminological Platform for Cross-Lingual Mediation in the Age of Big Data","authors":"Benjamin Ka-Yin T'sou, Ka-Po Chow, Junru Nie, Yuan Yuan, Hong Kong Chilin Ltd.","doi":"10.26615/issn.2683-0078.2019_014","DOIUrl":"https://doi.org/10.26615/issn.2683-0078.2019_014","url":null,"abstract":"The emergence of China as a global economic power in the 21st Century has brought about surging needs for cross-lingual and cross-cultural mediation, typically performed by translators. Advances in Artificial Intelligence and Language Engineering have been bolstered by Machine learning and suitable Big Data cultivation. They have helped to meet some of the translator’s needs, though the technical specialists have not kept pace with the practical and expanding requirements in language mediation. One major technical and linguistic hurdle involves words outside the vocabulary of the translator or the lexical database he/she consults, especially Multi-Word Expressions (Compound Words) in technical subjects. A further problem is in the multiplicity of renditions of a term in the target language. This paper discusses a proactive approach following the successful extraction and application of sizable bilingual Multi-Word Expressions (Compound Words) for language mediation in technical subjects, which do not fall within the expertise of typical translators, who have inadequate appreciation of the range of new technical tools available to help him/her. Our approach draws on the personal reflections of translators and teachers of translation and is based on the prior R&D efforts relating to 300,000 comparable Chinese-English patents. The subsequent protocol we have developed aims to be proactive in meeting four identified practical challenges in technical translation (e.g. patents). It has broader economic implication in the Age of Big Data (Tsou et al, 2015) and Trade War, as the workload, if not, the challenges, increasingly cannot be met by currently available front-line translators. We shall demonstrate how new tools can be harnessed to spearhead the application of language technology not only in language mediation but also in the “teaching” and “learning” of translation. It shows how a better appreciation of their needs may enhance the contributions of the technical specialists, and thus enhance the resultant synergetic benefits.","PeriodicalId":313947,"journal":{"name":"Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129760835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Translation Quality Assessment Tools and Processes in Relation to CAT Tools 与计算机辅助翻译工具相关的翻译质量评估工具和过程

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_011

Viktoriya Petrova

引用次数: 3

Comparison between Automatic and Human Subtitling: A Case Study with Game of Thrones 自动字幕与人工字幕的比较——以《权力的游戏》为例

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_001

Sabrina Baldo de Brébisson

引用次数: 1

Designing a Frame-Semantic Machine Translation Evaluation Metric 框架语义机器翻译评价指标的设计

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_004

Oliver Czulo, Tiago Timponi Torrent, E. Matos, Alexandre Diniz da Costa, Debanjana Kar

引用次数: 6

Parallel Corpus of Croatian-Italian Administrative Texts 克罗地亚-意大利行政文本平行语料库

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_002

Marija Brkic Bakaric, Ivana Lalli Paćelat

引用次数: 1

The Success Story of Mitra Translations 米特拉翻译公司的成功故事

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_016

Mina Ilieva, M. Kancheva

引用次数: 0

What Influences the Features of Post-editese? A Preliminary Study 是什么影响了后编辑的特点?初步研究

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_003

Sheila Castilho, Natália Resende, R. Mitkov

引用次数: 9

Corpus Linguistics, Translation and Error Analysis 语料库语言学、翻译与错误分析

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_012

M. Stambolieva

引用次数: 0

The Punster’s Amanuensis: The Proper Place of Humans and Machines in the Translation of Wordplay Punster的Amanuensis:人类和机器在文字游戏翻译中的适当位置

Proceedings of the Second Workshop Human-Informed Translation and Interpreting Technology associated with RANLP 2019 Pub Date : 2019-10-30 DOI: 10.26615/issn.2683-0078.2019_007

Tristan Miller

引用次数: 7