European Association for Machine Translation Conferences/Workshops最新文献

Incorporating Human Translator Style into English-Turkish Literary Machine Translation 将人工译者风格融入英土文学机器翻译

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-07-21 DOI: 10.48550/arXiv.2307.11457

Zeynep Yi̇rmi̇beşoğlu, Olgun Dursun, Harun Dalli, Mehmet Şahin, Ena Hodzik, Sabri Gürses, Tunga Güngör

引用次数: 0

Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios 多语言情景下人类和神经机器翻译的自动识别

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-05-31 DOI: 10.48550/arXiv.2305.19757

Mălina Chichirău, Rik van Noord, Antonio Toral

{"title":"Automatic Discrimination of Human and Neural Machine Translation in Multilingual Scenarios","authors":"Mălina Chichirău, Rik van Noord, Antonio Toral","doi":"10.48550/arXiv.2305.19757","DOIUrl":"https://doi.org/10.48550/arXiv.2305.19757","url":null,"abstract":"We tackle the task of automatically discriminating between human and machine translations. As opposed to most previous work, we perform experiments in a multilingual setting, considering multiple languages and multilingual pretrained language models. We show that a classifier trained on parallel data with a single source language (in our case German–English) can still perform well on English translations that come from different source languages, even when the machine translations were produced by other systems than the one it was trained on. Additionally, we demonstrate that incorporating the source text in the input of a multilingual classifier improves (i) its accuracy and (ii) its robustness on cross-system evaluation, compared to a monolingual classifier. Furthermore, we find that using training data from multiple source languages (German, Russian and Chinese) tends to improve the accuracy of both monolingual and multilingual classifiers. Finally, we show that bilingual classifiers and classifiers trained on multiple source languages benefit from being trained on longer text sequences, rather than on sentences.","PeriodicalId":137211,"journal":{"name":"European Association for Machine Translation Conferences/Workshops","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129901357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigating Lexical Sharing in Multilingual Machine Translation for Indian Languages 印度语多语言机器翻译中的词汇共享研究

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-05-04 DOI: 10.48550/arXiv.2305.03207

Sonal Sannigrahi, Rachel Bawden

引用次数: 0

State Spaces Aren’t Enough: Machine Translation Needs Attention 状态空间不够:机器翻译需要注意

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-04-25 DOI: 10.48550/arXiv.2304.12776

Ali Vardasbi, Telmo Pires, Robin M. Schmidt, Stephan Peitz

引用次数: 2

An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models 利用知识蒸馏压缩多语言神经机器翻译模型的实证研究

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-04-19 DOI: 10.48550/arXiv.2304.09388

Varun Gumma, Raj Dabre, Pratyush Kumar

{"title":"An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models","authors":"Varun Gumma, Raj Dabre, Pratyush Kumar","doi":"10.48550/arXiv.2304.09388","DOIUrl":"https://doi.org/10.48550/arXiv.2304.09388","url":null,"abstract":"Knowledge distillation (KD) is a well-known method for compressing neural models. However, works focusing on distilling knowledge from large multilingual neural machine translation (MNMT) models into smaller ones are practically nonexistent, despite the popularity and superiority of MNMT. This paper bridges this gap by presenting an empirical investigation of knowledge distillation for compressing MNMT models. We take Indic to English translation as a case study and demonstrate that commonly used language-agnostic and language-aware KD approaches yield models that are 4-5x smaller but also suffer from performance drops of up to 3.5 BLEU. To mitigate this, we then experiment with design considerations such as shallower versus deeper models, heavy parameter sharing, multistage training, and adapters. We observe that deeper compact models tend to be as good as shallower non-compact ones and that fine-tuning a distilled model on a high-quality subset slightly boosts translation quality. Overall, we conclude that compressing MNMT models via KD is challenging, indicating immense scope for further research.","PeriodicalId":137211,"journal":{"name":"European Association for Machine Translation Conferences/Workshops","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132500205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Tailoring Domain Adaptation for Machine Translation Quality Estimation 基于裁剪域自适应的机器翻译质量估计

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-04-18 DOI: 10.48550/arXiv.2304.08891

Javad Pourmostafa Roshan Sharami, D. Shterionov, F. Blain, Eva Vanmassenhove, M. D. Sisto, Chris Emmery, P. Spronck

引用次数: 2

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM 大型多语言模型的翻译性能研究:以BLOOM为例

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-03-03 DOI: 10.48550/arXiv.2303.01911

Rachel Bawden, Franccois Yvon

引用次数: 19

Large Language Models Are State-of-the-Art Evaluators of Translation Quality 大型语言模型是最先进的翻译质量评估工具

European Association for Machine Translation Conferences/Workshops Pub Date : 2023-02-28 DOI: 10.48550/arXiv.2302.14520

Tom Kocmi, C. Federmann

引用次数: 92

The Roles of Language Models and Hierarchical Models in Neural Sequence-to-Sequence Prediction 语言模型和层次模型在神经序列到序列预测中的作用

European Association for Machine Translation Conferences/Workshops Pub Date : 2020-05-16 DOI: 10.17863/CAM.49422

Felix Stahlberg

{"title":"The Roles of Language Models and Hierarchical Models in Neural Sequence-to-Sequence Prediction","authors":"Felix Stahlberg","doi":"10.17863/CAM.49422","DOIUrl":"https://doi.org/10.17863/CAM.49422","url":null,"abstract":"With the advent of deep learning, research in many areas of machine learning is converging towards the same set of methods and models. For example, long short-term memory networks (Hochreiter and Schmidhuber, 1997) are not only popular for various tasks in natural language processing (NLP) such as speech recognition, machine translation, handwriting recognition, syntactic parsing, etc., but they are also applicable to seemingly unrelated fields such as bioinformatics (Min et al., 2016). Recent advances in contextual word embeddings like BERT (Devlin et al., 2019) boast with achieving state-of-the-art results on 11 NLP tasks with the same model. Before deep learning, a speech recognizer and a syntactic parser used to have little in common as systems were much more tailored towards the task at hand. At the core of this development is the tendency to view each task as yet another data mapping problem, neglecting the particular characteristics and (soft) requirements that tasks often have in practice. This often goes along with a sharp break of deep learning methods with previous research in the specific area. This thesis can be understood as an antithesis to the prevailing paradigm. We show how traditional symbolic statistical machine translation (Koehn, 2009) models can still improve neural machine translation (Kalchbrenner and Blunsom, 2013; Sutskever et al., 2014; Bahdanau et al., 2015, NMT) while reducing the risk of common pathologies of NMT such as hallucinations and neologisms. Other external symbolic models such as spell checkers and morphology databases help neural models to correct grammatical errors in text.","PeriodicalId":137211,"journal":{"name":"European Association for Machine Translation Conferences/Workshops","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131935796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

An English-Swahili parallel corpus and its use for neural machine translation in the news domain 英语-斯瓦希里语平行语料库及其在新闻领域神经机器翻译中的应用

European Association for Machine Translation Conferences/Workshops Pub Date : 2020-03-31 DOI: 10.5281/ZENODO.3923590

F. Sánchez-Martínez, V. M. Sánchez-Cartagena, J. A. Pérez-Ortiz, M. Forcada, M. Esplà-Gomis, Andrew Secker, Susie Coleman, J. Wall

引用次数: 7