{"title":"Source language classification of indirect translations","authors":"I. Ivaska, Laura Ivaska","doi":"10.1075/target.00006.iva","DOIUrl":null,"url":null,"abstract":"\n One of the major barriers to the systematic study of indirect translation – that is, translations of\n translations – is the lack of efficient methods to identify these translations. In this article, we use supervised machine\n learning to examine whether computers can be harnessed to identify indirect translations. Our data consist of a monolingual\n comparable corpus that includes (1) nontranslated Finnish texts, (2) direct translations from English, French, German, Greek, and\n Swedish into Finnish, and (3) indirect translations from Greek (the ultimate source language) via English, French, German, and\n Swedish (mediating languages) into Finnish. We use n-grams of various types and lengths as feature sets and random forests as the\n statistical classification technique. To maximize the transferability of the method, the feature sets were implemented in\n accordance with the Universal Dependencies framework. This study confirms that computers can distinguish between translated and\n nontranslated Finnish, as well as between Finnish translations made from different source languages. Regarding indirect\n translations, the ultimate source language has a greater impact on the linguistic composition of indirect Finnish translations\n than their respective mediating languages. Hence, the indirect translations could not be reliably identified. Therefore, our\n results suggest that the reliable computational identification of indirect translations and their mediating languages requires a\n way to control for the effect of the ultimate source language.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2022-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/target.00006.iva","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 3
Abstract
One of the major barriers to the systematic study of indirect translation – that is, translations of
translations – is the lack of efficient methods to identify these translations. In this article, we use supervised machine
learning to examine whether computers can be harnessed to identify indirect translations. Our data consist of a monolingual
comparable corpus that includes (1) nontranslated Finnish texts, (2) direct translations from English, French, German, Greek, and
Swedish into Finnish, and (3) indirect translations from Greek (the ultimate source language) via English, French, German, and
Swedish (mediating languages) into Finnish. We use n-grams of various types and lengths as feature sets and random forests as the
statistical classification technique. To maximize the transferability of the method, the feature sets were implemented in
accordance with the Universal Dependencies framework. This study confirms that computers can distinguish between translated and
nontranslated Finnish, as well as between Finnish translations made from different source languages. Regarding indirect
translations, the ultimate source language has a greater impact on the linguistic composition of indirect Finnish translations
than their respective mediating languages. Hence, the indirect translations could not be reliably identified. Therefore, our
results suggest that the reliable computational identification of indirect translations and their mediating languages requires a
way to control for the effect of the ultimate source language.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.