{"title":"Can a corpus-driven lexical analysis of human and machine translation unveil discourse features that set them apart?","authors":"A. Frankenberg-Garcia","doi":"10.1075/target.20065.fra","DOIUrl":null,"url":null,"abstract":"\nThere is still much to learn about the ways in which human and machine translation differ with regard to the contexts that regulate the production and interpretation of discourse. The present study explores whether a corpus-driven lexical analysis of human and machine translation can unveil discourse features that set the two apart. A balanced corpus of source texts aligned with authentic, professional translations and neural machine translations was compiled for the study. Lexical discrepancies in the two translation corpora were then extracted via a corpus-driven keyword analysis, and examined qualitatively through parallel concordances of source texts aligned with human and machine translation. The study shows that keyword analysis not only reiterates known problems of discourse in machine translation such as lexical inconsistency and pronoun resolution, but can also provide valuable insights regarding contextual aspects of translated discourse deserving further research.","PeriodicalId":51739,"journal":{"name":"Target-International Journal of Translation Studies","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2021-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Target-International Journal of Translation Studies","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/target.20065.fra","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 4
Abstract
There is still much to learn about the ways in which human and machine translation differ with regard to the contexts that regulate the production and interpretation of discourse. The present study explores whether a corpus-driven lexical analysis of human and machine translation can unveil discourse features that set the two apart. A balanced corpus of source texts aligned with authentic, professional translations and neural machine translations was compiled for the study. Lexical discrepancies in the two translation corpora were then extracted via a corpus-driven keyword analysis, and examined qualitatively through parallel concordances of source texts aligned with human and machine translation. The study shows that keyword analysis not only reiterates known problems of discourse in machine translation such as lexical inconsistency and pronoun resolution, but can also provide valuable insights regarding contextual aspects of translated discourse deserving further research.
期刊介绍:
Target promotes the scholarly study of translational phenomena from any part of the world and welcomes submissions of an interdisciplinary nature. The journal"s focus is on research on the theory, history, culture and sociology of translation and on the description and pedagogy that underpin and interact with these foci. We welcome contributions that report on empirical studies as well as speculative and applied studies. We do not publish papers on purely practical matters, and prospective contributors are advised not to submit masters theses in their raw state.