颠覆双语词典编纂:用新方法和新技术提高质量和生产力

IF 0.9 2区 文学 0 LANGUAGE & LINGUISTICS
Lexikos Pub Date : 2022-01-01 DOI:10.5788/32-1-1686
S. Tarp
{"title":"颠覆双语词典编纂:用新方法和新技术提高质量和生产力","authors":"S. Tarp","doi":"10.5788/32-1-1686","DOIUrl":null,"url":null,"abstract":"This is a report from the real world. It informs about the outcome of a project, which the author conducted during a months-long research stay at the Danish company Ordbogen where he integrated its research and development (R&D) team. The first part of the project was to test machine translation and find out to what extent it is usable in the compilation of bilingual lexico­graphical databases. The hypothesis was that the technology was not yet mature. But surprisingly, it turned out that the accuracy rate is already so high that it is worth considering how to implement it. The second part of the project aimed at further developing an idea formulated by Fuertes-Olivera et al. (2018) on how to invert a dictionary without losing semantic content. The new vision is to compile a mono­lingual L2 database, bilingualize it to an L2–L1 database using machine trans­lation, and then invert the relationship between L2 lemmata and L1 equivalents using the L1 defi­ni­tions of the L2 lemmata as the axis. The third part of the project was to test this idea using a specially designed ad hoc pro­gram. The program automatically uploads relevant data from existing lexico­graphical databases, translates L2 definitions and example sentences into L1, suggests adequate L1 equivalents, and eventually inverts the relationship between the two languages. It worked, but the methodology still needs further refinement to be implementable on a large scale. The report con­cludes by listing some of the remaining challenges and defining the new role of the lexicog­rapher in this type of project. Keywords: lexicographical r&d, interdisciplinary collaboration, digital technology, bilingual lexicography, lexicographical databases, machine translation, automatic inversion, object language, auxiliary language, human versus artificial lexicographer","PeriodicalId":43907,"journal":{"name":"Lexikos","volume":"1 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Turning Bilingual Lexicography Upside Down: Improving Quality and Productivity with New Methods and Technology\",\"authors\":\"S. Tarp\",\"doi\":\"10.5788/32-1-1686\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This is a report from the real world. It informs about the outcome of a project, which the author conducted during a months-long research stay at the Danish company Ordbogen where he integrated its research and development (R&D) team. The first part of the project was to test machine translation and find out to what extent it is usable in the compilation of bilingual lexico­graphical databases. The hypothesis was that the technology was not yet mature. But surprisingly, it turned out that the accuracy rate is already so high that it is worth considering how to implement it. The second part of the project aimed at further developing an idea formulated by Fuertes-Olivera et al. (2018) on how to invert a dictionary without losing semantic content. The new vision is to compile a mono­lingual L2 database, bilingualize it to an L2–L1 database using machine trans­lation, and then invert the relationship between L2 lemmata and L1 equivalents using the L1 defi­ni­tions of the L2 lemmata as the axis. The third part of the project was to test this idea using a specially designed ad hoc pro­gram. The program automatically uploads relevant data from existing lexico­graphical databases, translates L2 definitions and example sentences into L1, suggests adequate L1 equivalents, and eventually inverts the relationship between the two languages. It worked, but the methodology still needs further refinement to be implementable on a large scale. The report con­cludes by listing some of the remaining challenges and defining the new role of the lexicog­rapher in this type of project. Keywords: lexicographical r&d, interdisciplinary collaboration, digital technology, bilingual lexicography, lexicographical databases, machine translation, automatic inversion, object language, auxiliary language, human versus artificial lexicographer\",\"PeriodicalId\":43907,\"journal\":{\"name\":\"Lexikos\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2022-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lexikos\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.5788/32-1-1686\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lexikos","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.5788/32-1-1686","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}
引用次数: 1

摘要

这是一份来自真实世界的报告。它介绍了作者在丹麦公司Ordbogen进行为期数月的研究期间所进行的一个项目的结果,并整合了该公司的研发团队。项目的第一部分是测试机器翻译,并找出它在多大程度上可用于双语词典-图形数据库的编写。当时的假设是这项技术还不成熟。但令人惊讶的是,事实证明,准确率已经很高,值得考虑如何实现它。该项目的第二部分旨在进一步发展Fuertes-Olivera等人(2018)提出的关于如何在不丢失语义内容的情况下反转字典的想法。新的愿景是编译一个单语言L2数据库,使用机器翻译将其双语化为L2 - L1数据库,然后使用L2引理的L1定义作为轴来反转L2引理和L1等价物之间的关系。该项目的第三部分是使用一个特别设计的临时程序来测试这个想法。该程序自动从现有的词典-图形数据库中上传相关数据,将第二语言的定义和例句翻译成第一语言,并建议适当的第一语言对等物,最终反转两种语言之间的关系。它是有效的,但是该方法还需要进一步改进才能在大规模上实现。报告最后列出了一些仍然存在的挑战,并定义了词典编纂者在这类项目中的新角色。关键词:词典研发、跨学科合作、数字技术、双语词典编纂、词典数据库、机器翻译、自动倒装、目标语言、辅助语言、人工与人工词典编纂者
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Turning Bilingual Lexicography Upside Down: Improving Quality and Productivity with New Methods and Technology
This is a report from the real world. It informs about the outcome of a project, which the author conducted during a months-long research stay at the Danish company Ordbogen where he integrated its research and development (R&D) team. The first part of the project was to test machine translation and find out to what extent it is usable in the compilation of bilingual lexico­graphical databases. The hypothesis was that the technology was not yet mature. But surprisingly, it turned out that the accuracy rate is already so high that it is worth considering how to implement it. The second part of the project aimed at further developing an idea formulated by Fuertes-Olivera et al. (2018) on how to invert a dictionary without losing semantic content. The new vision is to compile a mono­lingual L2 database, bilingualize it to an L2–L1 database using machine trans­lation, and then invert the relationship between L2 lemmata and L1 equivalents using the L1 defi­ni­tions of the L2 lemmata as the axis. The third part of the project was to test this idea using a specially designed ad hoc pro­gram. The program automatically uploads relevant data from existing lexico­graphical databases, translates L2 definitions and example sentences into L1, suggests adequate L1 equivalents, and eventually inverts the relationship between the two languages. It worked, but the methodology still needs further refinement to be implementable on a large scale. The report con­cludes by listing some of the remaining challenges and defining the new role of the lexicog­rapher in this type of project. Keywords: lexicographical r&d, interdisciplinary collaboration, digital technology, bilingual lexicography, lexicographical databases, machine translation, automatic inversion, object language, auxiliary language, human versus artificial lexicographer
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Lexikos
Lexikos Multiple-
CiteScore
1.00
自引率
25.00%
发文量
15
审稿时长
7 weeks
期刊介绍: Lexikos (Greek for "of or for words") is a journal for the lexicographical specialist. It is the only journal in Africa which is exclusively devoted to lexicography. Articles dealing with all aspects of lexicography and terminology or the implications that research in related disciplines such as linguistics, computer and information science, etc. has for lexicography will be considered for publication. Articles may be written in Afrikaans, English, Dutch, German and French.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信