Modern Machine Translation Systems: Trends and Prospects

O. Kuzmin
{"title":"Modern Machine Translation Systems: Trends and Prospects","authors":"O. Kuzmin","doi":"10.47388/2072-3490/lunn2021-53-1-41-52","DOIUrl":null,"url":null,"abstract":"The modern world is moving towards global digitalization and accelerated software development with a clear tendency to replace human resources by digital services or programs that imitate the doing of similar tasks. There is no doubt that, long term, the use of such technologies has economic benefits for enterprises and companies. Despite this, however, the quality of the final result is often less than satisfactory, and machine translation systems are no exception, as editing of texts translated by using online translation services is still a demanding task. At the moment, producing high-quality translations using only machine translation systems remains impossible for multiple reasons, the main of which lies in the mysteries of natural language: the existence of sublanguages, abstract words, polysemy, etc. Since improving the quality of machine translation systems is one of the priorities of natural language processing (NLP), this article describes current trends in developing modern machine translation systems as well as the latest advances in the field of natural language processing (NLP) and gives suggestions about software innovations that would minimize the number of errors. Even though recent years have seen a significant breakthrough in the speed of information analysis, in all probability, this will not be a priority issue in the future. The main criteria for evaluating the quality of translated texts will be the semantic coherence of these texts and the semantic accuracy of the lexical material used. To improve machine translation systems, we should introduce elements of data differentiation and personalization of information for individual users and their tasks, employing the method of thematic modeling for determining the subject area of a particular text. Currently, there are algorithms based on deep learning that are able to perform these tasks. However, the process of identifying unique lexical units requires a more detailed linguistic description of their semantic features. The parsing methods that will be used in analyzing texts should also provide for the possibility of clustering by sublanguages. Creating automated electronic dictionaries for specific fields of professional knowledge will help improve the quality of machine translation systems. Notably, to date there have been no successful projects of creating dictionaries for machine translation systems for specific sub-languages. Thus, there is a need to develop such dictionaries and to integrate them into existing online translation systems.","PeriodicalId":151178,"journal":{"name":"Nizhny Novgorod Linguistics University Bulletin","volume":"32 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nizhny Novgorod Linguistics University Bulletin","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47388/2072-3490/lunn2021-53-1-41-52","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The modern world is moving towards global digitalization and accelerated software development with a clear tendency to replace human resources by digital services or programs that imitate the doing of similar tasks. There is no doubt that, long term, the use of such technologies has economic benefits for enterprises and companies. Despite this, however, the quality of the final result is often less than satisfactory, and machine translation systems are no exception, as editing of texts translated by using online translation services is still a demanding task. At the moment, producing high-quality translations using only machine translation systems remains impossible for multiple reasons, the main of which lies in the mysteries of natural language: the existence of sublanguages, abstract words, polysemy, etc. Since improving the quality of machine translation systems is one of the priorities of natural language processing (NLP), this article describes current trends in developing modern machine translation systems as well as the latest advances in the field of natural language processing (NLP) and gives suggestions about software innovations that would minimize the number of errors. Even though recent years have seen a significant breakthrough in the speed of information analysis, in all probability, this will not be a priority issue in the future. The main criteria for evaluating the quality of translated texts will be the semantic coherence of these texts and the semantic accuracy of the lexical material used. To improve machine translation systems, we should introduce elements of data differentiation and personalization of information for individual users and their tasks, employing the method of thematic modeling for determining the subject area of a particular text. Currently, there are algorithms based on deep learning that are able to perform these tasks. However, the process of identifying unique lexical units requires a more detailed linguistic description of their semantic features. The parsing methods that will be used in analyzing texts should also provide for the possibility of clustering by sublanguages. Creating automated electronic dictionaries for specific fields of professional knowledge will help improve the quality of machine translation systems. Notably, to date there have been no successful projects of creating dictionaries for machine translation systems for specific sub-languages. Thus, there is a need to develop such dictionaries and to integrate them into existing online translation systems.
现代机器翻译系统:趋势与展望
现代世界正朝着全球数字化和加速软件开发的方向发展,有一种明显的趋势,即通过模仿类似任务的数字服务或程序取代人力资源。毫无疑问,从长远来看,这些技术的使用对企业和公司具有经济效益。然而,尽管如此,最终结果的质量往往不令人满意,机器翻译系统也不例外,因为使用在线翻译服务翻译的文本编辑仍然是一项要求很高的任务。目前,仅使用机器翻译系统仍然无法产生高质量的翻译,主要原因在于自然语言的奥秘:子语言的存在、抽象词、一词多义等。由于提高机器翻译系统的质量是自然语言处理(NLP)的优先事项之一,本文描述了现代机器翻译系统的发展趋势以及自然语言处理(NLP)领域的最新进展,并提出了有关软件创新的建议,以尽量减少错误的数量。尽管近年来在信息分析速度方面取得了重大突破,但在未来,这很可能不会是一个优先考虑的问题。评价译文质量的主要标准将是这些文本的语义连贯和所用词汇材料的语义准确性。为了改进机器翻译系统,我们应该针对个人用户及其任务引入数据差异化和信息个性化的元素,采用主题建模的方法来确定特定文本的主题领域。目前,有一些基于深度学习的算法能够执行这些任务。然而,识别独特词汇单位的过程需要对其语义特征进行更详细的语言描述。用于分析文本的解析方法还应该提供按子语言聚类的可能性。为特定领域的专业知识创建自动电子词典将有助于提高机器翻译系统的质量。值得注意的是,到目前为止,还没有成功的项目为特定子语言的机器翻译系统创建字典。因此,有必要开发这样的词典,并将其整合到现有的在线翻译系统中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信