AUTOMATIC TEXT TRANSLATION SYSTEM FOR ARTIFICIAL LLANGUAGES

Olesia Barkovska, Anton Havrashenko, Vladyslav Kholiev, Olena Sevostianova
{"title":"AUTOMATIC TEXT TRANSLATION SYSTEM FOR ARTIFICIAL LLANGUAGES","authors":"Olesia Barkovska, Anton Havrashenko, Vladyslav Kholiev, Olena Sevostianova","doi":"10.31891/csit-2021-5-3","DOIUrl":null,"url":null,"abstract":"The growing number and variety of artificial languages leads to the need and relevance of creating automatic dictionaries for their translation in order to facilitate human communication. Such languages include languages where vocabulary, phonetics, and grammar have been specifically designed to achieve specific goals and to communicate with a group of people by interests or place of residence. These languages can be distributed among people of certain professions or among neighboring nations. Examples are slang and surzhik. The common for them is that there is a basic language (literary), the intersection in spelling and meaning of words and phrases with which is quite large. The main goal of the project is to create a system of automatic translation of words and texts from / into arbitrary languages, including hybrid, artificial and slang ones. The proposed model shows the interaction and partial interdependence of the creation and adjustment modules and the translation module of the dictionary, which is explained by tacking the approach of reverse propagation of the translation error. To perform experiments and analyze the performance of the proposed approach to the organization of automatic translation of texts from and into arbitrary language, a software application was developed, which includes a subprogram of initial word processing for dictionary organization, one for creating a working dictionary and one for two-way improvement of created dictionary by the inclusion of new texts in order to improve the quality of translation, including the search for word phrases, idiom, and translation for them, the subprogram of dividing the dictionary into sub-dictionaries with a small percentage of text, the subprogram of the translator itself. To test and analyze the results of the proposed generalized model, three types of source texts were used:  literary poetry translation,  literary prose translation, literal prose translation. The results of the experiments showed that the proposed approach provides a high level of translation (up to 98,8%) in similar languages (between such languages as Ukrainian-Russian, or Ukrainian - Ukrainian-Russian surzhik wih equal word order in the sentence), especially with a literally translated source text. It has become known that the use of artistic texts to generate dictionaries is possible, but not very effective.","PeriodicalId":353631,"journal":{"name":"Computer systems and information technologies","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer systems and information technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31891/csit-2021-5-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The growing number and variety of artificial languages leads to the need and relevance of creating automatic dictionaries for their translation in order to facilitate human communication. Such languages include languages where vocabulary, phonetics, and grammar have been specifically designed to achieve specific goals and to communicate with a group of people by interests or place of residence. These languages can be distributed among people of certain professions or among neighboring nations. Examples are slang and surzhik. The common for them is that there is a basic language (literary), the intersection in spelling and meaning of words and phrases with which is quite large. The main goal of the project is to create a system of automatic translation of words and texts from / into arbitrary languages, including hybrid, artificial and slang ones. The proposed model shows the interaction and partial interdependence of the creation and adjustment modules and the translation module of the dictionary, which is explained by tacking the approach of reverse propagation of the translation error. To perform experiments and analyze the performance of the proposed approach to the organization of automatic translation of texts from and into arbitrary language, a software application was developed, which includes a subprogram of initial word processing for dictionary organization, one for creating a working dictionary and one for two-way improvement of created dictionary by the inclusion of new texts in order to improve the quality of translation, including the search for word phrases, idiom, and translation for them, the subprogram of dividing the dictionary into sub-dictionaries with a small percentage of text, the subprogram of the translator itself. To test and analyze the results of the proposed generalized model, three types of source texts were used:  literary poetry translation,  literary prose translation, literal prose translation. The results of the experiments showed that the proposed approach provides a high level of translation (up to 98,8%) in similar languages (between such languages as Ukrainian-Russian, or Ukrainian - Ukrainian-Russian surzhik wih equal word order in the sentence), especially with a literally translated source text. It has become known that the use of artistic texts to generate dictionaries is possible, but not very effective.
人工语言自动文本翻译系统
人工语言的数量和种类的不断增加,导致了创建自动词典的翻译,以方便人类交流的需要和相关性。这些语言包括词汇、语音和语法都是专门为实现特定目标而设计的语言,并与兴趣或居住地的一群人进行交流。这些语言可以在某些职业的人之间或在邻近的国家之间传播。例如俚语和苏尔日克语。它们的共同点是有一种基本的语言(文学),单词和短语的拼写和含义与之有很大的交集。该项目的主要目标是创建一个自动将单词和文本从任意语言翻译成任意语言的系统,包括混合语言、人工语言和俚语。该模型通过引入翻译错误反向传播的方法来解释词典的创建和调整模块与翻译模块之间的相互作用和部分依赖关系。为了对本文提出的组织任意语言文本自动翻译的方法进行实验和性能分析,开发了一个软件应用程序,该应用程序包括用于字典组织的初始词处理子程序、用于创建工作字典的子程序和用于通过添加新文本来双向改进已创建的字典以提高翻译质量的子程序,包括搜索单词短语;习语和翻译对于他们来说,将字典分成文本比例很小的子字典的子程序,译者本身的子程序。为了检验和分析所提出的广义模型的结果,我们使用了三种类型的源文本:文学诗歌翻译、文学散文翻译、文学散文翻译。实验结果表明,该方法在相似的语言(如句子中词序相等的乌克兰-俄罗斯语或乌克兰-乌克兰-俄罗斯语苏尔日克语)中提供了高水平的翻译(高达98.8%),特别是对字面翻译的源文本。人们已经知道,使用艺术文本来生成词典是可能的,但不是很有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信