{"title":"基于序列到序列学习编码器的语言翻译系统","authors":"Sonia Sarode, Raghav Thatte, Kajal Toshniwal, Jatin Warade, Ranjeet Vasant Bidwe, Bhushan Zope","doi":"10.1109/ESCI56872.2023.10099876","DOIUrl":null,"url":null,"abstract":"Hindi is the mother tongue of nearly 133 crore Indians. Along with India, it is spoken in Nepal, Fiji, and Bangladesh. Since good knowledge of English is not common, there is a good opportunity for machine translation from English to Hindi and vice versa. Language translation is one task in which machines lag behind human power [1]. One task where machines fall short of human ability is language translation. Rule-Based Translation (RBT) systems and Statistical Machine Translation (SMT) systems are the conventional systems used for language translation. Rule Based Translation requires in-depth knowledge of the language. RBT is a fairly complicated system that can and must include many rules in order to improve quality. SMT is one of the traditional approaches to the machine translation issue. This technique works well with pairs of languages with comparable grammatical structures and requires enormous data sets. This paper proposes a better approach - a neural network model that uses “Recurrent Neural Network” (RNN) and “Gated Recurrent Unit” (GRU). The system consists of an RNN-encoder and RNN-decoder architecture and an attention mechanism to deal with long sentences.","PeriodicalId":441215,"journal":{"name":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A System for Language Translation using Sequence-to-sequence Learning based Encoder\",\"authors\":\"Sonia Sarode, Raghav Thatte, Kajal Toshniwal, Jatin Warade, Ranjeet Vasant Bidwe, Bhushan Zope\",\"doi\":\"10.1109/ESCI56872.2023.10099876\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Hindi is the mother tongue of nearly 133 crore Indians. Along with India, it is spoken in Nepal, Fiji, and Bangladesh. Since good knowledge of English is not common, there is a good opportunity for machine translation from English to Hindi and vice versa. Language translation is one task in which machines lag behind human power [1]. One task where machines fall short of human ability is language translation. Rule-Based Translation (RBT) systems and Statistical Machine Translation (SMT) systems are the conventional systems used for language translation. Rule Based Translation requires in-depth knowledge of the language. RBT is a fairly complicated system that can and must include many rules in order to improve quality. SMT is one of the traditional approaches to the machine translation issue. This technique works well with pairs of languages with comparable grammatical structures and requires enormous data sets. This paper proposes a better approach - a neural network model that uses “Recurrent Neural Network” (RNN) and “Gated Recurrent Unit” (GRU). The system consists of an RNN-encoder and RNN-decoder architecture and an attention mechanism to deal with long sentences.\",\"PeriodicalId\":441215,\"journal\":{\"name\":\"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ESCI56872.2023.10099876\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Emerging Smart Computing and Informatics (ESCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCI56872.2023.10099876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A System for Language Translation using Sequence-to-sequence Learning based Encoder
Hindi is the mother tongue of nearly 133 crore Indians. Along with India, it is spoken in Nepal, Fiji, and Bangladesh. Since good knowledge of English is not common, there is a good opportunity for machine translation from English to Hindi and vice versa. Language translation is one task in which machines lag behind human power [1]. One task where machines fall short of human ability is language translation. Rule-Based Translation (RBT) systems and Statistical Machine Translation (SMT) systems are the conventional systems used for language translation. Rule Based Translation requires in-depth knowledge of the language. RBT is a fairly complicated system that can and must include many rules in order to improve quality. SMT is one of the traditional approaches to the machine translation issue. This technique works well with pairs of languages with comparable grammatical structures and requires enormous data sets. This paper proposes a better approach - a neural network model that uses “Recurrent Neural Network” (RNN) and “Gated Recurrent Unit” (GRU). The system consists of an RNN-encoder and RNN-decoder architecture and an attention mechanism to deal with long sentences.