{"title":"A Hybrid Approach for Amazigh-English Machine Translation","authors":"I. Taghbalout, Fadoua Ataa-Allah","doi":"10.1145/3330089.3330113","DOIUrl":null,"url":null,"abstract":"In this paper, we present our hybrid methodology for building a bidirectional Amazigh-English machine translation. The architecture of the proposed system is based on both Interlingua-Based Machine Translation (IBMT) and Statistical-Based Machine Translation (SBMT) approaches. Amazigh is a less-resourced language. It does not have parallel corpora with enough size. So, using statistical approach for such language will not be a good choice, because this approach requires large parallel corpora to well train probabilistic models, and to ensure a translation of good quality. Since we dispose of an Amazigh IBMT based deconverter, we thought, firstly, to use it in building an Amazigh-English parallel corpus. This latter have been exploited to train the necessary models in SBMT.","PeriodicalId":251275,"journal":{"name":"Proceedings of the 7th International Conference on Software Engineering and New Technologies","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 7th International Conference on Software Engineering and New Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3330089.3330113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we present our hybrid methodology for building a bidirectional Amazigh-English machine translation. The architecture of the proposed system is based on both Interlingua-Based Machine Translation (IBMT) and Statistical-Based Machine Translation (SBMT) approaches. Amazigh is a less-resourced language. It does not have parallel corpora with enough size. So, using statistical approach for such language will not be a good choice, because this approach requires large parallel corpora to well train probabilistic models, and to ensure a translation of good quality. Since we dispose of an Amazigh IBMT based deconverter, we thought, firstly, to use it in building an Amazigh-English parallel corpus. This latter have been exploited to train the necessary models in SBMT.