W. Dilshani, S. Yashothara, R. T. Uthayasanker, Sanath Jayasena
{"title":"机器翻译中僧伽罗语和泰米尔语的语言差异","authors":"W. Dilshani, S. Yashothara, R. T. Uthayasanker, Sanath Jayasena","doi":"10.1109/IALP.2018.8629113","DOIUrl":null,"url":null,"abstract":"This paper presents a study of the lexical-semantic divergence between Sinhala and Tamil languages. Study of divergence is critical as differences in linguistic and extra-linguistic features in languages play pivotal roles in translation. This research the first study of the divergence between Sinhala and Tamil languages and is based on Dorr's classification. We propose a computer-assisted divergence study procedure using statistical machine translation, which is easy and gives good performance compared to traditional approaches. Accordingly, this research has the twin aims of revisiting classification of divergence types as outlined by Dorr and outlining some of the new divergence patterns specific to Sinhala and Tamil languages. This study proposes a rule-based algorithm to classify a divergence.","PeriodicalId":156896,"journal":{"name":"2018 International Conference on Asian Language Processing (IALP)","volume":"253 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Linguistic Divergence of Sinhala and Tamil Languages in Machine Translation\",\"authors\":\"W. Dilshani, S. Yashothara, R. T. Uthayasanker, Sanath Jayasena\",\"doi\":\"10.1109/IALP.2018.8629113\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a study of the lexical-semantic divergence between Sinhala and Tamil languages. Study of divergence is critical as differences in linguistic and extra-linguistic features in languages play pivotal roles in translation. This research the first study of the divergence between Sinhala and Tamil languages and is based on Dorr's classification. We propose a computer-assisted divergence study procedure using statistical machine translation, which is easy and gives good performance compared to traditional approaches. Accordingly, this research has the twin aims of revisiting classification of divergence types as outlined by Dorr and outlining some of the new divergence patterns specific to Sinhala and Tamil languages. This study proposes a rule-based algorithm to classify a divergence.\",\"PeriodicalId\":156896,\"journal\":{\"name\":\"2018 International Conference on Asian Language Processing (IALP)\",\"volume\":\"253 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Asian Language Processing (IALP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2018.8629113\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Asian Language Processing (IALP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2018.8629113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Linguistic Divergence of Sinhala and Tamil Languages in Machine Translation
This paper presents a study of the lexical-semantic divergence between Sinhala and Tamil languages. Study of divergence is critical as differences in linguistic and extra-linguistic features in languages play pivotal roles in translation. This research the first study of the divergence between Sinhala and Tamil languages and is based on Dorr's classification. We propose a computer-assisted divergence study procedure using statistical machine translation, which is easy and gives good performance compared to traditional approaches. Accordingly, this research has the twin aims of revisiting classification of divergence types as outlined by Dorr and outlining some of the new divergence patterns specific to Sinhala and Tamil languages. This study proposes a rule-based algorithm to classify a divergence.