{"title":"Steering Committee","authors":"S. Woodward, R. Ellis","doi":"10.1109/pgsret.2015.7349355","DOIUrl":"https://doi.org/10.1109/pgsret.2015.7349355","url":null,"abstract":"Ronald G. Addie (University of Southern Queensland, Australia) Adnan Al-Anbuky (Auckland University of Technology, New Zealand) Franco R. Davoli (University of Genoa & National Inter-University Consortium for Telecommunications (CNIT), Italy) Mark A. Gregory (RMIT University, Australia) Richard J Harris (Massey University, New Zealand) Phuoc Tran-Gia (University of Wuerzburg, Germany) Organising Committee General Co-Chair","PeriodicalId":124051,"journal":{"name":"2020 13th International Conference on Developments in eSystems Engineering (DeSE)","volume":"14 1-4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120965239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The application of statistical methods in the development of Cyrillic-Latin converter for Tatar language","authors":"A. Danilov, L. Salekhova, K. Grigorieva, R. Zaripova, T. Zinnurov","doi":"10.1109/DeSE51703.2020.9450742","DOIUrl":"https://doi.org/10.1109/DeSE51703.2020.9450742","url":null,"abstract":"This article deals with the development of a Cyrillic-Latin converter for the Tatar language that converts text written in Tatar Cyrillic script into Latin script. In addition, the article describes some aspects of converting Cyrillic graphics to Latin for the Tatar language. The authors worked in two directions: the statistical methods necessary for the converter's operation were considered, and the speed and accuracy of the conversion algorithms were analyzed. We created an algorithm and developed software modules that convert messages written in the Tatar Cyrillic alphabet to the Tatar Latin alphabet. A verbal and an algorithmic model of conversion were constructed in accordance with local legal acts. During development, it turned out that the conversion of a Tatar word depends on its origin. Native Tatar words are converted according to the phonetic principle (кәлам - qäläm), whereas borrowed words are converted according to the rules of transliteration. The main problem of the study is therefore recognizing a word's origin. To solve this problem, the authors propose various algorithmic models. Software tools based on the statistical processing of linguistic data are considered and developed in the work: combined bigram analysis, naive Bayesian classification and a brute-force direct search. Each of these algorithms is used to determine the etymology of a word, which determines which conversion rules from Cyrillic to Latin are applied. Thus, the result of our work is a software product that converts Cyrillic graphics into Latin for the Tatar language. Further research in this area is related to the development of the software tool and its use in educational activities.","PeriodicalId":124051,"journal":{"name":"2020 13th International Conference on Developments in eSystems Engineering (DeSE)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127462954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}