Research on Mongolian lexical analyzer based on NFA
S. Loglo, Sarula, Hua Shabao
2010 IEEE International Conference on Intelligent Computing and Intelligent Systems, 2010-12-06
DOI: 10.1109/ICICISYS.2010.5658760 (https://doi.org/10.1109/ICICISYS.2010.5658760)
Citations: 4
Abstract
Mongolian is an agglutinative language: words are formed and inflected by attaching different suffixes to a stem. In theory, the Mongolian vocabulary is unbounded, so no dictionary can cover every word and its numerous morphological variants. Independent, efficient lexical analysis software that can identify and generate words and their morphological variants is therefore needed. In this paper, we introduce a Mongolian lexical analyzer that combines dictionaries with NFA-based methods to greatly improve analysis speed. When used in modern Mongolian parsing software, it runs nearly two orders of magnitude faster than a simple dictionary-based or rule-based algorithm.
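The abstract does not describe how the analyzer is built, so the following is only a minimal sketch of the general idea of combining a dictionary with an NFA, not the authors' implementation. It assumes a toy stem list and suffix list written in Latin transliteration (illustrative strings, not a claim about real Mongolian morphology), and the names `NFA`, `build_analyzer`, and `analyze` are hypothetical. Stems lead from the start state to a shared "hub" state, and every suffix loops from the hub back to itself, so a word is accepted when it is one stem followed by zero or more suffixes.

```python
from collections import defaultdict


class NFA:
    """Tiny NFA whose transitions may emit a morpheme label."""

    def __init__(self):
        # (state, symbol) -> set of (next_state, label or None)
        self.trans = defaultdict(set)
        self.n_states = 0
        self.start = self.new_state()
        self.accept = set()

    def new_state(self):
        self.n_states += 1
        return self.n_states - 1

    def add_path(self, src, text, dst, label):
        """Add a chain of fresh states spelling `text` from src to dst.

        The last transition carries `label`, marking a morpheme boundary.
        `text` is assumed to be non-empty.
        """
        cur = src
        for ch in text[:-1]:
            nxt = self.new_state()
            self.trans[(cur, ch)].add((nxt, None))
            cur = nxt
        self.trans[(cur, text[-1])].add((dst, label))


def build_analyzer(stems, suffixes):
    """Build an NFA accepting: one stem, then zero or more suffixes."""
    nfa = NFA()
    hub = nfa.new_state()  # reached after a complete stem or suffix
    nfa.accept.add(hub)
    for stem in stems:
        nfa.add_path(nfa.start, stem, hub, label=stem)
    for suffix in suffixes:
        nfa.add_path(hub, suffix, hub, label="+" + suffix)
    return nfa


def analyze(nfa, word):
    """Return every stem/suffix segmentation of `word`, sorted."""
    # Each configuration is (current state, morphemes recognized so far).
    configs = {(nfa.start, ())}
    for ch in word:
        nxt = set()
        for state, segs in configs:
            for target, label in nfa.trans.get((state, ch), ()):
                nxt.add((target, segs + (label,) if label else segs))
        configs = nxt
        if not configs:  # no live path left: the word is rejected
            return []
    return sorted(list(segs) for state, segs in configs if state in nfa.accept)


# Toy, hypothetical dictionary data (assumption, for illustration only).
toy = build_analyzer(stems=["nom", "no"], suffixes=["uud", "muud", "iin"])
print(analyze(toy, "nomuud"))  # two competing segmentations
print(analyze(toy, "nomx"))    # [] because no path consumes "x"
```

All candidate paths are followed in parallel in a single left-to-right pass over the word, so ambiguous inputs such as `nomuud` yield every valid segmentation without backtracking. This sketch says nothing about the speedup reported in the abstract, which depends on the authors' actual data structures and dictionaries.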