{"title":"现代维吾尔语词频统计技术研究","authors":"Azragul, Nianmei, Yasen Yimin","doi":"10.1109/IALP.2013.20","DOIUrl":null,"url":null,"abstract":"With the development of our society, the languages are also constantly evolving. Word is the smallest meaningful language composition which able to activity independently, and is also important carrier of knowledge and the basic operation unit in the natural language processing system. Uyghur word frequency statistics technology is the process by computer automatic identification term boundary in the texts. It is the most important pretreatment of information processing technology. However, there is no a really mature Uighur word frequency statistics system, which became one of the bottlenecks that hampered the development of information processing in Uighur language seriously at present. This paper discusses the idea and algorithms of the Uyghur word frequency statistics system in detail. Secondly introduces functional design process of the word frequency statistics system. Third I describe methods and techniques of this system. Finally it states statement of the test results.","PeriodicalId":413833,"journal":{"name":"2013 International Conference on Asian Language Processing","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Research of Modern Uyghur Word Frequency Statistical Technology\",\"authors\":\"Azragul, Nianmei, Yasen Yimin\",\"doi\":\"10.1109/IALP.2013.20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of our society, the languages are also constantly evolving. Word is the smallest meaningful language composition which able to activity independently, and is also important carrier of knowledge and the basic operation unit in the natural language processing system. Uyghur word frequency statistics technology is the process by computer automatic identification term boundary in the texts. It is the most important pretreatment of information processing technology. However, there is no a really mature Uighur word frequency statistics system, which became one of the bottlenecks that hampered the development of information processing in Uighur language seriously at present. This paper discusses the idea and algorithms of the Uyghur word frequency statistics system in detail. Secondly introduces functional design process of the word frequency statistics system. Third I describe methods and techniques of this system. Finally it states statement of the test results.\",\"PeriodicalId\":413833,\"journal\":{\"name\":\"2013 International Conference on Asian Language Processing\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2013.20\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2013.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research of Modern Uyghur Word Frequency Statistical Technology
With the development of our society, the languages are also constantly evolving. Word is the smallest meaningful language composition which able to activity independently, and is also important carrier of knowledge and the basic operation unit in the natural language processing system. Uyghur word frequency statistics technology is the process by computer automatic identification term boundary in the texts. It is the most important pretreatment of information processing technology. However, there is no a really mature Uighur word frequency statistics system, which became one of the bottlenecks that hampered the development of information processing in Uighur language seriously at present. This paper discusses the idea and algorithms of the Uyghur word frequency statistics system in detail. Secondly introduces functional design process of the word frequency statistics system. Third I describe methods and techniques of this system. Finally it states statement of the test results.