{"title":"当代维吾尔语语料库的自动形态标注","authors":"G. Altenbek","doi":"10.1109/IRI.2006.252474","DOIUrl":null,"url":null,"abstract":"In this paper, we propose methods of Uighur word lemmatization by using of morphemic analysis and word's structural analysis, integrating morphological processing and part-of speech (POS) tagging, so as to find linguistic information and automatic POS of Uighur Corpus as the final purpose. For the regular words, the accuracy of word lemmatization reach 85% and POS reach 80%","PeriodicalId":402255,"journal":{"name":"2006 IEEE International Conference on Information Reuse & Integration","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automatic Morphological Tagging of Contemporary Uighur Corpus\",\"authors\":\"G. Altenbek\",\"doi\":\"10.1109/IRI.2006.252474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose methods of Uighur word lemmatization by using of morphemic analysis and word's structural analysis, integrating morphological processing and part-of speech (POS) tagging, so as to find linguistic information and automatic POS of Uighur Corpus as the final purpose. For the regular words, the accuracy of word lemmatization reach 85% and POS reach 80%\",\"PeriodicalId\":402255,\"journal\":{\"name\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2006.252474\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Conference on Information Reuse & Integration","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2006.252474","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Automatic Morphological Tagging of Contemporary Uighur Corpus
In this paper, we propose methods of Uighur word lemmatization by using of morphemic analysis and word's structural analysis, integrating morphological processing and part-of speech (POS) tagging, so as to find linguistic information and automatic POS of Uighur Corpus as the final purpose. For the regular words, the accuracy of word lemmatization reach 85% and POS reach 80%