{"title":"波斯语基于结构规则的词干","authors":"Elahe Rahimtoroghi, H. Faili, A. Shakery","doi":"10.1109/ISTEL.2010.5734090","DOIUrl":null,"url":null,"abstract":"This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system's precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.","PeriodicalId":306663,"journal":{"name":"2010 5th International Symposium on Telecommunications","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A structural rule-based stemmer for Persian\",\"authors\":\"Elahe Rahimtoroghi, H. Faili, A. Shakery\",\"doi\":\"10.1109/ISTEL.2010.5734090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system's precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.\",\"PeriodicalId\":306663,\"journal\":{\"name\":\"2010 5th International Symposium on Telecommunications\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 5th International Symposium on Telecommunications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISTEL.2010.5734090\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 5th International Symposium on Telecommunications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISTEL.2010.5734090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system's precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.