Measure of fuzzy presence of descriptors on Arabic Text Mining
I. E. Hassani, Abdelaziz Kriouile, Y. Benghabrit
2012 Colloquium in Information Science and Technology, published 2012-12-24
DOI: 10.1109/CIST.2012.6388063 (https://doi.org/10.1109/CIST.2012.6388063)
Citations: 5
Abstract
In this work, we propose a new model of radical descriptors for Arabic text mining. The model augments each radical with the lexical information contained in the morphological pattern of the Arabic word. We develop a statistical model based on a hidden Markov chain to disambiguate the morphological analysis of corpora. We then propose a new method for measuring the relationship between descriptors, based on the notion of a “fuzzy measure of presence”, adapt the traditional statistical measures to this context, and outline the key similarity and distance measures used in text mining.
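The abstract does not define the fuzzy measure of presence, so the following is only a minimal sketch of the general idea of adapting traditional measures to graded presence. It assumes that each document assigns every descriptor a degree of presence in [0, 1] instead of a binary present/absent flag. The function names, the transliterated descriptor keys, and the degree values are all hypothetical and are not taken from the paper.

```python
import math

# ASSUMPTION: a document is a dict mapping a descriptor to a degree of
# presence in [0, 1]. The paper's actual definition is not given in the
# abstract; this representation is illustrative only.

def fuzzy_jaccard(a, b):
    """Jaccard similarity generalized to degrees: sum of min over sum of max."""
    keys = set(a) | set(b)
    inter = sum(min(a.get(k, 0.0), b.get(k, 0.0)) for k in keys)
    union = sum(max(a.get(k, 0.0), b.get(k, 0.0)) for k in keys)
    return inter / union if union else 0.0

def cosine_similarity(a, b):
    """Cosine similarity computed on the vectors of presence degrees."""
    dot = sum(a[k] * b[k] for k in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical descriptors and made-up degrees, for illustration only.
doc1 = {"ktb": 1.0, "drs": 0.5}
doc2 = {"ktb": 0.5, "elm": 1.0}

print(fuzzy_jaccard(doc1, doc2))
print(cosine_similarity(doc1, doc2))
```

When every degree is 0 or 1, `fuzzy_jaccard` reduces to the classical Jaccard index on sets of descriptors, which is one way a traditional measure can carry over to a graded notion of presence.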