{"title":"Stemming impact on Arabic text categorization performance: A survey","authors":"Fawaz S. Al-Anzi, Dia AbuZeina","doi":"10.1109/ICTA.2015.7426875","DOIUrl":null,"url":null,"abstract":"The significant growth of online textual information has increased the demand for effective content-based Arabic text categorization methods. The categorization of Arabic texts has some challenges that need to be addressed specially when using stemming. In the literature, we found a debate among researchers about the benefits of using stemming in Arabic text categorization. Hence, we performed a study of this feature reduction method to clarify the impact of this widely used method in text mining and document classification. We also presented some Arabic text cases to deny the importance of stemming in Arabic text categorization.","PeriodicalId":375443,"journal":{"name":"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"26","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTA.2015.7426875","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 26
Abstract
The significant growth of online textual information has increased the demand for effective content-based Arabic text categorization methods. The categorization of Arabic texts has some challenges that need to be addressed specially when using stemming. In the literature, we found a debate among researchers about the benefits of using stemming in Arabic text categorization. Hence, we performed a study of this feature reduction method to clarify the impact of this widely used method in text mining and document classification. We also presented some Arabic text cases to deny the importance of stemming in Arabic text categorization.