{"title":"Arabic Text Classification: A Literature Review","authors":"Bilel Elayeb","doi":"10.1109/AICCSA53542.2021.9686874","DOIUrl":null,"url":null,"abstract":"Automatic text classification or categorization consists to assign predefined classes or categories to a given set of text documents aiming to organize the document collection based on conceptual views. Although there are many text classifiers in the literature, most of them are assessed using English or other non-Arabic languages text collections. The lack of availability of a large collection in the Arabic language is one of the most important challenges facing the few numbers of existing Arabic text classifiers (ATC). We present in this paper a literature review in the domain of Arabic text classification. We firstly overview the ATC based on machine learning algorithms. Then, we investigate ATC based on deep learning techniques as well as a set of other classifiers based on non-ML algorithms. The assessment of these ATC is also discussed. Finally, we focus on some open problems and we suggest some future directions.","PeriodicalId":423896,"journal":{"name":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA53542.2021.9686874","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Automatic text classification or categorization consists to assign predefined classes or categories to a given set of text documents aiming to organize the document collection based on conceptual views. Although there are many text classifiers in the literature, most of them are assessed using English or other non-Arabic languages text collections. The lack of availability of a large collection in the Arabic language is one of the most important challenges facing the few numbers of existing Arabic text classifiers (ATC). We present in this paper a literature review in the domain of Arabic text classification. We firstly overview the ATC based on machine learning algorithms. Then, we investigate ATC based on deep learning techniques as well as a set of other classifiers based on non-ML algorithms. The assessment of these ATC is also discussed. Finally, we focus on some open problems and we suggest some future directions.