{"title":"构建阿拉伯语比喻语言的多类XGBoost模型","authors":"N. Elmitwally","doi":"10.1109/ICCIS49240.2020.9257669","DOIUrl":null,"url":null,"abstract":"In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.","PeriodicalId":425637,"journal":{"name":"2020 2nd International Conference on Computer and Information Sciences (ICCIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Building A Multi-class XGBoost Model for Arabic Figurative Language\",\"authors\":\"N. Elmitwally\",\"doi\":\"10.1109/ICCIS49240.2020.9257669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.\",\"PeriodicalId\":425637,\"journal\":{\"name\":\"2020 2nd International Conference on Computer and Information Sciences (ICCIS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Conference on Computer and Information Sciences (ICCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIS49240.2020.9257669\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Computer and Information Sciences (ICCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIS49240.2020.9257669","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Building A Multi-class XGBoost Model for Arabic Figurative Language
In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.