构建阿拉伯语比喻语言的多类XGBoost模型

N. Elmitwally
{"title":"构建阿拉伯语比喻语言的多类XGBoost模型","authors":"N. Elmitwally","doi":"10.1109/ICCIS49240.2020.9257669","DOIUrl":null,"url":null,"abstract":"In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.","PeriodicalId":425637,"journal":{"name":"2020 2nd International Conference on Computer and Information Sciences (ICCIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Building A Multi-class XGBoost Model for Arabic Figurative Language\",\"authors\":\"N. Elmitwally\",\"doi\":\"10.1109/ICCIS49240.2020.9257669\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.\",\"PeriodicalId\":425637,\"journal\":{\"name\":\"2020 2nd International Conference on Computer and Information Sciences (ICCIS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Conference on Computer and Information Sciences (ICCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIS49240.2020.9257669\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Computer and Information Sciences (ICCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIS49240.2020.9257669","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

在自然语言处理(NLP)领域,文本分类成为众多学者和研究者关注的课题。阿拉伯语中的修辞方法是通过书面或口头文本表达意见和感情的语言表达手段之一。在阿拉伯语中,特别是在所谓的阿拉伯修辞学中,关注这一专门的研究点是至关重要的,这些修辞学与比喻手段(即明喻、夸张和讽刺)有关。在本文中,我们建立了极端梯度增强(XGBoost)分类器来对多类阿拉伯语比喻文本进行分类。XGBoost在速度和性能方面都相当高效。XGBoost分类器是在这个阿拉伯语比喻语料库(AFC)上开发、训练和测试的。作为Fl-score获得的XGBoost分类器性能为88%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Building A Multi-class XGBoost Model for Arabic Figurative Language
In the Natural Language Processing (NLP) field, the text classification becomes a task on which many scholars and researchers concentrate. Rhetorical methods in the Arabic language are among the means of linguistic expression that express opinions and feelings through written or spoken texts. It is essential to pay attention to this specialized research point in the Arabic language and in particular in the so-called Arabic rhetoric sciences that are concerned with figurative devices (i.e. simile, hyperbole and sarcasm). In this paper, we build the eXtreme Gradient Boosting (XGBoost) classifier to classify the multi-class Arabic figurative texts. The XGBoost is quite efficient for its speed and performance. The XGBoost classifier was developed, trained, and tested on this Arabic Figurative Corpus (AFC). The performance of the XGBoost classifier obtained as Fl-score is 88%.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信