{"title":"Team Wa’ed Al-Shrida at the Mowjaz Multi-Topic Labelling Task","authors":"Wa’ed Al-Shrida","doi":"10.1109/ICICS52457.2021.9464574","DOIUrl":null,"url":null,"abstract":"This paper describes an attempt in the \"Mowjaz Multi-topic Labelling Task\", the competition is about classifying the Arabic articles to its topic by using artificial intelligence and neural networks, the programming language that was used to classify the datasets is \"Python\". The attempt was started by uploading the datasets from the \"Github website\", the datasets that were used in the system include three groups, train, validation, and test datasets. The \"Pyarabic\" and simple-transformers\" libraries were used to allow the system to manipulate Arabic letters and simplify the usage of Transformer models without having to compromise on utility, respectively. The model’s type that I used is \"Bert\" and its name is \"Asafaya/Bert-base-Arabic\". The accuracy of the result that was gotten is as follows: F1 macro: 0.864, F1 micro: 0.869, competition website on Codalab: 0.8430.","PeriodicalId":421803,"journal":{"name":"2021 12th International Conference on Information and Communication Systems (ICICS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICS52457.2021.9464574","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
This paper describes my submission to the Mowjaz Multi-Topic Labelling Task, a competition on classifying Arabic articles by topic using artificial intelligence and neural networks. Python was used to implement the classification system. The work began by obtaining the datasets from the task's GitHub repository; they consist of three splits: training, validation, and test. The PyArabic library was used to handle Arabic text, and the simple-transformers library to simplify the use of Transformer models without compromising on utility. The model type is BERT, and the specific pretrained model is "asafaya/bert-base-arabic". The results obtained are: F1 macro 0.864, F1 micro 0.869, and a score of 0.8430 on the Codalab competition website.
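
To make the pipeline concrete, the following is a minimal sketch of how the described setup could look with PyArabic and simple-transformers. It is not the author's original code: the placeholder data, the number of topics (NUM_TOPICS), and the training hyperparameters are assumptions for illustration; only the model type ("bert") and the pretrained checkpoint ("asafaya/bert-base-arabic") come from the abstract.

```python
# Minimal sketch (not the author's original code) of a Mowjaz-style multi-label
# setup with PyArabic and simple-transformers. The placeholder data, NUM_TOPICS,
# and the hyperparameters below are assumptions for illustration only.
import pandas as pd
import pyarabic.araby as araby
from simpletransformers.classification import (
    MultiLabelClassificationArgs,
    MultiLabelClassificationModel,
)

NUM_TOPICS = 10  # assumption: replace with the number of topics defined by the task


def normalize(text: str) -> str:
    """Light Arabic normalisation with PyArabic: strip diacritics and tatweel."""
    text = araby.strip_tashkeel(text)
    text = araby.strip_tatweel(text)
    return text


# Placeholder splits; the real train/validation/test data come from the task's
# GitHub repository. simple-transformers expects a "text" column and a "labels"
# column holding one multi-hot vector (list of 0/1) per article.
train_df = pd.DataFrame({
    "text": ["نص المقال الأول", "نص المقال الثاني"],
    "labels": [[1] + [0] * (NUM_TOPICS - 1), [0, 1] + [0] * (NUM_TOPICS - 2)],
})
val_df = train_df.copy()

train_df["text"] = train_df["text"].apply(normalize)
val_df["text"] = val_df["text"].apply(normalize)

model_args = MultiLabelClassificationArgs(
    num_train_epochs=3,       # illustrative hyperparameters, not the paper's values
    train_batch_size=16,
    overwrite_output_dir=True,
)

# BERT model type with the pretrained Arabic checkpoint named in the abstract.
model = MultiLabelClassificationModel(
    "bert",
    "asafaya/bert-base-arabic",
    num_labels=NUM_TOPICS,
    args=model_args,
    use_cuda=False,           # set True when a GPU is available
)

model.train_model(train_df)
result, model_outputs, wrong_predictions = model.eval_model(val_df)
print(result)
```

In this kind of setup, the per-topic sigmoid outputs of the multi-label model are thresholded to produce the final topic set for each article, from which F1 macro and F1 micro can be computed on the validation split.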