Introducing Arabic-SQuADv2.0 for Effective Arabic Machine Reading Comprehension

International journal of education and management engineering Pub Date : 2023-10-08 DOI:10.5815/ijeme.2023.05.03

Zeyad Ahmed, Mariam Zeyada, Youssef Amin, Donia Gamal, Hanan Hindy

{"title":"Introducing Arabic-SQuADv2.0 for Effective Arabic Machine Reading Comprehension","authors":"Zeyad Ahmed, Mariam Zeyada, Youssef Amin, Donia Gamal, Hanan Hindy","doi":"10.5815/ijeme.2023.05.03","DOIUrl":null,"url":null,"abstract":"Machine Reading Comprehension (MRC), known as the ability of computers to read and understand unstructured text and then answer questions, is still an open research field. MRC is considered one of the most research-demanding sub-tasks in Natural Language Processing (NLP) and Natural Language Understanding (NLU). MRC introduces multiple research challenges. One of these challenges is that the models should be trained to answer all questions and abstain from answering when the answer is not covered in the given context. Another challenge lies in dataset availability. These challenges are amplified for non-Latin-based languages; Arabic as an example. Currently, available Arabic MCR datasets are either small-sized high-quality collections or large-sized low-quality datasets. Additionally, they do not include unanswerable questions. This lack of resources depicts the model as incapable of real-world deployments. To tackle these challenges, this paper proposes a novel large-size high-quality Arabic MRC dataset that includes unanswerable questions, named “Arabic-SQuAD v2.0'”. The dataset consists of 96051 triplets {question, context, answer} in an attempt to help enrich the field of Arabic-MRC. Furthermore, a Machine Learning (ML)-based model is introduced that is capable of effectively solving Arabic MRC-with-unanswerable questions. The results of the proposed model are satisfactory and comparable with Latin-based language models. Furthermore, the results show a significant improvement of the current state-of-the-art Arabic MRC. To be exact, the model scores 71.49 F1-score and 65.12 Exact Match (EM). This proposed dataset and implementation pave the way to further Arabic MRC; aiming to reach a state when MRC models could mimic human text reasoning.","PeriodicalId":479503,"journal":{"name":"International journal of education and management engineering","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of education and management engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5815/ijeme.2023.05.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Machine Reading Comprehension (MRC), known as the ability of computers to read and understand unstructured text and then answer questions, is still an open research field. MRC is considered one of the most research-demanding sub-tasks in Natural Language Processing (NLP) and Natural Language Understanding (NLU). MRC introduces multiple research challenges. One of these challenges is that the models should be trained to answer all questions and abstain from answering when the answer is not covered in the given context. Another challenge lies in dataset availability. These challenges are amplified for non-Latin-based languages; Arabic as an example. Currently, available Arabic MCR datasets are either small-sized high-quality collections or large-sized low-quality datasets. Additionally, they do not include unanswerable questions. This lack of resources depicts the model as incapable of real-world deployments. To tackle these challenges, this paper proposes a novel large-size high-quality Arabic MRC dataset that includes unanswerable questions, named “Arabic-SQuAD v2.0'”. The dataset consists of 96051 triplets {question, context, answer} in an attempt to help enrich the field of Arabic-MRC. Furthermore, a Machine Learning (ML)-based model is introduced that is capable of effectively solving Arabic MRC-with-unanswerable questions. The results of the proposed model are satisfactory and comparable with Latin-based language models. Furthermore, the results show a significant improvement of the current state-of-the-art Arabic MRC. To be exact, the model scores 71.49 F1-score and 65.12 Exact Match (EM). This proposed dataset and implementation pave the way to further Arabic MRC; aiming to reach a state when MRC models could mimic human text reasoning.

查看原文本刊更多论文

介绍Arabic- squadv2.0用于有效的阿拉伯语机器阅读理解

机器阅读理解(MRC)，即计算机阅读和理解非结构化文本并回答问题的能力，仍然是一个开放的研究领域。MRC被认为是自然语言处理(NLP)和自然语言理解(NLU)中最需要研究的子任务之一。MRC引入了多种研究挑战。其中一个挑战是，应该训练模型回答所有问题，并在给定的上下文中不涉及答案时避免回答问题。另一个挑战在于数据集的可用性。对于非拉丁语系的语言，这些挑战更大;比如阿拉伯语。目前，可用的阿拉伯语MCR数据集要么是小规模的高质量集合，要么是大规模的低质量数据集。此外，它们不包括无法回答的问题。这种资源的缺乏将模型描述为无法进行实际部署。为了应对这些挑战，本文提出了一个新的大尺寸高质量阿拉伯语MRC数据集，其中包括无法回答的问题，命名为“Arabic- squad v2.0”。该数据集由96051个三元组{问题，上下文，答案}组成，旨在帮助丰富阿拉伯语mrc领域。此外，介绍了一种基于机器学习(ML)的模型，该模型能够有效地解决带有无法回答问题的阿拉伯语mrc。该模型的结果令人满意，并可与基于拉丁语的语言模型相媲美。此外，结果显示当前最先进的阿拉伯MRC的显著改进。准确地说，模型的f1得分为71.49，精确匹配(EM)得分为65.12。这个建议的数据集和实施为进一步的阿拉伯语MRC铺平了道路;旨在达到MRC模型可以模仿人类文本推理的状态。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International journal of education and management engineering

自引率

0.00%

发文量