Introducing Arabic-SQuADv2.0 for Effective Arabic Machine Reading Comprehension

Zeyad Ahmed, Mariam Zeyada, Youssef Amin, Donia Gamal, Hanan Hindy
{"title":"Introducing Arabic-SQuADv2.0 for Effective Arabic Machine Reading Comprehension","authors":"Zeyad Ahmed, Mariam Zeyada, Youssef Amin, Donia Gamal, Hanan Hindy","doi":"10.5815/ijeme.2023.05.03","DOIUrl":null,"url":null,"abstract":"Machine Reading Comprehension (MRC), known as the ability of computers to read and understand unstructured text and then answer questions, is still an open research field. MRC is considered one of the most research-demanding sub-tasks in Natural Language Processing (NLP) and Natural Language Understanding (NLU). MRC introduces multiple research challenges. One of these challenges is that the models should be trained to answer all questions and abstain from answering when the answer is not covered in the given context. Another challenge lies in dataset availability. These challenges are amplified for non-Latin-based languages; Arabic as an example. Currently, available Arabic MCR datasets are either small-sized high-quality collections or large-sized low-quality datasets. Additionally, they do not include unanswerable questions. This lack of resources depicts the model as incapable of real-world deployments. To tackle these challenges, this paper proposes a novel large-size high-quality Arabic MRC dataset that includes unanswerable questions, named “Arabic-SQuAD v2.0'”. The dataset consists of 96051 triplets {question, context, answer} in an attempt to help enrich the field of Arabic-MRC. Furthermore, a Machine Learning (ML)-based model is introduced that is capable of effectively solving Arabic MRC-with-unanswerable questions. The results of the proposed model are satisfactory and comparable with Latin-based language models. Furthermore, the results show a significant improvement of the current state-of-the-art Arabic MRC. To be exact, the model scores 71.49 F1-score and 65.12 Exact Match (EM). This proposed dataset and implementation pave the way to further Arabic MRC; aiming to reach a state when MRC models could mimic human text reasoning.","PeriodicalId":479503,"journal":{"name":"International journal of education and management engineering","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of education and management engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5815/ijeme.2023.05.03","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Machine Reading Comprehension (MRC), known as the ability of computers to read and understand unstructured text and then answer questions, is still an open research field. MRC is considered one of the most research-demanding sub-tasks in Natural Language Processing (NLP) and Natural Language Understanding (NLU). MRC introduces multiple research challenges. One of these challenges is that the models should be trained to answer all questions and abstain from answering when the answer is not covered in the given context. Another challenge lies in dataset availability. These challenges are amplified for non-Latin-based languages; Arabic as an example. Currently, available Arabic MCR datasets are either small-sized high-quality collections or large-sized low-quality datasets. Additionally, they do not include unanswerable questions. This lack of resources depicts the model as incapable of real-world deployments. To tackle these challenges, this paper proposes a novel large-size high-quality Arabic MRC dataset that includes unanswerable questions, named “Arabic-SQuAD v2.0'”. The dataset consists of 96051 triplets {question, context, answer} in an attempt to help enrich the field of Arabic-MRC. Furthermore, a Machine Learning (ML)-based model is introduced that is capable of effectively solving Arabic MRC-with-unanswerable questions. The results of the proposed model are satisfactory and comparable with Latin-based language models. Furthermore, the results show a significant improvement of the current state-of-the-art Arabic MRC. To be exact, the model scores 71.49 F1-score and 65.12 Exact Match (EM). This proposed dataset and implementation pave the way to further Arabic MRC; aiming to reach a state when MRC models could mimic human text reasoning.
介绍Arabic- squadv2.0用于有效的阿拉伯语机器阅读理解
机器阅读理解(MRC),即计算机阅读和理解非结构化文本并回答问题的能力,仍然是一个开放的研究领域。MRC被认为是自然语言处理(NLP)和自然语言理解(NLU)中最需要研究的子任务之一。MRC引入了多种研究挑战。其中一个挑战是,应该训练模型回答所有问题,并在给定的上下文中不涉及答案时避免回答问题。另一个挑战在于数据集的可用性。对于非拉丁语系的语言,这些挑战更大;比如阿拉伯语。目前,可用的阿拉伯语MCR数据集要么是小规模的高质量集合,要么是大规模的低质量数据集。此外,它们不包括无法回答的问题。这种资源的缺乏将模型描述为无法进行实际部署。为了应对这些挑战,本文提出了一个新的大尺寸高质量阿拉伯语MRC数据集,其中包括无法回答的问题,命名为“Arabic- squad v2.0”。该数据集由96051个三元组{问题,上下文,答案}组成,旨在帮助丰富阿拉伯语mrc领域。此外,介绍了一种基于机器学习(ML)的模型,该模型能够有效地解决带有无法回答问题的阿拉伯语mrc。该模型的结果令人满意,并可与基于拉丁语的语言模型相媲美。此外,结果显示当前最先进的阿拉伯MRC的显著改进。准确地说,模型的f1得分为71.49,精确匹配(EM)得分为65.12。这个建议的数据集和实施为进一步的阿拉伯语MRC铺平了道路;旨在达到MRC模型可以模仿人类文本推理的状态。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信