An English Islamic Articles Dataset (EIAD) for developing an IslamBot Question Answering Chatbot

M. Mohammed, Salsabil Amin, M. Aref
Published in: 2022 5th International Conference on Computing and Informatics (ICCI), 2022-03-09
DOI: https://doi.org/10.1109/icci54321.2022.9756122
Citations: 1

Abstract

Chatbots are among the most widely recommended technologies of recent decades, particularly in the era of digitization. They can save considerable time for both users and customer service employees by answering questions instantly. IslamBot is an Islamic chatbot, i.e., one responsible for answering inquiries related to the Islamic religion. Its target audience is non-Muslims interested in embracing Islam and new Muslims. Building such a chatbot requires a large amount of trusted data. Accordingly, this paper proposes the English Islamic Articles Dataset (EIAD) as a benchmark reference for English Islamic question answering. The dataset contains about 10,000 English Islamic articles scraped from authenticated and trusted websites such as NewMuslims.com [1], IslamReligion.com [2], and IslamQA.com [3]. It comprises about 275 articles from NewMuslims.com [1], 1,550 articles from IslamReligion.com [2], and 8,292 articles from IslamQA.com [3]. EIAD is a structured dataset, i.e., it is labeled and categorized. It contains about 15 categories, each covering several topics. This paper focuses on how the EIAD was collected.
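As a rough illustration of the composition described in the abstract, the sketch below totals the per-source article counts and shows one way a labeled, categorized record might be represented. The per-source counts come from the abstract; the `Article` field names, the helper functions, and the sample category labels are hypothetical and are not the published EIAD schema.

```python
from collections import Counter
from dataclasses import dataclass

# Per-source article counts as stated in the abstract.
SOURCE_COUNTS = {
    "NewMuslims.com": 275,
    "IslamReligion.com": 1550,
    "IslamQA.com": 8292,
}


@dataclass
class Article:
    """Hypothetical record layout; field names are assumptions, not the EIAD schema."""
    source: str    # website the article was scraped from
    category: str  # one of the roughly 15 category labels
    title: str
    text: str


def total_articles(counts):
    """Sum the per-source counts to get the overall dataset size."""
    return sum(counts.values())


def count_by_category(articles):
    """Count how many articles carry each category label."""
    return Counter(article.category for article in articles)


if __name__ == "__main__":
    # 275 + 1550 + 8292 = 10117, consistent with "about 10,000" articles.
    print(total_articles(SOURCE_COUNTS))

    # Made-up sample records, for illustration only.
    sample = [
        Article("NewMuslims.com", "Prayer", "Example title A", "..."),
        Article("IslamQA.com", "Prayer", "Example title B", "..."),
        Article("IslamReligion.com", "Beliefs", "Example title C", "..."),
    ]
    print(count_by_category(sample))
```

The three stated counts sum to 10,117, which matches the abstract's figure of about 10,000 articles.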