Name Disambiguation Analysis Using the Word Sense Disambiguation Method in Hadith

Ageng Prasetio, M. Bijaksana, Arie A. Suryani
Journal: Edumatic: Jurnal Pendidikan Informatika
Published: 2020-12-20
DOI: 10.29408/EDUMATIC.V4I2.2551 (https://doi.org/10.29408/EDUMATIC.V4I2.2551)
Citations: 3

Abstract

Name disambiguation is the process of identifying similar names in sentences and determining whether they refer to the same entity. Ambiguous names occur in the hadith of Sahih Bukhari: the name "Abdullah bin Amru" appears in hadith no. 27 and again in hadith no. 58. The names are identical, but there is no proof that they refer to the same person. This is an early indication of name ambiguity in the hadith. Based on this problem, this research aims to disambiguate the names of hadith narrators through classification that takes the chain of narrators (perawi) into account. To solve this problem, the authors used Word Sense Disambiguation (WSD), the process of assigning a meaning to a word based on the context in which it appears. To classify the names in the hadith, the authors used the KNN algorithm; combining the WSD and KNN methods can reduce the ambiguity of names in the hadith. The data used in this study came from the hadith of Sahih Bukhari and went through a pre-processing stage. The results are collections of hadith numbers that share the same predicted name, with an accuracy of 99% at k = 1. Thus, this method can be used for name disambiguation.
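
The abstract does not give implementation details, so the following is only a minimal sketch of how context-based KNN classification of an ambiguous narrator name might look. It assumes that the context of a name is the set of other narrators in the same chain, that similarity is cosine similarity over bag-of-words counts, and that labels such as `person_1` identify distinct individuals. The chains, the placeholder narrator names, and all function names are invented for illustration and are not taken from the paper or from Sahih Bukhari.

```python
from collections import Counter
import math


def context_vector(chain, target):
    """Bag-of-words counts over the narrators that surround `target` in a chain."""
    return Counter(name for name in chain if name != target)


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_predict(train, query, k=1):
    """Majority label among the k training contexts most similar to `query`."""
    ranked = sorted(train, key=lambda item: cosine(item[0], query), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


TARGET = "Abdullah bin Amru"

# Toy, invented chains: the narrator names are placeholders, not real isnad data.
labelled_chains = [
    (["Narrator A", "Narrator B", TARGET], "person_1"),
    (["Narrator A", "Narrator C", TARGET], "person_1"),
    (["Narrator X", "Narrator Y", TARGET], "person_2"),
]
train = [(context_vector(chain, TARGET), label) for chain, label in labelled_chains]

# An unlabelled occurrence of the same name, described by its chain context.
query = context_vector(["Narrator B", "Narrator A", TARGET], TARGET)
prediction = knn_predict(train, query, k=1)
print(prediction)
```

With k = 1, the prediction is simply the label of the single most similar labelled chain, so occurrences of the name that share surrounding narrators are grouped under the same predicted person.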