Clustering RDF data using K-medoids

2019 First International Conference of Intelligent Computing and Engineering (ICOICE) Pub Date : 2019-12-01 DOI:10.1109/ICOICE48418.2019.9035160

Seham A. Bamatraf, Rasha A. BinThalab

引用次数: 0

Abstract

Semantic web is a knowledge graph formed around semantic languages to enable computers and software to understand contents on the web. The content is explicitly annotated with semantic metadata using Resource Description Framework (RDF) language. However, the main issue is how to efficiently retrieve the RDF data taking into account a wide variety semantic and syntax nature and large-scale of such data. This paper aims to introduce a novel mechanism based on K-medoids algorithm for narrowing down the contents of the Web to clusters pertaining subset of information. We integrated sequence alignment algorithms with linguistic similarity measures to build a distance matrix which is used later in K-medoids clustering algorithm. The experimental outcomes showed a promised result for accuracy and quality of clustering.

查看原文本刊更多论文

使用k - medioids聚类RDF数据

语义网是围绕语义语言形成的知识图谱，使计算机和软件能够理解网络上的内容。使用资源描述框架(RDF)语言显式地用语义元数据对内容进行注释。然而，主要的问题是如何有效地检索RDF数据，同时考虑到各种各样的语义和语法性质以及此类数据的大规模。本文旨在介绍一种基于K-medoids算法的机制，用于将Web内容缩小到属于信息子集的聚类。我们将序列比对算法与语言相似性度量相结合，建立了一个距离矩阵，该矩阵在K-medoids聚类算法中得到了应用。实验结果表明，该方法在聚类精度和聚类质量方面取得了令人满意的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 First International Conference of Intelligent Computing and Engineering (ICOICE)

自引率

0.00%

发文量