{"title":"Clustering RDF data using K-medoids","authors":"Seham A. Bamatraf, Rasha A. BinThalab","doi":"10.1109/ICOICE48418.2019.9035160","DOIUrl":null,"url":null,"abstract":"Semantic web is a knowledge graph formed around semantic languages to enable computers and software to understand contents on the web. The content is explicitly annotated with semantic metadata using Resource Description Framework (RDF) language. However, the main issue is how to efficiently retrieve the RDF data taking into account a wide variety semantic and syntax nature and large-scale of such data. This paper aims to introduce a novel mechanism based on K-medoids algorithm for narrowing down the contents of the Web to clusters pertaining subset of information. We integrated sequence alignment algorithms with linguistic similarity measures to build a distance matrix which is used later in K-medoids clustering algorithm. The experimental outcomes showed a promised result for accuracy and quality of clustering.","PeriodicalId":109414,"journal":{"name":"2019 First International Conference of Intelligent Computing and Engineering (ICOICE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 First International Conference of Intelligent Computing and Engineering (ICOICE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOICE48418.2019.9035160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Semantic web is a knowledge graph formed around semantic languages to enable computers and software to understand contents on the web. The content is explicitly annotated with semantic metadata using Resource Description Framework (RDF) language. However, the main issue is how to efficiently retrieve the RDF data taking into account a wide variety semantic and syntax nature and large-scale of such data. This paper aims to introduce a novel mechanism based on K-medoids algorithm for narrowing down the contents of the Web to clusters pertaining subset of information. We integrated sequence alignment algorithms with linguistic similarity measures to build a distance matrix which is used later in K-medoids clustering algorithm. The experimental outcomes showed a promised result for accuracy and quality of clustering.