Query-independent learning to rank RDF entity results of SPARQL queries

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE) Pub Date : 2014-12-22 DOI:10.1109/ICCKE.2014.6993425

Sara Latifi, M. Nematbakhsh

{"title":"Query-independent learning to rank RDF entity results of SPARQL queries","authors":"Sara Latifi, M. Nematbakhsh","doi":"10.1109/ICCKE.2014.6993425","DOIUrl":null,"url":null,"abstract":"RDF is a data model to represent structured data on the web. SPARQL is a query language for RDF data that returns exactly matching results. Number of these results may be very high. By rapid growth of web of data the need for efficient ranking methods for results of this kind of queries is increased. Because of exactly matching results in SPARQL queries, the focus is on the query independent features for ranking them. We use a learning to rank approach with four sets of query independent features to rank entity results of SPARQL queries over DBpedia. These features include: features extracted from RDF graph, weighted LinkCount, search engine based and information content of the RDF resource. We investigate the performance of individual features and the combination of them in learning to rank entity results. Experiments show that the complete feature set has the best performance in rankings. As an individual feature, the proposed information content of the RDF resource is a good choice based on its performance in ranking and the elapsed time for extracting this feature.","PeriodicalId":152540,"journal":{"name":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2014.6993425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

RDF is a data model to represent structured data on the web. SPARQL is a query language for RDF data that returns exactly matching results. Number of these results may be very high. By rapid growth of web of data the need for efficient ranking methods for results of this kind of queries is increased. Because of exactly matching results in SPARQL queries, the focus is on the query independent features for ranking them. We use a learning to rank approach with four sets of query independent features to rank entity results of SPARQL queries over DBpedia. These features include: features extracted from RDF graph, weighted LinkCount, search engine based and information content of the RDF resource. We investigate the performance of individual features and the combination of them in learning to rank entity results. Experiments show that the complete feature set has the best performance in rankings. As an individual feature, the proposed information content of the RDF resource is a good choice based on its performance in ranking and the elapsed time for extracting this feature.

查看原文本刊更多论文

对SPARQL查询的RDF实体结果进行排序的查询独立学习

RDF是一种表示web上结构化数据的数据模型。SPARQL是RDF数据的查询语言，它返回完全匹配的结果。这些结果的数量可能非常高。随着网络数据的快速增长，对这类查询结果的高效排序方法的需求日益增加。因为SPARQL查询中的结果是精确匹配的，所以重点放在查询无关的特性上，以便对它们进行排序。我们使用了一种带有四组查询独立特性的学习排序方法，对DBpedia上SPARQL查询的实体结果进行排序。这些特征包括:从RDF图中提取的特征、加权的LinkCount、基于搜索引擎的特征和RDF资源的信息内容。我们研究了单个特征的性能以及它们在学习对实体结果进行排序中的组合。实验表明，完整的特征集具有最佳的排序性能。作为一个单独的特征，建议的RDF资源的信息内容是一个很好的选择，这取决于它的排名性能和提取该特征所花费的时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)

自引率

0.00%

发文量