{"title":"Query-independent learning to rank RDF entity results of SPARQL queries","authors":"Sara Latifi, M. Nematbakhsh","doi":"10.1109/ICCKE.2014.6993425","DOIUrl":null,"url":null,"abstract":"RDF is a data model to represent structured data on the web. SPARQL is a query language for RDF data that returns exactly matching results. Number of these results may be very high. By rapid growth of web of data the need for efficient ranking methods for results of this kind of queries is increased. Because of exactly matching results in SPARQL queries, the focus is on the query independent features for ranking them. We use a learning to rank approach with four sets of query independent features to rank entity results of SPARQL queries over DBpedia. These features include: features extracted from RDF graph, weighted LinkCount, search engine based and information content of the RDF resource. We investigate the performance of individual features and the combination of them in learning to rank entity results. Experiments show that the complete feature set has the best performance in rankings. As an individual feature, the proposed information content of the RDF resource is a good choice based on its performance in ranking and the elapsed time for extracting this feature.","PeriodicalId":152540,"journal":{"name":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 4th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2014.6993425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
RDF is a data model to represent structured data on the web. SPARQL is a query language for RDF data that returns exactly matching results. Number of these results may be very high. By rapid growth of web of data the need for efficient ranking methods for results of this kind of queries is increased. Because of exactly matching results in SPARQL queries, the focus is on the query independent features for ranking them. We use a learning to rank approach with four sets of query independent features to rank entity results of SPARQL queries over DBpedia. These features include: features extracted from RDF graph, weighted LinkCount, search engine based and information content of the RDF resource. We investigate the performance of individual features and the combination of them in learning to rank entity results. Experiments show that the complete feature set has the best performance in rankings. As an individual feature, the proposed information content of the RDF resource is a good choice based on its performance in ranking and the elapsed time for extracting this feature.