Bryan-Elliott Tam, Ruben Taelman, Julián Rojas Meléndez, Pieter Colpaert
{"title":"Optimizing Traversal Queries of Sensor Data Using a Rule-Based Reachability Approach","authors":"Bryan-Elliott Tam, Ruben Taelman, Julián Rojas Meléndez, Pieter Colpaert","doi":"arxiv-2408.17157","DOIUrl":null,"url":null,"abstract":"Link Traversal queries face challenges in completeness and long execution\ntime due to the size of the web. Reachability criteria define completeness by\nrestricting the links followed by engines. However, the number of links to\ndereference remains the bottleneck of the approach. Web environments often have\nstructures exploitable by query engines to prune irrelevant sources. Current\ncriteria rely on using information from the query definition and predefined\npredicate. However, it is difficult to use them to traverse environments where\nlogical expressions indicate the location of resources. We propose to use a\nrule-based reachability criterion that captures logical statements expressed in\nhypermedia descriptions within linked data documents to prune irrelevant\nsources. In this poster paper, we show how the Comunica link traversal engine\nis modified to take hints from a hypermedia control vocabulary, to prune\nirrelevant sources. Our preliminary findings show that by using this strategy,\nthe query engine can significantly reduce the number of HTTP requests and the\nquery execution time without sacrificing the completeness of results. Our work\nshows that the investigation of hypermedia controls in link pruning of\ntraversal queries is a worthy effort for optimizing web queries of unindexed\ndecentralized databases.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.17157","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Link Traversal queries face challenges in completeness and long execution
time due to the size of the web. Reachability criteria define completeness by
restricting the links followed by engines. However, the number of links to
dereference remains the bottleneck of the approach. Web environments often have
structures exploitable by query engines to prune irrelevant sources. Current
criteria rely on using information from the query definition and predefined
predicate. However, it is difficult to use them to traverse environments where
logical expressions indicate the location of resources. We propose to use a
rule-based reachability criterion that captures logical statements expressed in
hypermedia descriptions within linked data documents to prune irrelevant
sources. In this poster paper, we show how the Comunica link traversal engine
is modified to take hints from a hypermedia control vocabulary, to prune
irrelevant sources. Our preliminary findings show that by using this strategy,
the query engine can significantly reduce the number of HTTP requests and the
query execution time without sacrificing the completeness of results. Our work
shows that the investigation of hypermedia controls in link pruning of
traversal queries is a worthy effort for optimizing web queries of unindexed
decentralized databases.