Fine-Grained Complexity of Regular Path Queries

Database theory-- ICDT : International Conference ... proceedings. International Conference on Database Theory Pub Date : 2021-01-06 DOI:10.4230/LIPIcs.ICDT.2021.19

Katrin Casel, Markus L. Schmid

{"title":"Fine-Grained Complexity of Regular Path Queries","authors":"Katrin Casel, Markus L. Schmid","doi":"10.4230/LIPIcs.ICDT.2021.19","DOIUrl":null,"url":null,"abstract":"A regular path query (RPQ) is a regular expression q that returns all node pairs (u, v) from a graph database that are connected by an arbitrary path labelled with a word from L(q). The obvious algorithmic approach to RPQ-evaluation (called PG-approach), i.e., constructing the product graph between an NFA for q and the graph database, is appealing due to its simplicity and also leads to efficient algorithms. However, it is unclear whether the PG-approach is optimal. We address this question by thoroughly investigating which upper complexity bounds can be achieved by the PG-approach, and we complement these with conditional lower bounds (in the sense of the fine-grained complexity framework). A special focus is put on enumeration and delay bounds, as well as the data complexity perspective. A main insight is that we can achieve optimal (or near optimal) algorithms with the PG-approach, but the delay for enumeration is rather high (linear in the database). We explore three successful approaches towards enumeration with sub-linear delay: super-linear preprocessing, approximations of the solution sets, and restricted classes of RPQs.","PeriodicalId":90482,"journal":{"name":"Database theory-- ICDT : International Conference ... proceedings. International Conference on Database Theory","volume":"18 1","pages":"19:1-19:20"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Database theory-- ICDT : International Conference ... proceedings. International Conference on Database Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.ICDT.2021.19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

A regular path query (RPQ) is a regular expression q that returns all node pairs (u, v) from a graph database that are connected by an arbitrary path labelled with a word from L(q). The obvious algorithmic approach to RPQ-evaluation (called PG-approach), i.e., constructing the product graph between an NFA for q and the graph database, is appealing due to its simplicity and also leads to efficient algorithms. However, it is unclear whether the PG-approach is optimal. We address this question by thoroughly investigating which upper complexity bounds can be achieved by the PG-approach, and we complement these with conditional lower bounds (in the sense of the fine-grained complexity framework). A special focus is put on enumeration and delay bounds, as well as the data complexity perspective. A main insight is that we can achieve optimal (or near optimal) algorithms with the PG-approach, but the delay for enumeration is rather high (linear in the database). We explore three successful approaches towards enumeration with sub-linear delay: super-linear preprocessing, approximations of the solution sets, and restricted classes of RPQs.

查看原文本刊更多论文

正则路径查询的细粒度复杂度

正则路径查询(RPQ)是一个正则表达式q，它返回图数据库中所有节点对(u, v)，这些节点对由一个标记为L(q)中的单词的任意路径连接。rpq评估的明显算法方法(称为pg方法)，即在q的NFA和图形数据库之间构建乘积图，由于其简单性和高效算法而具有吸引力。然而，目前尚不清楚pg方法是否最佳。我们通过彻底研究pg方法可以实现哪些上限复杂性界限来解决这个问题，并且我们用条件下界(在细粒度复杂性框架的意义上)来补充这些上限。特别关注枚举和延迟边界，以及数据复杂性透视图。主要的见解是，我们可以使用pg方法实现最优(或接近最优)算法，但是枚举的延迟相当高(在数据库中是线性的)。我们探索了三种成功的亚线性延迟枚举方法:超线性预处理、解集逼近和rpq的限制类。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Database theory-- ICDT : International Conference ... proceedings. International Conference on Database Theory

自引率

0.00%

发文量