Proceedings of the ACM on Management of Data最新文献

Bag Semantics Conjunctive Query Containment. Four Small Steps Towards Undecidability. 包语义连接查询包含。迈向不可判定性的四小步

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651604

Jerzy Marcinkowski, Mateusz Orda

引用次数: 0

Containment of Graph Queries Modulo Schema 图查询的包含模式

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651140

Víctor Gutiérrez-Basulto, Albert Gutowski, Yazmín Ibáñez-García, Filip Murlak

{"title":"Containment of Graph Queries Modulo Schema","authors":"Víctor Gutiérrez-Basulto, Albert Gutowski, Yazmín Ibáñez-García, Filip Murlak","doi":"10.1145/3651140","DOIUrl":"https://doi.org/10.1145/3651140","url":null,"abstract":"With multiple graph database systems on the market and a new Graph Query Language standard on the horizon, it is time to revisit some classic static analysis problems. Query containment, arguably the workhorse of static analysis, has already received a lot of attention in the context of graph databases, but not so in the presence of schemas. We aim to change this. Because there is no universal agreement yet on what graph schemas should be, we rely on an abstract formalism borrowed from the knowledge representation community: we assume that schemas are expressed in a description logic (DL). We identify a suitable DL that capture both basic constraints on the labels of incident nodes and edges, and more refined schema features such as participation, cardinality, and unary key constraints. Basing upon, and extending, the rich body of work on DLs, we solve the containment modulo schema problem for unions of conjunctive regular path queries (UCRPQs) and schemas whose descriptions do not mix inverses and counting. For two-way UCRPQs (UC2RPQs) we solve the problem under additional assumptions that tend to hold in practice: we restrict the use of concatenation in queries and participation constraints in schemas.","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":" 35","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140990636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On Density-based Local Community Search 基于密度的本地社区搜索

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651589

Yizhou Dai, Miao Qiao, Rong-Hua Li

引用次数: 0

PACMMOD Volume 2 Issue 2: Editorial PACMMOD 第 2 卷第 2 期：社论

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651136

F. Geerts, Wim Martens, Matthias Niewerth

{"title":"PACMMOD Volume 2 Issue 2: Editorial","authors":"F. Geerts, Wim Martens, Matthias Niewerth","doi":"10.1145/3651136","DOIUrl":"https://doi.org/10.1145/3651136","url":null,"abstract":"We are excited to announce the first issue dedicated to the PODS research track of the Proceedings of the ACM on Management of Data, or PACMMOD, journal. In its current form, this new journal hosts a SIGMOD and a PODS research track. The PODS research track aims to provide a solid scientific basis for methods, techniques, and solutions for the data management challenges that continually arise in our data-driven society. Articles for the PODS track of PACMMOD present principled contributions to modeling, application, system building, and both theoretical and experimental validation in the context of data management. Such articles might be based, among others, on establishing theoretical results, developing new concepts and frameworks that deserve further exploration, providing experimental work that sheds light on the scientific foundations of the discipline, or a rigorous analysis of both widely used and recently developed industry artifacts. At a time when computer science is increasingly data centric, it is essential to promote an active exchange of tools and techniques between principles of database systems and other communities focused on data management. The PODS track thus pays special attention to those papers that help in the urgent process of integrating data management techniques within broader computer science. Articles published in this track will be invited for presentation to the ACM Symposium on Principles of Database Systems (PODS), which is held jointly with SIGMOD each year.","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":" 29","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140992771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fast Matrix Multiplication for Query Processing 用于查询处理的快速矩阵乘法

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651599

Xiao Hu

引用次数: 0

Combined Approximations for Uniform Operational Consistent Query Answering 统一运算一致性查询回答的组合近似法

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651600

M. Calautti, Ester Livshits, Andreas Pieris, Markus Schneider

{"title":"Combined Approximations for Uniform Operational Consistent Query Answering","authors":"M. Calautti, Ester Livshits, Andreas Pieris, Markus Schneider","doi":"10.1145/3651600","DOIUrl":"https://doi.org/10.1145/3651600","url":null,"abstract":"Operational consistent query answering (CQA) is a recent framework for CQA based on revised definitions of repairs, which are built by applying a sequence of operations (e.g., fact deletions) starting from an inconsistent database until we reach a database that is consistent w.r.t. the given set of constraints. It has been recently shown that there is an efficient approximation for computing the percentage of repairs that entail a given query when we focus on primary keys, conjunctive queries, and assuming the query is fixed (i.e., in data complexity). However, it has been left open whether such an approximation exists when the query is part of the input (i.e., in combined complexity). We show that this is the case when we focus on self-join-free conjunctive queries of bounded generelized hypertreewidth. We also show that it is unlikely that efficient approximation schemes exist once we give up one of the adopted syntactic restrictions, i.e., self-join-freeness or bounding the generelized hypertreewidth. Towards the desired approximation, we introduce a counting complexity class, called SpanTL, show that each problem in it admits an efficient approximation scheme by using a recent approximability result about tree automata, and then place the problem of interest in SpanTL.","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":" 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140993686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Topology-aware Parallel Joins 拓扑感知并行连接

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651598

Xiao Hu, Paraschos Koutris

引用次数: 0

Query Optimization by Quantifier Elimination 通过消除量词优化查询

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651607

Christoph Koch, Peter Lindner

引用次数: 0

Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut 有向切分稀疏化和分布式最小切分的严格下界

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651148

Yu Cheng, Max Li, Honghao Lin, Zi-Yi Tai, David P. Woodruff, Jason Zhang

{"title":"Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut","authors":"Yu Cheng, Max Li, Honghao Lin, Zi-Yi Tai, David P. Woodruff, Jason Zhang","doi":"10.1145/3651148","DOIUrl":"https://doi.org/10.1145/3651148","url":null,"abstract":"In this paper, we consider two fundamental cut approximation problems on large graphs. We prove new lower bounds for both problems that are optimal up to logarithmic factors.\u0000 \u0000 The first problem is approximating cuts in balanced directed graphs. In this problem, we want to build a data structure that can provide (1 ± ε)-approximation of cut values on a graph with n vertices. For arbitrary directed graphs, such a data structure requires Ω(n\u0000 2\u0000 ) bits even for constant ε. To circumvent this, recent works study β-balanced graphs, meaning that for every directed cut, the total weight of edges in one direction is at most β times the total weight in the other direction. We consider the for-each model, where the goal is to approximate each cut with constant probability, and the for-all model, where all cuts must be preserved simultaneously. We improve the previous Ømega(n √β/ε) lower bound in the for-each model to ~Ω (n √β /ε) and we improve the previous Ω(n β/ε) lower bound in the for-all model to Ω(n β/ε\u0000 2\u0000 ). This resolves the main open questions of (Cen et al., ICALP, 2021).\u0000 \u0000 \u0000 The second problem is approximating the global minimum cut in a local query model, where we can only access the graph via degree, edge, and adjacency queries. We prove an ΩL(min m, m/ε\u0000 2\u0000 k R) lower bound for this problem, which improves the previous ΩL(m/k R) lower bound, where m is the number of edges, k is the minimum cut size, and we seek a (1+ε)-approximation. In addition, we show that existing upper bounds with minor modifications match our lower bound up to logarithmic factors.\u0000","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":" 20","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140990526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Streaming Algorithms with Few State Changes 状态变化少的流算法

Proceedings of the ACM on Management of Data Pub Date : 2024-05-10 DOI: 10.1145/3651145

Rajesh Jayaram, David P. Woodruff, Samson Zhou

{"title":"Streaming Algorithms with Few State Changes","authors":"Rajesh Jayaram, David P. Woodruff, Samson Zhou","doi":"10.1145/3651145","DOIUrl":"https://doi.org/10.1145/3651145","url":null,"abstract":"In this paper, we study streaming algorithms that minimize the number of changes made to their internal state (i.e., memory contents). While the design of streaming algorithms typically focuses on minimizing space and update time, these metrics fail to capture the asymmetric costs, inherent in modern hardware and database systems, of reading versus writing to memory. In fact, most streaming algorithms write to their memory on every update, which is undesirable when writing is significantly more expensive than reading. This raises the question of whether streaming algorithms with small space and number of memory writes are possible.\u0000 \u0000 We first demonstrate that, for the fundamental F\u0000 p\u0000 moment estimation problem with p ≥ 1, any streaming algorithm that achieves a constant factor approximation must make Ω(n\u0000 1-1/p\u0000 ) internal state changes, regardless of how much space it uses. Perhaps surprisingly, we show that this lower bound can be matched by an algorithm which also has near-optimal space complexity. Specifically, we give a (1+ε)-approximation algorithm for F\u0000 p\u0000 moment estimation that use a near-optimal ~O\u0000 ε\u0000 (n\u0000 1-1/p\u0000 ) number of state changes, while simultaneously achieving near-optimal space, i.e., for p∈[1,2), our algorithm uses poly(log n,1/ε) bits of space for, while for p>2, the algorithm uses ~O\u0000 ε\u0000 (n\u0000 1-1/p\u0000 ) space. We similarly design streaming algorithms that are simultaneously near-optimal in both space complexity and the number of state changes for the heavy-hitters problem, sparse support recovery, and entropy estimation. Our results demonstrate that an optimal number of state changes can be achieved without sacrificing space complexity.\u0000","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":" 83","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140991085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0