Incremental Graph Computations: Doable and Undoable

IF 2.2 2区 计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Wenfei Fan, Chao Tian
{"title":"Incremental Graph Computations: Doable and Undoable","authors":"Wenfei Fan, Chao Tian","doi":"https://dl.acm.org/doi/full/10.1145/3500930","DOIUrl":null,"url":null,"abstract":"<p>The incremental problem for a class \\( {\\mathcal {Q}} \\) of graph queries aims to compute, given a query \\( Q \\in {\\mathcal {Q}} \\), graph <i>G</i>, answers <i>Q</i>(<i>G</i>) to <i>Q</i> in <i>G</i> and updates <i>ΔG</i> to <i>G</i> as input, changes <i>ΔO</i> to output <i>Q</i>(<i>G</i>) such that <i>Q</i>(<i>G</i>⊕<i>ΔG</i>) = <i>Q</i>(<i>G</i>)⊕<i>ΔO</i>. It is called <i>bounded</i> if its cost can be expressed as a polynomial function in the sizes of <i>Q</i>, <i>ΔG</i> and <i>ΔO</i>, which reduces the computations on possibly big <i>G</i> to small <i>ΔG</i> and <i>ΔO</i>. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded. </p><p>In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) <i>localizable</i>, if its cost is decided by small neighbors of nodes in <i>ΔG</i> instead of the entire <i>G</i>; and (b) <i>bounded relative to</i> a batch graph algorithm \\( {\\mathcal {T}} \\), if the cost is determined by the sizes of <i>ΔG</i> and changes to the affected area that is necessarily checked by any algorithms that incrementalize \\( {\\mathcal {T}} \\). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.</p>","PeriodicalId":50915,"journal":{"name":"ACM Transactions on Database Systems","volume":null,"pages":null},"PeriodicalIF":2.2000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Database Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/full/10.1145/3500930","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The incremental problem for a class \( {\mathcal {Q}} \) of graph queries aims to compute, given a query \( Q \in {\mathcal {Q}} \), graph G, answers Q(G) to Q in G and updates ΔG to G as input, changes ΔO to output Q(G) such that Q(GΔG) = Q(G)⊕ΔO. It is called bounded if its cost can be expressed as a polynomial function in the sizes of Q, ΔG and ΔO, which reduces the computations on possibly big G to small ΔG and ΔO. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded.

In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) localizable, if its cost is decided by small neighbors of nodes in ΔG instead of the entire G; and (b) bounded relative to a batch graph algorithm \( {\mathcal {T}} \), if the cost is determined by the sizes of ΔG and changes to the affected area that is necessarily checked by any algorithms that incrementalize \( {\mathcal {T}} \). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.

增量图计算:可行和不可行的
图查询类\( {\mathcal {Q}} \)的增量问题旨在计算,给定查询\( Q \in {\mathcal {Q}} \),图G,将G中的Q(G)回答为Q,并将ΔG更新为G作为输入,将ΔO更改为输出Q(G),使得Q(G⊕ΔG) = Q(G)⊕ΔO。如果它的代价可以表示为大小为Q, ΔG和ΔO的多项式函数,则称为有界,这将可能的大G的计算减少到小ΔG和ΔO。然而,无论多么理想,我们的第一个结果都是否定的:对于常见的图查询,如遍历、连通性、关键字搜索、模式匹配和最大基数匹配,它们的增量问题是无界的。鉴于负面结果,我们提出了增量图计算有效性的两个特征:(a)可本地化,如果其成本由ΔG中节点的小邻居决定,而不是整个G;(b)相对于批处理图算法\( {\mathcal {T}} \)有界,如果成本由ΔG的大小和受影响区域的变化决定,则必须由任何对\( {\mathcal {T}} \)进行增量化的算法检查。通过提供相应的增量算法,我们证明了上述增量计算要么是可本地化的,要么是相对有界的。也就是说,我们可以将大图上的增量计算减少到小数据上,或者通过最小化不必要的重新计算来增量化现有的批处理图算法。使用真实数据和合成数据,我们通过实验验证了增量算法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACM Transactions on Database Systems
ACM Transactions on Database Systems 工程技术-计算机:软件工程
CiteScore
5.60
自引率
0.00%
发文量
15
审稿时长
>12 weeks
期刊介绍: Heavily used in both academic and corporate R&D settings, ACM Transactions on Database Systems (TODS) is a key publication for computer scientists working in data abstraction, data modeling, and designing data management systems. Topics include storage and retrieval, transaction management, distributed and federated databases, semantics of data, intelligent databases, and operations and algorithms relating to these areas. In this rapidly changing field, TODS provides insights into the thoughts of the best minds in database R&D.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信