Vldb Journal | Pub Date: 2023-05-17 | DOI: 10.1007/s00778-023-00793-1
Juliane Verwiebe, P. M. Grulich, J. Traub, Volker Mark
{"title":"Correction to: Survey of window types for aggregation in stream processing systems","authors":"Juliane Verwiebe, P. M. Grulich, J. Traub, Volker Mark","doi":"10.1007/s00778-023-00793-1","DOIUrl":"https://doi.org/10.1007/s00778-023-00793-1","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46946111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vldb Journal | Pub Date: 2023-04-03 | DOI: 10.1007/s00778-023-00792-2
Felix Naumann, Xin Dong
{"title":"Editorial: Special Issue for Selected Papers of VLDB 2021","authors":"Felix Naumann, Xin Dong","doi":"10.1007/s00778-023-00792-2","DOIUrl":"https://doi.org/10.1007/s00778-023-00792-2","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49063738","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vldb Journal | Pub Date: 2023-03-20 | DOI: 10.1007/s00778-023-00785-1
Z. Wang, Kai Zeng, Botong Huang, Wei Chen, Xiaozong Cui, Bo Wang, J. Liu, Liya Fan, Dachuan Qu, Zhenyu Hou, Tao Guan, Chen Li, Jingren Zhou
{"title":"Tempura: a general cost-based optimizer framework for incremental data processing (Journal Version)","authors":"Z. Wang, Kai Zeng, Botong Huang, Wei Chen, Xiaozong Cui, Bo Wang, J. Liu, Liya Fan, Dachuan Qu, Zhenyu Hou, Tao Guan, Chen Li, Jingren Zhou","doi":"10.1007/s00778-023-00785-1","DOIUrl":"https://doi.org/10.1007/s00778-023-00785-1","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":""},"PeriodicalIF":4.2,"publicationDate":"2023-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45285784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"(p,q)-biclique counting and enumeration for large sparse bipartite graphs.","authors":"Jianye Yang, Yun Peng, Dian Ouyang, Wenjie Zhang, Xuemin Lin, Xiang Zhao","doi":"10.1007/s00778-023-00786-0","DOIUrl":"10.1007/s00778-023-00786-0","url":null,"abstract":"In this paper, we study the problem of (p, q)-biclique counting and enumeration for large sparse bipartite graphs. Given a bipartite graph G = (U, V, E) and two integer parameters p and q, we aim to efficiently count and enumerate all (p, q)-bicliques in G, where a (p, q)-biclique B(L, R) is a complete subgraph of G with L ⊆ U, R ⊆ V, |L| = p, and |R| = q. The problem of (p, q)-biclique counting and enumeration has many applications, such as graph neural network information aggregation, densest subgraph detection, and cohesive subgroup analysis. Despite the wide range of applications, to the best of our knowledge, there is no efficient and scalable solution to this problem in the literature. This problem is computationally challenging, due to the worst-case exponential number of (p, q)-bicliques. In this paper, we propose a competitive branch-and-bound baseline method, namely BCList, which explores the search space in a depth-first manner, together with a variety of pruning techniques. Although BCList offers a useful computation framework for our problem, its worst-case time complexity is exponential in p + q. To alleviate this, we propose an advanced approach, called BCList++. In particular, BCList++ applies a layer-based exploring strategy to enumerate (p, q)-bicliques by anchoring the search on either U or V only, which has a worst-case time complexity exponential in either p or q only. Consequently, a vital task is to choose a layer with the least computation cost. To this end, we develop a cost model, which is built upon an unbiased estimator for the density of the 2-hop graph induced by U or V. To improve computation efficiency, BCList++ exploits pre-allocated arrays and vertex labeling techniques such that the frequent subgraph creating operations can be substituted by array element switching operations. We conduct extensive experiments on 16 real-life datasets, and the experimental results demonstrate that BCList++ significantly outperforms the baseline methods by up to 3 orders of magnitude. We show via a case study that (p, q)-bicliques optimize the efficiency of graph neural networks. In this paper, we extend our techniques to count and enumer","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":" ","pages":"1-25"},"PeriodicalIF":2.8,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008723/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10075335","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vldb Journal | Pub Date: 2023-03-13 | DOI: 10.1007/s00778-023-00784-2
F. Schiavio, Daniele Bonetta, Walter Binder
{"title":"DynQ: a dynamic query engine with query-reuse capabilities embedded in a polyglot runtime","authors":"F. Schiavio, Daniele Bonetta, Walter Binder","doi":"10.1007/s00778-023-00784-2","DOIUrl":"https://doi.org/10.1007/s00778-023-00784-2","url":null,"abstract":"","PeriodicalId":49373,"journal":{"name":"Vldb Journal","volume":"1 1","pages":"1-25"},"PeriodicalIF":4.2,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44551203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}