Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)最新文献
Amine Mhedhbi, Matteo Lissandrini, Laurens Kuiper, Jack Waudby, Gábor Szárnyas
{"title":"LSQB","authors":"Amine Mhedhbi, Matteo Lissandrini, Laurens Kuiper, Jack Waudby, Gábor Szárnyas","doi":"10.1145/3461837.3464516","DOIUrl":"https://doi.org/10.1145/3461837.3464516","url":null,"abstract":"We introduce LSQB, a new large-scale subgraph query benchmark. LSQB tests the performance of database management systems on an important class of subgraph queries overlooked by existing benchmarks. Matching a labelled structural graph pattern, referred to as subgraph matching, is the focus of LSQB. In relational terms, the benchmark tests DBMSs' join performance as a choke-point since subgraph matching is equivalent to multi-way joins between base Vertex and base Edge tables on ID attributes. The benchmark focuses on read-heavy workloads by relying on global queries which have been ignored by prior benchmarks. Global queries, also referred to as unseeded queries, are a type of queries that are only constrained by labels on the query vertices and edges. LSQB contains a total of nine queries and leverages the LDBC social network data generator for scalability. The benchmark gained both academic and industrial interest and is used internally by 5+ different vendors.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124007850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Networked data and COVID-19","authors":"S. Scarpino","doi":"10.1145/3461837.3464688","DOIUrl":"https://doi.org/10.1145/3461837.3464688","url":null,"abstract":"The COVID-19 pandemic has upended our societies and re-shaped the way we go about our day-to-day lives---from how we work and interact to the way we buy groceries and attend school. Leveraging global data sets that represent billions of people, I will present a series of studies exploring how our behavior [2, 10], mobility patterns [6, 7], and social networks [3, 9] have altered and been altered by COVID-19 and the non-pharmaceutical interventions implemented to control its spread. Next, I will examine how we can better incorporate stochasticity and social network heterogeneity [4] and link directionality [1] into forecasting pandemic risk. With these results, I will demonstrate how the complexity of COVID-19 creates epistemological challenges associated with model identifiability [5, 8, 11]. Finally, I will discuss work by Global.health, a new collaborative network of researchers, technologists, and public health experts that has developed and built an open access platform for collecting, storing, securing, and sharing anonymized, individual-level COVID-19 data. Currently, our data includes almost 30M individual-level cases from 160 countries, which are tagged with up to 40 fields of meta-data. Writing for The New York Times Magazine, Steven Johnson said the data captured by Global.health, \"may well be the single most accurate portrait of the virus's spread through the human population in existence.\"","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131891389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Context-free path querying with all-path semantics by matrix multiplication","authors":"Rustam Azimov, I. Epelbaum, S. Grigorev","doi":"10.1145/3461837.3464513","DOIUrl":"https://doi.org/10.1145/3461837.3464513","url":null,"abstract":"Context-Free Path Querying (CFPQ) allows one to use context-free grammars as path constraints in navigational graph queries. Many algorithms for CFPQ were proposed but recently showed that the state-of-the-art CFPQ algorithms are still not performant enough for practical use. One promising way to achieve high-performance solutions for graph querying problems is to reduce them to linear algebra operations. Recently, there are two CFPQ solutions formulated in terms of linear algebra: the one based on the Boolean matrix multiplication operation proposed by Azimov et al. (2018) and the Kronecker product-based CFPQ algorithm proposed by Orachev et al. (2020). However, the algorithm based on matrix multiplication still does not support the most expressive all-path query semantics and cannot be truly compared with Kronecker product-based CFPQ algorithm. In this work, we introduce a new matrix-based CFPQ algorithm with all-path query semantics that allows us to extract all found paths for each pair of vertices. Also, we implement our algorithm by using appropriate high-performance libraries for linear algebra. Finally, we provide a comparison of the most performant linear algebra-based CFPQ algorithms for different query semantics.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134600765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"R2GSync and edge views: practical RDBMS to GDBMS synchronization","authors":"Nafisa Anzum, S. Salihoglu","doi":"10.1145/3461837.3464515","DOIUrl":"https://doi.org/10.1145/3461837.3464515","url":null,"abstract":"Graph databases that are used in enterprises are primarily extracted from a main transactional store that is often an RDBMS. This data infrastructure set up raises the challenge of keeping the extracted graph in a graph database management system (GDBMS) in sync with the source RDBMS. When the extracted graphs contain edge types that are results of join queries, this synchronization requires incrementally maintaining these join queries. In this paper, we investigate an alternative design where we can map the individual relations in these joins to virtual nodes and edges to keep the synchronization very efficient and instead support view-based querying in the GDBMS. We present a system called R2GSync, that synchronizes an RDBMS with a GDBMS and our accompanying edge view design for a GDBMS. We describe our implementation of edge views in GraphflowDB and query optimization techniques for improving the performance of queries that involve edge views.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133414098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Graph processing systems back to the future","authors":"A. Bonifati","doi":"10.1145/3461837.3464687","DOIUrl":"https://doi.org/10.1145/3461837.3464687","url":null,"abstract":"Graphs are data model abstractions that are becoming pervasive in several real-life applications and use cases. In these settings, users focus on entities and their relationships, further enhanced with multiple labels and properties to form the so called property graphs. Modern graph processing systems need to keep pace with the increasing fundamental requirements of these applications and to tackle unforeseen challenges. Motivated by a community vision on future graph processing systems [6], in this talk I will present the system challenges that are lying behind the current research topics on graph processing and graph analytics. Many current graph query engines support subsets of graph queries that they can efficiently evaluate, thus disregarding more expressive query fragments on top of property graphs [2]. It becomes crucial to address efficient query evaluation for complex graph queries, as well the extensibility of the underlying graph query and constraint languages [1, 3]. Moreover, the dynamic aspects [5] of evaluating queries on streaming graphs are equally important and need to be considered in ongoing and future benchmarking efforts [4]. The overarching goal of my talk is to touch upon our past and ongoing work on these topics and to pinpoint the research directions shaping the already bright future of graph processing systems.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128694273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Large-scale influence maximization with the influence maximization benchmarker suite","authors":"H. Geppert, Sukanya Bhowmik, K. Rothermel","doi":"10.1145/3461837.3464510","DOIUrl":"https://doi.org/10.1145/3461837.3464510","url":null,"abstract":"Maximizing the influence of a fixed size seed set in a graph was investigated intensively in the past decade. Two very relevant questions are 1) how to solve the influence maximization problem on very large graphs within short time and 2) how to compare possible findings with the current state-of-the-art in a fair manner. To solve the first problem, proxy-based influence maximization strategies emerged. However, today's graphs became too large to be solved quickly for many well-established proxy strategies, since they do not scale to such large graphs. In this paper we propose 1) a novel update scheme for iterative influence maximization strategies named Update Approximation (UA) capable of large influence spreads within a few seconds on billion-scale graphs. Further, we present 2) a generic benchmark suite (Influence Maximization Benchmarker --- IMB) to implement and evaluate influence maximization strategies, alongside with implementations for several established strategies. IMB allows for easy to use benchmarks for further research by the community.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123910117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Position paper: bitemporal dynamic graph analytics","authors":"Hassan Halawa, M. Ripeanu","doi":"10.1145/3461837.3464514","DOIUrl":"https://doi.org/10.1145/3461837.3464514","url":null,"abstract":"Most of today's graph analytics systems model static graphs and do not support business use cases that require the ability to: (i) query the dynamic graph data for a time-evolving system, (ii) carry out investigations on its historical evolution, and (iii) audit past business decisions made with potentially stale or incorrect data. This position paper presents our vision for bi-temporal dynamic graph analytics, and sketches a design for a system that efficiently supports these requirements.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121171668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Florentin Dörre, Alexander Krause, Dirk Habich, Martin Junghanns
{"title":"A GraphBLAS implementation in pure Java","authors":"Florentin Dörre, Alexander Krause, Dirk Habich, Martin Junghanns","doi":"10.1145/3461837.3464627","DOIUrl":"https://doi.org/10.1145/3461837.3464627","url":null,"abstract":"Analyzing connected data in forms of graphs is more relevant than ever. To allow users to write their own custom graph algorithms, graph computation models such as GraphBLAS have been developed. Unfortunately, the popular Java programming language was mostly neglected by existing GraphBLAS implementations so far. To overcome that issue, we present our implementation of essential GraphBLAS concepts in the Java programming language in this paper. For our purpose, we extended the linear algebra library Efficient Java Matrix Library (EJML). To show the benefits of our implementation, we compare us against existing graph algorithm libraries in Java using real world graphs and three graph algorithms.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"201 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115164907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Demystifying memory access patterns of FPGA-based graph processing accelerators","authors":"Jonas Dann, Daniel Ritter, H. Fröning","doi":"10.1145/3461837.3464512","DOIUrl":"https://doi.org/10.1145/3461837.3464512","url":null,"abstract":"Recent advances in reprogrammable hardware (e. g., FPGAs) and memory technology (e. g., DDR4, HBM) promise to solve performance problems inherent to graph processing like irregular memory access patterns on traditional hardware (e. g., CPU). While several of these graph accelerators were proposed in recent years, it remains difficult to assess their performance and compare them on common graph workloads and accelerator platforms, due to few open source implementations and excessive implementation effort. In this work, we build on a simulation environment for graph processing accelerators, to make several existing accelerator approaches comparable. This allows us to study relevant performance dimensions such as partitioning schemes and memory technology, among others. The evaluation yields insights into the strengths and weaknesses of current graph processing accelerators along these dimensions, and features a novel in-depth comparison.","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131106079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","authors":"","doi":"10.1145/3461837","DOIUrl":"https://doi.org/10.1145/3461837","url":null,"abstract":"","PeriodicalId":102703,"journal":{"name":"Proceedings of the 4th ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133032999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}