ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582316
Holger Pirk
{"title":"Collaborative Data Science using Scalable Homoiconicity","authors":"Holger Pirk","doi":"https://dl.acm.org/doi/10.1145/3582302.3582316","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582316","url":null,"abstract":"<p>Motivation: Data science is increasingly collaborative. On the one hand, results need to be distributed, e.g., as interactive visualizations. On the other, collaboration in the data development process improves quality and timeliness. This can take many forms: partitioning a problem and working on aspects in parallel, exploring different solutions or reviewing someone else's work.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"2 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582310
Tilmann Rabl
{"title":"Reminiscences on Influential Papers","authors":"Tilmann Rabl","doi":"https://dl.acm.org/doi/10.1145/3582302.3582310","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582310","url":null,"abstract":"<p>When I started my PhD, I wanted to do something related to systems but I wasn't sure exactly what. I didn't consider data management systems initially, because I was unaware of the richness of the systems work that data management systems were build on. I thought the field was mainly about SQL. Luckily, that view changed quickly.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"1 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"PDQ 2.0: Flexible Infrastructure for Integrating Reasoning and Query Planning: ACM SIGMOD Record: Vol 51, No 4","authors":"Michael Benedikt, Fergus Cooper, Stefano Germano, Gabor Gyorkei, Efthymia Tsamoura, Brandon Moore, Camilo Ortiz","doi":"https://dl.acm.org/doi/10.1145/3582302.3582308","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582308","url":null,"abstract":"<p>Reasoning-based query planning has been explored in many contexts, including relational data integration, the SemanticWeb, and query reformulation. But infrastructure to build reasoning-based optimization in the relational context has been slow to develop. We overview PDQ 2.0, a platform supporting a number of reasoningenhanced querying tasks. We focus on a major goal of PDQ 2.0: obtaining a more modular and flexible architecture for reasoning-based query optimization.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"1 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582306
Marius Schlegel, Kai-Uwe Sattler
{"title":"Management of Machine Learning Lifecycle Artifacts: A Survey: ACM SIGMOD Record: Vol 51, No 4","authors":"Marius Schlegel, Kai-Uwe Sattler","doi":"https://dl.acm.org/doi/10.1145/3582302.3582306","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582306","url":null,"abstract":"<p>The explorative and iterative nature of developing and operating ML applications leads to a variety of artifacts, such as datasets, features, models, hyperparameters, metrics, software, configurations, and logs. In order to enable comparability, reproducibility, and traceability of these artifacts across the ML lifecycle steps and iterations, systems and tools have been developed to support their collection, storage, and management. It is often not obvious what precise functional scope such systems offer so that the comparison and the estimation of synergy effects between candidates are quite challenging. In this paper, we aim to give an overview of systems and platforms which support the management of ML lifecycle artifacts. Based on a systematic literature review, we derive assessment criteria and apply them to a representative selection of more than 60 systems and platforms.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"1 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582322
Michael Perscheid, Hasso Plattner, Daniel Ritter, Rainer Schlosser, Ralf Teusner
{"title":"Enterprise Platform and Integration Concepts Research at HPI","authors":"Michael Perscheid, Hasso Plattner, Daniel Ritter, Rainer Schlosser, Ralf Teusner","doi":"https://dl.acm.org/doi/10.1145/3582302.3582322","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582322","url":null,"abstract":"<p>The Hasso Plattner Institute (HPI), academically structured as the independent Faculty of Digital Engineering at the University of Potsdam, unites computer science research and teaching with the advantages of a privately financed institute and a tuition-free study program. Founder and namesake of the institute is the SAP co-founder Hasso Plattner, who also heads the Enterprise Platform and Integration Concepts (EPIC) research center which focuses on the technical aspects of business software with a vision to provide the fastest way to get insights out of enterprise data. Founded in 2006, the EPIC combines three research groups comprising autonomous data management, enterprise software engineering, and data-driven decision support.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"2 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582324
Sourav S. Bhowmick
{"title":"How Connected Are Our Conference Review Boards?","authors":"Sourav S. Bhowmick","doi":"https://dl.acm.org/doi/10.1145/3582302.3582324","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582324","url":null,"abstract":"<p>Dense co-authorship network formed by the review board members of a conference may adversely impact the quality and integrity of the review process. In this report, we shed light on the topological characteristics of such networks for three major data management conference venues. Our results show all these venues give rise to dense networks with a large giant component. We advocate to rethink the traditional way review boards are formed to mitigate the emergence of dense networks.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582312
Laks V.S. Lakshmanan
{"title":"Mid-Career Researcher, huh?: What just Changed?: ACM SIGMOD Record: Vol 51, No 4","authors":"Laks V.S. Lakshmanan","doi":"https://dl.acm.org/doi/10.1145/3582302.3582312","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582312","url":null,"abstract":"<p>You just got promoted to Associate Professor. Like most things in life, whether joys or sorrows, the joy of this accomplishment will not last forever. However, that doesn't mean that you should not look back and reflect on years of hard work and tenacity that you have put in which have earned you this promotion, so first of all, congratulations! Take a moment to savor this accomplishment. On the other hand, it would be a mistake to not ask the question, what just changed about me. Let's see. You now have tenure and you have been promoted to a senior rank. In one sense, that translates to less stress, but in another, you do have to wonder whether it necessarily does mean less stress. On the flip side, you should also take advantage of the opportunity to ask, what are some new freedoms I have just earned. The stress component is driven by partly knowing, but also partly being unsure of, the expectations from a newly minted Associate Professor. The freedom component stems from knowing that you are now tenured, which hopefully means that you can embark on more daring, high risk projects, even if you don't feel like you know quite how to negotiate the trade-off between risk and impact.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"1 6","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582314
Efthimia Aivaloglou, George Fletcher, Michael Liut, Daphne Miedema
{"title":"Report on the First International Workshop on Data Systems Education (DataEd '22)","authors":"Efthimia Aivaloglou, George Fletcher, Michael Liut, Daphne Miedema","doi":"https://dl.acm.org/doi/10.1145/3582302.3582314","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582314","url":null,"abstract":"<p>This report summarizes the outcomes of the first international workshop on Data Systems Education: Bridging Education Practice with Education Research (DataEd '22). The workshop was held in conjunction with the SIGMOD '22 conference in Philadelphia, USA on June 17, 2022. The aim of the workshop was to provide a dedicated venue for presenting and and discussing data management systems education experiences and research by bringing together the database and the computing education research communities to share findings, to crosspollinate perspectives and methods, and to shed light on opportunities for mutual progress in data systems education. The program featured two keynote talks, ten research paper presentations, a discussion session, and an industry panel discussion. In this report, we present the workshop's main results, observations, and emerging research directions.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2023-01-25DOI: https://dl.acm.org/doi/10.1145/3582302.3582320
Yuanyuan Tian
{"title":"The World of Graph Databases from An Industry Perspective","authors":"Yuanyuan Tian","doi":"https://dl.acm.org/doi/10.1145/3582302.3582320","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3582302.3582320","url":null,"abstract":"<p>Rapidly growing social networks and other graph data have created a high demand for graph technologies in the market. A plethora of graph databases, systems, and solutions have emerged, as a result. On the other hand, graph has long been a well studied area in the database research community. Despite the numerous surveys on various graph research topics, there is a lack of survey on graph technologies from an industry perspective. The purpose of this paper is to provide the research community with an industrial perspective on the graph database landscape, so that graph researcher can better understand the industry trend and the challenges that the industry is facing, and work on solutions to help address these problems.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"2 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
ACM SIGMOD RecordPub Date : 2022-11-21DOI: https://dl.acm.org/doi/10.1145/3572751.3572765
Oana Balmau
{"title":"Characterizing I/O in Machine Learning with MLPerf Storage","authors":"Oana Balmau","doi":"https://dl.acm.org/doi/10.1145/3572751.3572765","DOIUrl":"https://doi.org/https://dl.acm.org/doi/10.1145/3572751.3572765","url":null,"abstract":"<p>Data is the driving force behind machine learning (ML) algorithms. The way we ingest, store, and serve data can impact the performance of end-to-end training and inference significantly [11]. However, efficient storage and pre-processing of training data has received far less focus in ML compared to efforts in building specialized software frameworks and hardware accelerators. The amount of data that we produce is growing exponentially, making it expensive and difficult to keep entire training datasets in main memory. Increasingly, ML algorithms will need to access data from persistent storage in an efficient way.</p>","PeriodicalId":501169,"journal":{"name":"ACM SIGMOD Record","volume":"3 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138510314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}