{"title":"SynopsisDB: Distributed Synopsis-based Data Processing System","authors":"Xin Zhang","doi":"10.1145/3555041.3589394","DOIUrl":null,"url":null,"abstract":"As the data volume continues to expand at an unprecedented rate, data scientists face the challenge of effectively processing and exploring vast amounts of data. To carry out tasks such as analyzing wildfire clusters, querying diverse datasets, and visualizing results with tools like IncVisage, Pangloss, Marviq, and GeoSparkViz, data scientists require data processing systems that are efficient, flexible, and capable of handling different types of queries across various data sources. Two critical features that these systems should possess are the ability to process data efficiently and handle a wide range of queries for diverse data types.","PeriodicalId":161812,"journal":{"name":"Companion of the 2023 International Conference on Management of Data","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2023 International Conference on Management of Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3555041.3589394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As the data volume continues to expand at an unprecedented rate, data scientists face the challenge of effectively processing and exploring vast amounts of data. To carry out tasks such as analyzing wildfire clusters, querying diverse datasets, and visualizing results with tools like IncVisage, Pangloss, Marviq, and GeoSparkViz, data scientists require data processing systems that are efficient, flexible, and capable of handling different types of queries across various data sources. Two critical features that these systems should possess are the ability to process data efficiently and handle a wide range of queries for diverse data types.