Sachin Basil John, P. Lindner, Zhekai Jiang, Christoph E. Koch
{"title":"使用数独数据立方体引擎聚合和探索高维数据","authors":"Sachin Basil John, P. Lindner, Zhekai Jiang, Christoph E. Koch","doi":"10.1145/3555041.3589729","DOIUrl":null,"url":null,"abstract":"We present Sudokube, a novel system that supports interactive speed querying on high-dimensional data using partially materialized data cubes. Given a storage budget, it judiciously chooses what projections to precompute and materialize during cube construction time. Then, at query time, it uses whatever information is available from the materialized projections and extrapolates missing information to approximate query results. Thus, Sudokube avoids costly projections at query time while also avoiding the astronomical compute and storage requirements needed for fully materialized high-dimensional data cubes. In this paper, we show the capabilities of the Sudokube system and how it approximates query results using different techniques and materialization strategies.","PeriodicalId":161812,"journal":{"name":"Companion of the 2023 International Conference on Management of Data","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Aggregation and Exploration of High-Dimensional Data Using the Sudokube Data Cube Engine\",\"authors\":\"Sachin Basil John, P. Lindner, Zhekai Jiang, Christoph E. Koch\",\"doi\":\"10.1145/3555041.3589729\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present Sudokube, a novel system that supports interactive speed querying on high-dimensional data using partially materialized data cubes. Given a storage budget, it judiciously chooses what projections to precompute and materialize during cube construction time. Then, at query time, it uses whatever information is available from the materialized projections and extrapolates missing information to approximate query results. Thus, Sudokube avoids costly projections at query time while also avoiding the astronomical compute and storage requirements needed for fully materialized high-dimensional data cubes. In this paper, we show the capabilities of the Sudokube system and how it approximates query results using different techniques and materialization strategies.\",\"PeriodicalId\":161812,\"journal\":{\"name\":\"Companion of the 2023 International Conference on Management of Data\",\"volume\":\"84 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Companion of the 2023 International Conference on Management of Data\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3555041.3589729\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2023 International Conference on Management of Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3555041.3589729","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Aggregation and Exploration of High-Dimensional Data Using the Sudokube Data Cube Engine
We present Sudokube, a novel system that supports interactive speed querying on high-dimensional data using partially materialized data cubes. Given a storage budget, it judiciously chooses what projections to precompute and materialize during cube construction time. Then, at query time, it uses whatever information is available from the materialized projections and extrapolates missing information to approximate query results. Thus, Sudokube avoids costly projections at query time while also avoiding the astronomical compute and storage requirements needed for fully materialized high-dimensional data cubes. In this paper, we show the capabilities of the Sudokube system and how it approximates query results using different techniques and materialization strategies.