{"title":"最小压缩立方体:数据组织、快速计算和增量更新","authors":"Zhuo Wang, Ye Xu","doi":"10.1109/ICICSE.2008.35","DOIUrl":null,"url":null,"abstract":"The condensed cube has been proposed to reduce the huge size of data cubes in OLAP system. The intuition of condensed cube is to compress semantically redundant tuples into their representative base single tuples (BSTs). However, previous studies showed that a minimal condensed cube is expensive to compute, and thus mainly concentrated on alternative computation methods for non-minimal condensed cube, which does not guarantee to find and compress all BSTs. In this paper, we focus on the minimal condensed cube and address several practical issues, including physical organization, fast computation, and incremental update. Experiments on both synthetic and real-world datasets show that our proposed algorithms outperform previous methods by a large margin.","PeriodicalId":333889,"journal":{"name":"2008 International Conference on Internet Computing in Science and Engineering","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Minimal Condensed Cube: Data Organization, Fast Computation, and Incremental Update\",\"authors\":\"Zhuo Wang, Ye Xu\",\"doi\":\"10.1109/ICICSE.2008.35\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The condensed cube has been proposed to reduce the huge size of data cubes in OLAP system. The intuition of condensed cube is to compress semantically redundant tuples into their representative base single tuples (BSTs). However, previous studies showed that a minimal condensed cube is expensive to compute, and thus mainly concentrated on alternative computation methods for non-minimal condensed cube, which does not guarantee to find and compress all BSTs. In this paper, we focus on the minimal condensed cube and address several practical issues, including physical organization, fast computation, and incremental update. Experiments on both synthetic and real-world datasets show that our proposed algorithms outperform previous methods by a large margin.\",\"PeriodicalId\":333889,\"journal\":{\"name\":\"2008 International Conference on Internet Computing in Science and Engineering\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-01-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 International Conference on Internet Computing in Science and Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICICSE.2008.35\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 International Conference on Internet Computing in Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICSE.2008.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Minimal Condensed Cube: Data Organization, Fast Computation, and Incremental Update
The condensed cube has been proposed to reduce the huge size of data cubes in OLAP system. The intuition of condensed cube is to compress semantically redundant tuples into their representative base single tuples (BSTs). However, previous studies showed that a minimal condensed cube is expensive to compute, and thus mainly concentrated on alternative computation methods for non-minimal condensed cube, which does not guarantee to find and compress all BSTs. In this paper, we focus on the minimal condensed cube and address several practical issues, including physical organization, fast computation, and incremental update. Experiments on both synthetic and real-world datasets show that our proposed algorithms outperform previous methods by a large margin.