A storage scheme for multidimensional data alleviating dimension dependency
Authors: Teppei Shimada, T. Tsuji, K. Higuchi
Venue: 2008 Third International Conference on Digital Information Management
Publication date: 2008-11-01
DOI: 10.1109/ICDIM.2008.4746713 (https://doi.org/10.1109/ICDIM.2008.4746713)
Citations: 20
Abstract
Multidimensional arrays that store multidimensional data in MOLAP are usually very sparse. They also suffer from the problem that the time consumed in sequential access to array elements depends heavily on the dimension along which the elements are accessed. This problem of "dimension dependency" can be alleviated by dividing the whole array into a set of smaller hypercube-shaped subarrays called "chunks". The chunks are also sparse, however, and should be compressed. Unless these compressed chunks are arranged judiciously in the page buffer, they introduce further dimension dependency in accessing array elements. Differences among the dimension cardinalities can also cause dimension dependency: a slice operation along a dimension of large cardinality tends to consume much time. We alleviate these two kinds of dimension dependency by introducing the notion of an "extended chunk". Extended chunks adapt flexibly to the general situation where data densities in chunks are low and not uniformly distributed. Employing extended chunks, we propose secondary storage schemes for a multidimensional array that use a space-filling curve such as the Z-curve. The evaluation results show that the proposed storage schemes exhibit good performance while alleviating the dimension dependency.
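To make the two basic ingredients named in the abstract concrete (hypercube-shaped chunks and a Z-curve ordering), the following is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' storage scheme: the abstract does not define extended chunks, so they are not modeled here, and the function names `chunk_coords`, `morton_code`, and `z_order`, as well as the fixed `chunk_side` parameter, are hypothetical.

```python
from typing import Iterable, List, Sequence, Tuple


def chunk_coords(
    index: Sequence[int], chunk_side: int
) -> Tuple[Tuple[int, ...], Tuple[int, ...]]:
    """Split an element index into (chunk coordinate, offset inside the chunk).

    Assumes every chunk is a hypercube with `chunk_side` cells per dimension.
    """
    chunk = tuple(i // chunk_side for i in index)
    offset = tuple(i % chunk_side for i in index)
    return chunk, offset


def morton_code(coords: Sequence[int], bits: int) -> int:
    """Interleave the low `bits` bits of each coordinate into one Z-order key.

    Bit b of dimension d lands at position b * ndim + d, so no single
    dimension dominates the resulting linear order.
    """
    ndim = len(coords)
    code = 0
    for bit in range(bits):
        for dim, c in enumerate(coords):
            code |= ((c >> bit) & 1) << (bit * ndim + dim)
    return code


def z_order(chunks: Iterable[Sequence[int]], bits: int) -> List[Sequence[int]]:
    """Return chunk coordinates sorted along the Z-curve."""
    return sorted(chunks, key=lambda c: morton_code(c, bits))


if __name__ == "__main__":
    # Element (5, 9) of a 2-D array with 4 x 4 chunks.
    print(chunk_coords((5, 9), 4))
    # Visit a 2 x 2 grid of chunks in Z-curve order.
    grid = [(x, y) for x in range(2) for y in range(2)]
    print(z_order(grid, bits=1))
```

Ordering chunks by an interleaved key keeps chunks that are close in every dimension close on disk, which is the usual motivation for a space-filling curve; a row-major order would instead favor the last dimension.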