Baili Zhang, Le Yang, Zhikai Zhou, Hongjian Jiang, Ruizhao Liu, Jianxiong Han
{"title":"物化视图选择的预处理器","authors":"Baili Zhang, Le Yang, Zhikai Zhou, Hongjian Jiang, Ruizhao Liu, Jianxiong Han","doi":"10.1109/IC3.2018.00010","DOIUrl":null,"url":null,"abstract":"The availability and performance of data warehouse is gradually degrading with variable requirements. The set of materialized views is far from being optimal, so it is necessary to implement the dynamic adjustment to meet the demand of the users. Since the current static algorithms are not suitable for this purpose due to their larger space search and higher time consumption, this paper proposes PMVS (Preprocessor of Materialized Views Selection), which is composed of three algorithms: QSDM (Query Set Dynamic Management), CVLC (Candidate View Lattice Construction) and CVF (Candidate View Filter). Of the three algorithms, QSDM monitors the distribution of each query and determines by hypothesis testing whether the query should be added into or discarded from the query set. Based on the given query set, CVLC is in charge of producing candidate view set, which has been proven to be sufficient and necessary for selecting the best set of materialized views. As a heuristic algorithm, CVF utilizes the data sparse in multi-dimensional datasets to remove a part of candidate views, with very limited contribution to the optimal solution. The contrastive experiment indicates that PMVS can be employed by the static algorithms to reduce the amount of previous views effectively. The cost of static algorithms on space and time can be decreased to fit online demand.","PeriodicalId":236366,"journal":{"name":"2018 1st International Cognitive Cities Conference (IC3)","volume":"20 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Preprocessor of Materialized Views Selection\",\"authors\":\"Baili Zhang, Le Yang, Zhikai Zhou, Hongjian Jiang, Ruizhao Liu, Jianxiong Han\",\"doi\":\"10.1109/IC3.2018.00010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The availability and performance of data warehouse is gradually degrading with variable requirements. The set of materialized views is far from being optimal, so it is necessary to implement the dynamic adjustment to meet the demand of the users. Since the current static algorithms are not suitable for this purpose due to their larger space search and higher time consumption, this paper proposes PMVS (Preprocessor of Materialized Views Selection), which is composed of three algorithms: QSDM (Query Set Dynamic Management), CVLC (Candidate View Lattice Construction) and CVF (Candidate View Filter). Of the three algorithms, QSDM monitors the distribution of each query and determines by hypothesis testing whether the query should be added into or discarded from the query set. Based on the given query set, CVLC is in charge of producing candidate view set, which has been proven to be sufficient and necessary for selecting the best set of materialized views. As a heuristic algorithm, CVF utilizes the data sparse in multi-dimensional datasets to remove a part of candidate views, with very limited contribution to the optimal solution. The contrastive experiment indicates that PMVS can be employed by the static algorithms to reduce the amount of previous views effectively. The cost of static algorithms on space and time can be decreased to fit online demand.\",\"PeriodicalId\":236366,\"journal\":{\"name\":\"2018 1st International Cognitive Cities Conference (IC3)\",\"volume\":\"20 9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 1st International Cognitive Cities Conference (IC3)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC3.2018.00010\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 1st International Cognitive Cities Conference (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2018.00010","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
数据仓库的可用性和性能随着需求的变化而逐渐降低。物化视图集远非最优,因此有必要实现动态调整,以满足用户的需求。由于目前的静态算法搜索空间较大,耗时较长,不适合此目的,本文提出了PMVS (Preprocessor of Materialized Views Selection)算法,该算法由QSDM (Query Set Dynamic Management)、CVLC (Candidate View Lattice Construction)和CVF (Candidate View Filter)三种算法组成。在这三种算法中,QSDM监视每个查询的分布,并通过假设检验确定是否应该将查询添加到查询集中或从查询集中丢弃。基于给定的查询集,CVLC负责生成候选视图集,该候选视图集被证明是选择最佳物化视图集的充分和必要条件。CVF算法是一种启发式算法,它利用多维数据集中的数据稀疏性去除部分候选视图,对最优解的贡献非常有限。对比实验表明,静态算法可以利用PMVS有效地减少前视图的数量。静态算法在空间和时间上的成本可以降低,以适应在线需求。
The availability and performance of data warehouse is gradually degrading with variable requirements. The set of materialized views is far from being optimal, so it is necessary to implement the dynamic adjustment to meet the demand of the users. Since the current static algorithms are not suitable for this purpose due to their larger space search and higher time consumption, this paper proposes PMVS (Preprocessor of Materialized Views Selection), which is composed of three algorithms: QSDM (Query Set Dynamic Management), CVLC (Candidate View Lattice Construction) and CVF (Candidate View Filter). Of the three algorithms, QSDM monitors the distribution of each query and determines by hypothesis testing whether the query should be added into or discarded from the query set. Based on the given query set, CVLC is in charge of producing candidate view set, which has been proven to be sufficient and necessary for selecting the best set of materialized views. As a heuristic algorithm, CVF utilizes the data sparse in multi-dimensional datasets to remove a part of candidate views, with very limited contribution to the optimal solution. The contrastive experiment indicates that PMVS can be employed by the static algorithms to reduce the amount of previous views effectively. The cost of static algorithms on space and time can be decreased to fit online demand.