{"title":"DVH: a query processing method using domain vectors and hashing","authors":"J. Gustafson, W. Perrizo, Kirk Scott, D. Thureen","doi":"10.1109/RIDE.1992.227416","DOIUrl":null,"url":null,"abstract":"The authors introduce a fast, space-efficient technique for accelerating equijoins between very large relations. The technique, called Domain Vector Hash (DVH) join Acceleration, is described and compared with three other join accelerators: hybrid-hash, join indexes, and materialized views. An analytic cost model is developed for each join method, and a detailed analytic performance comparison is made between each technique. The results show that DVH-join produces much faster joins than any of the other techniques considered, and over a considerably wider range of join selectivities and memory sizes. Moreover, the approach achieves this performance improvement while simultaneously minimizing the amount of additional join information that is cached on disk. When compared with either join indexes or materialized views, DVH typically requires from one to four orders of magnitude less cache space.<<ETX>>","PeriodicalId":138660,"journal":{"name":"[1992 Proceedings] Second International Workshop on Research Issues on Data Engineering: Transaction and Query Processing","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1992-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[1992 Proceedings] Second International Workshop on Research Issues on Data Engineering: Transaction and Query Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIDE.1992.227416","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The authors introduce a fast, space-efficient technique for accelerating equijoins between very large relations. The technique, called Domain Vector Hash (DVH) join Acceleration, is described and compared with three other join accelerators: hybrid-hash, join indexes, and materialized views. An analytic cost model is developed for each join method, and a detailed analytic performance comparison is made between each technique. The results show that DVH-join produces much faster joins than any of the other techniques considered, and over a considerably wider range of join selectivities and memory sizes. Moreover, the approach achieves this performance improvement while simultaneously minimizing the amount of additional join information that is cached on disk. When compared with either join indexes or materialized views, DVH typically requires from one to four orders of magnitude less cache space.<>