Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos
{"title":"Partial Adaptive Indexing for Approximate Query Answering","authors":"Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos","doi":"arxiv-2407.18702","DOIUrl":null,"url":null,"abstract":"In data exploration, users need to analyze large data files quickly, aiming\nto minimize data-to-analysis time. While recent adaptive indexing approaches\naddress this need, they are cases where demonstrate poor performance.\nParticularly, during the initial queries, in regions with a high density of\nobjects, and in very large files over commodity hardware. This work introduces\nan approach for adaptive indexing driven by both query workload and\nuser-defined accuracy constraints to support approximate query answering. The\napproach is based on partial index adaptation which reduces the costs\nassociated with reading data files and refining indexes. We leverage a\nhierarchical tile-based indexing scheme and its stored metadata to provide\nefficient query evaluation, ensuring accuracy within user-specified bounds. Our\npreliminary evaluation demonstrates improvement on query evaluation time,\nespecially during initial user exploration.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.18702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In data exploration, users need to analyze large data files quickly, aiming
to minimize data-to-analysis time. While recent adaptive indexing approaches
address this need, they are cases where demonstrate poor performance.
Particularly, during the initial queries, in regions with a high density of
objects, and in very large files over commodity hardware. This work introduces
an approach for adaptive indexing driven by both query workload and
user-defined accuracy constraints to support approximate query answering. The
approach is based on partial index adaptation which reduces the costs
associated with reading data files and refining indexes. We leverage a
hierarchical tile-based indexing scheme and its stored metadata to provide
efficient query evaluation, ensuring accuracy within user-specified bounds. Our
preliminary evaluation demonstrates improvement on query evaluation time,
especially during initial user exploration.