Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos
{"title":"用于近似查询回答的部分自适应索引","authors":"Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos","doi":"arxiv-2407.18702","DOIUrl":null,"url":null,"abstract":"In data exploration, users need to analyze large data files quickly, aiming\nto minimize data-to-analysis time. While recent adaptive indexing approaches\naddress this need, they are cases where demonstrate poor performance.\nParticularly, during the initial queries, in regions with a high density of\nobjects, and in very large files over commodity hardware. This work introduces\nan approach for adaptive indexing driven by both query workload and\nuser-defined accuracy constraints to support approximate query answering. The\napproach is based on partial index adaptation which reduces the costs\nassociated with reading data files and refining indexes. We leverage a\nhierarchical tile-based indexing scheme and its stored metadata to provide\nefficient query evaluation, ensuring accuracy within user-specified bounds. Our\npreliminary evaluation demonstrates improvement on query evaluation time,\nespecially during initial user exploration.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Partial Adaptive Indexing for Approximate Query Answering\",\"authors\":\"Stavros Maroulis, Nikos Bikakis, Vassilis Stamatopoulos, George Papastefanatos\",\"doi\":\"arxiv-2407.18702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In data exploration, users need to analyze large data files quickly, aiming\\nto minimize data-to-analysis time. While recent adaptive indexing approaches\\naddress this need, they are cases where demonstrate poor performance.\\nParticularly, during the initial queries, in regions with a high density of\\nobjects, and in very large files over commodity hardware. This work introduces\\nan approach for adaptive indexing driven by both query workload and\\nuser-defined accuracy constraints to support approximate query answering. The\\napproach is based on partial index adaptation which reduces the costs\\nassociated with reading data files and refining indexes. We leverage a\\nhierarchical tile-based indexing scheme and its stored metadata to provide\\nefficient query evaluation, ensuring accuracy within user-specified bounds. Our\\npreliminary evaluation demonstrates improvement on query evaluation time,\\nespecially during initial user exploration.\",\"PeriodicalId\":501123,\"journal\":{\"name\":\"arXiv - CS - Databases\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Databases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.18702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.18702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Partial Adaptive Indexing for Approximate Query Answering
In data exploration, users need to analyze large data files quickly, aiming
to minimize data-to-analysis time. While recent adaptive indexing approaches
address this need, they are cases where demonstrate poor performance.
Particularly, during the initial queries, in regions with a high density of
objects, and in very large files over commodity hardware. This work introduces
an approach for adaptive indexing driven by both query workload and
user-defined accuracy constraints to support approximate query answering. The
approach is based on partial index adaptation which reduces the costs
associated with reading data files and refining indexes. We leverage a
hierarchical tile-based indexing scheme and its stored metadata to provide
efficient query evaluation, ensuring accuracy within user-specified bounds. Our
preliminary evaluation demonstrates improvement on query evaluation time,
especially during initial user exploration.