Query Optimization on Large Scale Nested Data with Service Tree and Frequent Trajectory

J. Inf. Process. Syst. Pub Date : 2021-02-09 DOI:10.3745/JIPS.04.0205

Li Wang, Guodong Wang

引用次数: 0

Abstract

Query applications based on nested data, the most commonly used form of data representation on the web, especially precise query, is becoming more extensively used. MapReduce, a distributed architecture with parallel computing power, provides a good solution for big data processing. However, in practical application, query requests are usually concurrent, which causes bottlenecks in server processing. To solve this problem, this paper first combines a column storage structure and an inverted index to build index for nested data on MapReduce. On this basis, this paper puts forward an optimization strategy which combines query execution service tree and frequent sub-query trajectory to reduce the response time of frequent queries and further improve the efficiency of multi-user concurrent queries on large scale nested data. Experiments show that this method greatly improves the efficiency of nested data query.

查看原文本刊更多论文

基于服务树和频繁轨迹的大规模嵌套数据查询优化

基于嵌套数据的查询应用程序是web上最常用的数据表示形式，尤其是精确查询的应用越来越广泛。MapReduce是一种具有并行计算能力的分布式架构，为大数据处理提供了很好的解决方案。但是，在实际应用中，查询请求通常是并发的，这会给服务器处理带来瓶颈。为了解决这个问题，本文首先结合了列存储结构和倒排索引，在MapReduce上为嵌套数据建立索引。在此基础上，本文提出了查询执行服务树与频繁子查询轨迹相结合的优化策略，以减少频繁查询的响应时间，进一步提高大规模嵌套数据上多用户并发查询的效率。实验表明，该方法大大提高了嵌套数据查询的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

J. Inf. Process. Syst.

自引率

0.00%

发文量