Distributed and Parallel Databases: Latest Articles, Page 10

Introduction to spatio-temporal data driven urban computing

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-06-19 DOI: 10.1007/s10619-020-07300-3

Shuo Shang, K. Zheng, Panos Kalnis

Citations: 1

On accurate POI recommendation via transfer learning

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-06-15 DOI: 10.1007/s10619-020-07299-7

Hao Zhang, Siyi Wei, Xiaojiao Hu, Ying Li, Jiajie Xu

Citations: 7

A framework for dependency estimation in heterogeneous data streams

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-06-06 DOI: 10.1007/s10619-020-07295-x

Edouard Fouché, Alan Mazankiewicz, Florian Kalinke, Klemens Böhm

Citations: 3

A data distribution model for RDF

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-05-16 DOI: 10.1007/s10619-020-07296-w

Rebeca Schroeder, Raqueline R. M. Penteado, Carmem S. Hara

Citations: 5

LSTM-based deep learning for spatial–temporal software testing

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-05-09 DOI: 10.1007/s10619-020-07291-1

Lei Xiao, Huai-kou Miao, Tingting Shi, Yu Hong

Citations: 7

Self-adapting data migration in the context of schema evolution in NoSQL databases

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-04-01 DOI: 10.1109/ICDEW49219.2020.00013

Andrea Hillenbrand, U. Störl, Shamil Nabiyev, Meike Klettke

Abstract: When NoSQL database systems are used in an agile software development setting, data model changes occur frequently and thus, data is routinely stored in different versions. The management of versioned data leads to an overhead potentially impeding the software development. Several data migration strategies exist that handle legacy data differently during data accesses, each of which can be characterized by certain advantages and disadvantages. Depending on the requirements for the software application, we evaluate and compare different migration strategies through metrics like migration costs and latency as well as precision and recall. Ideally, exactly that strategy should be selected whose characteristics fulfill service-level agreements and match the migration scenario, which depends on the query workload and the changes in the data model which imply an evolution of the database schema. In this paper, we present a methodology of self-adapting data migration, which automatically adjusts migration strategies and their parameters with respect to the migration scenario and service-level agreements, thereby contributing to the self-management of database systems and supporting agile development.

Vol. 40 (1), pp. 5-25

Citations: 13

Selective caching: a persistent memory approach for multi-dimensional index structures

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-04-01 DOI: 10.1109/ICDEW49219.2020.00010

M. Jibril, Philipp Götze, David Broneske, K. Sattler

Abstract: After the introduction of Persistent Memory in the form of Intel’s Optane DC Persistent Memory on the market in 2019, it has found its way into manifold applications and systems. As Google and other cloud infrastructure providers are starting to incorporate Persistent Memory into their portfolio, it is only logical that cloud applications have to exploit its inherent properties. Persistent Memory can serve as a DRAM substitute, but guarantees persistence at the cost of compromised read/write performance compared to standard DRAM. These properties particularly affect the performance of index structures, since they are subject to frequent updates and queries. However, adapting each and every index structure to exploit the properties of Persistent Memory is tedious. Hence, we require a general technique that hides this access gap, e.g., by using DRAM caching strategies. To exploit Persistent Memory properties for analytical index structures, we propose selective caching. It is based on a mixture of dynamic and static caching of tree nodes in DRAM to reach near-DRAM access speeds for index structures. In this paper, we evaluate selective caching on the OLAP-optimized main-memory index structure Elf, because its memory layout allows for an easy caching. Our experiments show that if configured well, selective caching with a suitable replacement strategy can keep pace with pure DRAM storage of Elf while guaranteeing persistence. These results are also reflected when selective caching is used for parallel workloads.

Vol. 40 (1), pp. 47-66

Citations: 3

On the necessity of explicit cross-layer data formats in near-data processing systems

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-04-01 DOI: 10.1109/ICDEW49219.2020.00009

Tobias Vinçon, Arthur Bernhardt, Lukas Weber, A. Koch, Ilia Petrov

Citations: 6

A gray-box modeling methodology for runtime prediction of Apache Spark jobs

IF 1.2, Zone 4 (Computer Science)

Distributed and Parallel Databases Pub Date : 2020-03-10 DOI: 10.1007/s10619-020-07286-y

Hani Al-Sayeh, Stefan Hagedorn, K. Sattler

Citations: 12

Multi-objective spatial keyword query with semantics: a distance-owner based approach