A 2/sup d/-tree-based blocking method for microaggregating very large data sets

First International Conference on Availability, Reliability and Security (ARES'06) Pub Date : 2006-04-20 DOI:10.1109/ARES.2006.1

A. Solanas, A. Martmez-Balleste, J. Domingo-Ferrer, J. M. Mateo-Sanz

引用次数: 15

Abstract

Blocking is a well-known technique used to partition a set of records into several subsets of manageable size. The standard approach to blocking is to split the records according to the values of one or several attributes (called blocking attributes). This paper presents a new blocking method based on 2/sup d/-trees for intelligently partitioning very large data sets for micro aggregation. A number of experiments has been carried out in order to compare our method with the most typical univariate one.

查看原文本刊更多论文

一种基于2/sup /-tree的阻塞方法，用于微聚合非常大的数据集

阻塞是一种众所周知的技术，用于将一组记录划分为几个可管理大小的子集。阻塞的标准方法是根据一个或多个属性(称为阻塞属性)的值拆分记录。本文提出了一种基于2/sup /-树的块化方法，用于对超大数据集进行微聚合智能分区。为了将我们的方法与最典型的单变量方法进行比较，进行了许多实验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

First International Conference on Availability, Reliability and Security (ARES'06)

自引率

0.00%

发文量