Genetic algorithm for sampling from scale-free data and networks

Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation Pub Date : 2014-07-12 DOI:10.1145/2576768.2598391

P. Krömer, J. Platoš

引用次数: 9

Abstract

A variety of real-world data and networks can be described by a heavy-tailed probability distribution of its values, vertex degrees, or other significant properties, that follows the power law. Such a scale-free data and networks can be found in both natural phenomena such as protein interaction networks and gene regulation networks and man-made structures like the Internet, language, and various social networks. An efficient analysis of large scale data and networks is often impractical and various heuristic and metaheuristc sampling techniques are deployed to select smaller subsets of the data for analysis and visualisation. A key goal of data and network sampling is to select such a subset of the original data that would accurately represent the original data with respect to selected attributes. In this work we propose a novel genetic algorithm for scale-free data and network sampling and evaluate the algorithm in a series of computational experiments.

查看原文本刊更多论文

无标度数据和网络采样的遗传算法

各种现实世界的数据和网络可以通过其值、顶点度或其他重要属性的重尾概率分布来描述，该分布遵循幂律。这种无标度的数据和网络既存在于蛋白质相互作用网络、基因调控网络等自然现象中，也存在于互联网、语言和各种社会网络等人工结构中。大规模数据和网络的有效分析通常是不切实际的，各种启发式和元启发式采样技术被部署来选择数据的较小子集进行分析和可视化。数据和网络采样的一个关键目标是选择原始数据的一个子集，该子集将根据所选属性准确地表示原始数据。在这项工作中，我们提出了一种新的无标度数据和网络采样的遗传算法，并在一系列的计算实验中对该算法进行了评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation

自引率

0.00%

发文量